Skip to main content

Amazon Bedrock AgentCore

Amazon Bedrock AgentCore Pricing

Tailor AgentCore to your needs—mix and match services, use them independently or together, and pay for what you use as your AI initiatives grow.

Pay only for what you use

Amazon Bedrock AgentCore offers flexible, consumption-based pricing with no upfront commitments or minimum fees. Each feature can be used independently or together, and you pay only for what you use. This modular approach allows you to start small and scale as your agent applications grow.

Get started with AgentCore at no cost - new AWS customers receive up to $200 in Free Tier credits. Explore AWS Free Tier benefits and start building today.

Note: There is no charge for using AWS Agent Registry during the preview period.

AgentCore features

Open all

    AgentCore Runtime is a secure, serverless runtime purpose-built for deploying and scaling agents and tools. Choose between direct code deployment for rapid iteration or container-based deployment for maximum control.

    You only pay for the active resources you consume 
    Unlike traditional compute services that charge for pre-allocated resources (i.e., fixed instance size and cost per second while hosting the agent), with AgentCore Runtime you only pay for active resource consumption. This delivers substantial cost savings for agentic workloads, which typically spend 30-70% of time in I/O wait (waiting for LLM responses, tool / API calls, or database queries). With pre-allocated pricing, you would pay for idle CPU during these wait periods. With the active resource consumption-based pricing in AgentCore Runtime, I/O wait and idle time is free, if no other background process is running.

    Billing is based on CPU and memory consumption across your session lifetime, calculated at per-second increments. For CPU resources, you are charged based on actual consumption - if your agent consumes no CPU during I/O wait, there are no CPU charges. For memory resources, you're charged for the peak memory consumed up to that second.

    Key details:

    • No upfront resource selection required
    • Billing is calculated per second, using actual CPU consumption and peak memory consumed up to that second, with a 1-second minimum
    • You only pay for actual resource consumption during your session, which spans from microVM boot, initialization, active processing, idle periods, until session termination (microVM shutdown)
    • Billing includes system overhead in addition to your application's resource usage
    • 128MB minimum memory billing applies for memory
    • Storage costs: Container deployment requires ECR storage (billed separately). Direct code deployment bills you for the size of the code artifacts you deployed at S3 Standard rates. The managed session storage (currently in public preview) is an early version of the S3-backed storage; we will come out with changes including pricing to that experience before GA.
    • Network data transfer charges apply at standard EC2 rates

    AgentCore Browser provides a fast, secure, cloud-based browser runtime to enable agents to interact with websites at scale. 

    You only pay for the active resources you consume
    Unlike traditional compute services that charge for pre-allocated resources (i.e., fixed instance size and cost per second while hosting the agent), with AgentCore Browser you only pay for active resource consumption. This delivers substantial cost savings for agentic workloads, which typically spend 30-70% of time in I/O wait (waiting for LLM responses, tool / API calls, or database queries). With pre-allocated pricing, you would pay for idle CPU during these wait periods. With the active resource consumption-based pricing in AgentCore Browser, I/O wait and idle time is free, if no other background process is running.

    Billing is based on CPU and memory consumption across your session lifetime, calculated at per-second increments. For CPU resources, you are charged based on actual consumption - if your agent consumes no CPU during I/O wait, there are no CPU charges. For memory resources, you're charged for the peak memory consumed up to that second.

    Key details:

    • No upfront resource selection required
    • Billing is calculated per second, using actual CPU consumption and peak memory consumed up to that second, with a 1-second minimum
    • You only pay for actual resource consumption during your session, which spans from microVM boot, initialization, active processing, idle periods, until session termination (microVM shutdown)
    • Billing includes system overhead in addition to your application's resource usage
    • 128MB minimum memory billing applies for memory
    • Storage costs: Browser Profiles require Amazon S3 storage for storing profile artifacts (cookies, local storage) and you will get billed at Amazon S3 Standard rates, starting April 15, 2026.
    • Network data transfer charges apply at standard EC2 rates.

    AgentCore Code Interpreter enables agents execute code securely in sandbox environments, enhancing their accuracy and expanding their ability to solve complex end-to-end tasks. 

    You only pay for the active resources you consume
    Unlike traditional compute services that charge for pre-allocated resources (i.e., fixed instance size and cost per second while hosting the agent), with AgentCore Code Interpreter you only pay for active resource consumption. This delivers substantial cost savings for agentic workloads, which typically spend 30-70% of time in I/O wait (waiting for LLM responses, tool / API calls, or database queries). With pre-allocated pricing, you would pay for idle CPU during these wait periods. With the active resource consumption-based pricing in AgentCore Code Interpreter, I/O wait and idle time is free, if no other background process is running.

    Billing is based on CPU and memory consumption across your session lifetime, calculated at per-second increments. For CPU resources, you are charged based on actual consumption - if your agent consumes no CPU during I/O wait, there are no CPU charges. For memory resources, you're charged for the peak memory consumed up to that second.

    Key details:

    • No upfront resource selection required
    • Billing is calculated per second, using actual CPU consumption and peak memory consumed up to that second, with a 1-second minimum
    • You only pay for actual resource consumption during your session, which spans from microVM boot, initialization, active processing, idle periods, until session termination (microVM shutdown)
    • Billing includes system overhead in addition to your application's resource usage
    • 128MB minimum memory billing applies for memory
    • Network data transfer charges apply at standard EC2 rates


    Amazon Bedrock AgentCore Gateway enables agents to securely access tools by transforming APIs and Lambda functions into agent-compatible tools and connecting to existing MCP servers.

    Consumption-based pricing
    You pay only for the API calls your agents make through Gateway. You're charged based on the number of MCP operations (such as ListTools, CallTool, and Ping), search queries, and tools indexed for semantic search functionality.

    Key details:

    • No upfront costs or minimum commitments required
    • Network data transfer charges apply at standard EC2 rates

    Policy in Amazon Bedrock AgentCore gives you comprehensive control over actions agents take, helping ensure agents stay within defined boundaries without slowing down.

    Consumption-based pricing

    You only pay for the authorization requests performed during agent execution. Each time an agent calls a tool through AgentCore Gateway, Policy checks the action against your rules to determine whether it is allowed or denied.

    In addition, Policy offers natural language policy authoring, which lets you create Cedar policies using simple natural language descriptions. You are charged per 1,000 user input tokens processed when converting natural language into Cedar policy statements.

    AgentCore Identity simplifies agent identity and access management and allows your agents to securely access AWS resources and third-party tools and services on behalf of users or by themselves with pre-authorized user consent.

    Consumption-based pricing
    Customers who use AgentCore Identity through either AgentCore Runtime or AgentCore Gateway, do not incur any additional charges for their use of AgentCore Identity. For all other scenarios, you pay for only what you use and are charged based on the number of requests from the agent to AgentCore Identity for an OAuth token or an API key.

    Key details:

    • No minimum fees and no upfront commitments
    • Billing is calculated per successful OAuth token or API key requested to perform a task requiring authorization for a non-AWS resource
    • No additional charges incurred when customers use AgentCore Identity through AgentCore Runtime or AgentCore Gateway 

    AgentCore Memory makes it easy for developers to build context-aware agents by eliminating complex memory infrastructure management while providing full control over what the agent remembers.

    Consumption-based pricing
    You only pay for what you use. Our pricing is simple and usage-based, aligning directly with how your agents create value:

    1. Short-term memory is priced based on the number of raw events created, giving you predictable costs for in-session context.
    2. Long-term memory records is priced based on the number of memories processed and stored each month and the number of memory record retrieval calls, so you only pay when your agents store and use processed knowledge.
    3. To extract long-term memory from raw events, you can choose between built-in memory strategies, which include automatic processing, or more configurable memory strategies that run in your account using your choice of model and prompt.

    Key details:

    • No upfront resource selection required
    • For short-term memory, billing is calculated per create event request
    • For long-term memory storage, billing is calculated per stored memory record per day and averaged across the month
    • For long-term memory retrieval, billing is calculated per retrieve memory request

    AgentCore Observability gives developers complete visibility into agent workflows to trace, debug, and monitor agents' performance in production environments.

    Consumption-based pricing
    You pay as you go for telemetry generated, stored, and queried for your agents. The telemetry data is ingested and stored in your Amazon CloudWatch account. You are charged for data ingestion and storage, queries to retrieve and analyze information, and masking of sensitive/Personally Identifiable Information (PII) data in logs. To review pricing details visit Amazon CloudWatch pricing page

    AgentCore Evaluations helps continuously inspect agent quality based on real-world behavior. Teams can perform agentic evaluations using 13 built-in evaluators on common quality dimensions or create custom evaluators for specific business requirements. The results are integrated into AgentCore Observability powered by Amazon CloudWatch for unified monitoring.

    Consumption based pricing
    You pay for what you use. For built-in evaluators, pricing is charged by AgentCore based on input and output tokens processed during evaluation. For custom evaluations using your own LLM infrastructure, you pay per evaluation performed, with separate inference costs based on the model used.

    Key details:

    • No upfront commitments or minimum fees required
    • Includes CI/CD integration with configurable quality thresholds
    • Production monitoring with sampling rules and dashboard aggregation
    • Cost control through percentage-based sampling, conditional sampling, and selective metric monitoring
    • Model usage costs are included for built-in evaluators - no separate model charges
    • Custom evaluations incur additional model usage charges in your account

    The AWS Agent Registry provides a centralized catalog for organizing, curating, and discovering resources across your organization. With registry, you can publish MCP servers, agents, agent skills, and custom resources into a searchable registry, control access through an approval workflow, and enable both human users and AI agents to discover the right tools and agents using semantic and keyword search.

    Consumption based Pricing with Free Tier
    You only pay for what you use. Pricing for registry is based on the number of records you have added into your registry, and the number of API calls (Search, List, and Get) you make to discover resources added into your registry. Registry comes with a Free Tier, where every month your first 5,000 records, first 1,000,000 Search API calls, and first 2,000,000 combined Get and List API calls are free of charge. You are charged only for usage above these thresholds.

    Key details:

    • No upfront commitments or minimum fees required
    • You pay only for what you use in terms of Records added in the Registry, and Search, List and Get calls made on Records in the Registry
    • For Records, you are only charged for ‘Net Records’ in the Registry present at any given point of time. If you add and then delete a record, it no longer counts towards your Net total number of records

Pricing Table

Service/Capability
Type
Resource
Price
Runtime

Active Consumption Based

Active Consumption Based

CPU

Memory

$0.0895 per vCPU-hour

$0.00945 per GB-hour

Browser Tool

Active Consumption Based

Active Consumption Based

CPU

Memory

$0.0895 per vCPU-hour

$0.00945 per GB-hour

Code Interpreter

Active Consumption Based

Active Consumption Based

CPU

Memory

$0.0895 per vCPU-hour

$0.00945 per GB-hour

Gateway

Consumption Based

Consumption Based

Consumption Based

API Invocations (ListTools, InvokeTool, Ping)

Search API

Tool Indexing

$0.005 per 1,000 invocations

$0.025 per 1,000 invocations

$0.02 per 100 tools indexed per month

Identity

Consumption Based

Token or API key requests for non-AWS resources

$0.010 per 1,000 token or API keys requested by the agent

(Note: AgentCore Identity is available at no additional charge to customers when they use it through either AgentCore Runtime or AgentCore Gateway)

Memory

Consumption Based

Consumption Based

Consumption Based

Short-Term Memory

Long-Term Memory Storage

Long-Term Memory Retrieval

$0.25 per 1,000 new events

Using built-in memory strategies: $0.75 per 1000 memory records stored per month

Using built-in with override or self-managed memory strategies: $0.25 per 1000 memory records stored per month*

$0.50 per 1000 memory record retrievals

Observability

Consumption Based





 

Spans, logs, and metrics

Charged as per Amazon CloudWatch pricing

Evaluations

Consumption Based
 

Consumption Based
 

Built-in evaluators
 

Custom evaluators

$0.0024 per 1000 input tokens
$0.012 per 1000 output tokens

$1.50 per 1000 evaluations (model usage charges billed seperately)

Policy

Consumption Based

Consumption Based

Authorization Request

Input Tokens Processed

$0.000025 per authorization request

$0.13 per 1000 Input Tokens

AWS Agent Registry (Preview)

Consumption Based

Consumption Based

Consumption Based

Registry Records

Search API Invocation

List and Get API Invocations

First 5,000 records free monthly, then $0.400 per 1,000 records

First 1,000,000 invocations free monthly, then $0.020 per 1,000 invocations

 

First 2,000,000 combined invocations free monthly, then $0.004 per 1,000 invocations

*For built-in with override and self-managed strategies, you may incur additional charges for the model usage in your account

I want to

Did you find what you were looking for today?

Let us know so we can improve the quality of the content on our pages