Pricing for AI features
Learn about pricing for AI features, such as Agent Runners and AI Gateway.
AI inference as a usage meter
AI inference is a usage meter that measures the costs of using AI models and agents on Netlify. This usage meter calculates costs for using Agent Runners and AI Gateway.
AI inference is one of several usage meters, such as compute, bandwidth, and web requests, that convert usage to Netlify credits.
Note that the “Why did it fail?” feature for troubleshooting failed deploys does not generate AI inference costs. However, if you use an Agent Runner to fix a failed deploy, that agent run does incur AI inference usage and costs.
How costs are calculated
AI inference costs are calculated based on your AI model usage.
AI inference is measured using the rates set by AI model providers, which price token usage in USD. Netlify then converts every $1 USD spent on AI model usage to 180 Netlify credits.
This rate is designed to make using AI features like Agent Runners and AI Gateway competitive with using AI providers without Netlify.
Feature/resource | Unit | Credits Consumed | What this covers |
---|---|---|---|
AI inference | AI model usage in USD (US Dollars) | 180 credits per $1 USD | Agent Runners and AI Gateway |
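As a worked instance of the conversion in the table above (the $2.50 spend is an illustrative figure, not one taken from Netlify's docs):

```latex
\[
\text{credits} = 180 \times \text{cost in USD},
\qquad \text{for example } 180 \times 2.50 = 450 \text{ credits}
\]
```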
Pricing for Agent Runners
The cost of using Agent Runners is calculated using AI inference and compute as usage meters.
AI inference costs are determined by the AI model or models used in your agent runs. When you choose an agent for an agent run, that agent picks an AI model based on the task type and complexity. To learn more about how AI models are priced, check out our AI model pricing rates table.
Agent Runners also use compute resources. Learn more about how compute is calculated in our how credits work docs.
Pricing for AI Gateway (Beta)
The cost of using AI Gateway is calculated using only the AI inference usage meter.
When you use AI Gateway, you can choose a specific AI model to use for your requests.
How costs are calculated
For each request to AI Gateway that completes successfully, the cost is calculated as follows:
- Your account is billed for the number of tokens actually used in that request.
- The token count for each token type (input, output, cache reads, and cache writes) is converted to a US dollar amount, using the rates listed below.
- The exact total dollar value for the request is converted to credits. Each dollar equals 180 credits.
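The steps above can be sketched in a few lines of Python. This is a minimal illustration, not Netlify's billing code: the `request_cost` function and the token-type keys (`input`, `cache_read`, `output`, `cache_write`) are hypothetical names chosen for the example, and the token counts are made up. The rates are the ones listed for claude-sonnet-4-20250514 in the pricing table, and `Decimal` is used so the dollar value stays exact. Any rounding Netlify applies is not described here, so none is performed.

```python
from decimal import Decimal

CREDITS_PER_USD = Decimal("180")        # conversion rate stated in this doc
TOKENS_PER_UNIT = Decimal("1000000")    # rates are quoted per 1 million tokens


def request_cost(tokens: dict[str, int], rates_per_million: dict[str, str]) -> tuple[Decimal, Decimal]:
    """Return (usd, credits) for one request.

    Hypothetical helper: key names are assumptions made for this sketch.
    """
    usd = Decimal("0")
    for token_type, count in tokens.items():
        if count == 0:
            continue  # unused token types cost nothing
        rate = rates_per_million.get(token_type)
        if rate is None:
            raise ValueError(f"no rate listed for token type {token_type!r}")
        usd += Decimal(count) * Decimal(rate) / TOKENS_PER_UNIT
    return usd, usd * CREDITS_PER_USD


# Rates for claude-sonnet-4-20250514, copied from the pricing table (USD per 1M tokens).
sonnet_rates = {"input": "3.00", "cache_read": "0.300", "output": "15.00", "cache_write": "3.75"}

# Made-up token counts for a single request.
usage = {"input": 12_000, "cache_read": 40_000, "output": 1_500, "cache_write": 0}

usd, credits = request_cost(usage, sonnet_rates)
print(f"USD: {usd}, credits: {credits}")
```

With these example numbers the request costs $0.0705, which converts to 12.69 credits.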
There are no special credits for the AI Gateway. AI Gateway uses the same pool of credits included in your Netlify pricing plan, or any add-on credits you purchase.
To estimate your costs, you can check the Pricing per AI model table or explore our pricing calculator.
AI providers and models
AI inference supports the following AI providers and models.
If you are using AI Gateway, you can choose a specific AI model to use for your requests.
If you are using Agent Runners, your AI agent will choose which AI models to use on your behalf to optimize performance and cost based on task type and complexity.
AI Provider | Model |
---|---|
OpenAI | gpt-5 |
OpenAI | gpt-5-codex |
OpenAI | gpt-5-mini |
OpenAI | gpt-5-nano |
OpenAI | gpt-4.1 |
OpenAI | gpt-4.1-mini |
OpenAI | gpt-4.1-nano |
OpenAI | gpt-4o |
OpenAI | gpt-4o-mini |
OpenAI | o4-mini |
OpenAI | o3 |
OpenAI | o3-mini |
OpenAI | codex-mini-latest |
Anthropic | claude-opus-4-1-20250805 |
Anthropic | claude-opus-4-20250514 |
Anthropic | claude-sonnet-4-5-20250929 |
Anthropic | claude-sonnet-4-20250514 |
Anthropic | claude-3-7-sonnet-20250219 |
Anthropic | claude-3-7-sonnet-latest |
Anthropic | claude-3-5-haiku-20241022 |
Anthropic | claude-3-5-haiku-latest |
Anthropic | claude-3-haiku-20240307 |
Google | gemini-2.5-pro |
Google | gemini-flash-latest |
Google | gemini-2.5-flash |
Google | gemini-2.5-flash-preview-09-2025 |
Google | gemini-flash-lite-latest |
Google | gemini-2.5-flash-lite |
Google | gemini-2.5-flash-lite-preview-09-2025 |
Google | gemini-2.5-flash-image-preview |
Google | gemini-2.0-flash |
Google | gemini-2.0-flash-lite |
You can also programmatically access the up-to-date list in JSON format via a public API endpoint: https://api.netlify.com/api/v1/ai-gateway/providers
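A minimal sketch of reading that endpoint from Python using only the standard library. The response schema is not described on this page, so the hypothetical `fetch_providers` helper simply returns whatever JSON the endpoint sends back, without assuming any field names. The `opener` parameter is an assumption added for the example so the function can be exercised without network access.

```python
import json
import urllib.request

PROVIDERS_URL = "https://api.netlify.com/api/v1/ai-gateway/providers"


def fetch_providers(url: str = PROVIDERS_URL, opener=urllib.request.urlopen):
    """Fetch the endpoint and return the parsed JSON body unchanged.

    Hypothetical helper: no assumptions are made about the response shape.
    """
    with opener(url) as response:
        return json.loads(response.read().decode("utf-8"))


# Example call (performs a real HTTP GET, so it is left commented out):
# print(json.dumps(fetch_providers(), indent=2))
```

Inspect the returned structure before relying on specific keys, since the format may change.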
Pricing per AI model
The following table shows the current rates for AI tokens in USD (US dollars) for different AI providers before costs are converted to Netlify credits.
AI model cost per 1 million tokens
Provider | Model | Input Tokens | Cache read | Output Tokens | Cache write |
---|---|---|---|---|---|
OpenAI | gpt-5 | $1.25 | $0.125 | $10.00 | N/A |
OpenAI | gpt-5-codex | $1.25 | $0.125 | $10.00 | N/A |
OpenAI | gpt-5-mini | $0.25 | $0.025 | $2.00 | N/A |
OpenAI | gpt-5-nano | $0.05 | $0.005 | $0.40 | N/A |
OpenAI | gpt-4.1 | $2.00 | $0.500 | $8.00 | N/A |
OpenAI | gpt-4.1-mini | $0.40 | $0.100 | $1.60 | N/A |
OpenAI | gpt-4.1-nano | $0.10 | $0.025 | $0.40 | N/A |
OpenAI | gpt-4o | $2.50 | $1.250 | $10.00 | N/A |
OpenAI | gpt-4o-mini | $0.15 | $0.075 | $0.60 | N/A |
OpenAI | o4-mini | $1.10 | $0.275 | $4.40 | N/A |
OpenAI | o3 | $2.00 | $0.500 | $8.00 | N/A |
OpenAI | o3-mini | $1.10 | $0.550 | $4.40 | N/A |
OpenAI | codex-mini-latest | $1.50 | $0.375 | $6.00 | N/A |
Anthropic | claude-opus-4-1-20250805 | $15.00 | $1.500 | $75.00 | $18.75 |
Anthropic | claude-opus-4-20250514 | $15.00 | $1.500 | $75.00 | $18.75 |
Anthropic | claude-sonnet-4-5-20250929 | $3.00 | $0.300 | $15.00 | $3.75 |
Anthropic | claude-sonnet-4-20250514 | $3.00 | $0.300 | $15.00 | $3.75 |
Anthropic | claude-3-7-sonnet-20250219 | $3.00 | $0.300 | $15.00 | $3.75 |
Anthropic | claude-3-7-sonnet-latest | $3.00 | $0.300 | $15.00 | $3.75 |
Anthropic | claude-3-5-haiku-20241022 | $0.80 | $0.080 | $4.00 | $1.00 |
Anthropic | claude-3-5-haiku-latest | $0.80 | $0.080 | $4.00 | $1.00 |
Anthropic | claude-3-haiku-20240307 | $0.25 | $0.030 | $1.25 | $0.30 |
Google | gemini-2.5-pro | $1.25 | $0.310 | $10.00 | N/A |
Google | gemini-2.5-flash | $0.30 | $0.075 | $2.50 | N/A |
Google | gemini-2.5-flash-preview-09-2025 | $0.30 | $0.075 | $2.50 | N/A |
Google | gemini-flash-latest | $0.30 | $0.075 | $2.50 | N/A |
Google | gemini-2.5-flash-lite | $0.10 | $0.025 | $0.40 | N/A |
Google | gemini-2.5-flash-lite-preview-09-2025 | $0.10 | $0.025 | $0.40 | N/A |
Google | gemini-flash-lite-latest | $0.10 | $0.025 | $0.40 | N/A |
Google | gemini-2.5-flash-image-preview | $0.30 | N/A | $30.00 | N/A |
Google | gemini-2.0-flash | $0.10 | $0.025 | $0.40 | N/A |
Google | gemini-2.0-flash-lite | $0.075 | N/A | $0.30 | N/A |
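To compare models for a given workload, the table can be turned into a small lookup. The sketch below copies three rows from the table and estimates credits for a made-up workload of 2 million input tokens and 500,000 output tokens. The `estimate_credits` function is a hypothetical helper. Because this page does not say how an N/A rate is billed, the sketch raises an error rather than guessing when tokens are used against an N/A rate.

```python
CREDITS_PER_USD = 180  # conversion rate stated in this doc

# USD per 1M tokens as (input, cache read, output, cache write).
# None stands for "N/A" in the table. Rows copied from the table above.
RATES = {
    "gpt-5-mini": (0.25, 0.025, 2.00, None),
    "claude-3-5-haiku-20241022": (0.80, 0.080, 4.00, 1.00),
    "gemini-2.5-flash": (0.30, 0.075, 2.50, None),
}


def estimate_credits(model, input_tokens, output_tokens,
                     cache_read_tokens=0, cache_write_tokens=0):
    """Estimate Netlify credits for a workload on one model (hypothetical helper)."""
    counts = (input_tokens, cache_read_tokens, output_tokens, cache_write_tokens)
    usd = 0.0
    for count, rate in zip(counts, RATES[model]):
        if count == 0:
            continue
        if rate is None:
            # Billing for N/A token types is not described here, so refuse to guess.
            raise ValueError(f"{model} has no listed rate for a token type in use")
        usd += count * rate / 1_000_000
    return usd * CREDITS_PER_USD


for model in RATES:
    credits = estimate_credits(model, input_tokens=2_000_000, output_tokens=500_000)
    print(f"{model}: about {credits:.0f} credits")
```

For this example workload the estimates come to roughly 270 credits for gpt-5-mini, 648 for claude-3-5-haiku-20241022, and 333 for gemini-2.5-flash. Actual usage depends on real token counts, including cache reads and writes.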