Cost-effective Solutions

Pricing

Pricing

Our frontier models are designed to spend more time thinking before producing a response, making them ideal for complex, multi-step problems.

GPT-5.4

Our most capable model for professional work

Price

Input:

$2.50 / 1M tokens

Cached input:

$0.25 / 1M tokens

Output:

$15.00 / 1M tokens

GPT-5 mini

A faster, cheaper version of GPT-5 for well-defined tasks

Price

Input:

$0.250 / 1M tokens

Cached input:

$0.025 / 1M tokens

Output:

$2.000 / 1M tokens

Customize our models to get even higher performance for your specific use cases.

GPT-4.1

Fine-tuning price

Input:

$3.00 / 1M tokens

Cached input:

$0.75 / 1M tokens

Output:

$12.00 / 1M tokens

Training:

$25.00 / 1M tokens

GPT-4.1 mini

Fine-tuning price

Input:

$0.80 / 1M tokens

Cached input:

$0.20 / 1M tokens

Output:

$3.20 / 1M tokens

Training:

$5.00 / 1M tokens

GPT-4.1 nano

Fine-tuning price

Input:

$0.20 / 1M tokens

Cached input:

$0.05 / 1M tokens

Output:

$0.80 / 1M tokens

Training:

$1.50 / 1M tokens

o4-mini

Reinforcement fine-tuning price

Input:

$4.00 / 1M tokens

Cached input:

$1.00 / 1M tokens

Output:

$16.00 / 1M tokens

Training:

$100.00 / training hour

Realtime API

Build low-latency, multimodal experiences including speech-to-speech.

Text

gpt-realtime-1.5

$4.00 / 1M input tokens

$0.40 / 1M cached input tokens

$16.00 / 1M output tokens

gpt-realtime-1.5

$4.00 / 1M input tokens

$0.40 / 1M cached input tokens

$16.00 / 1M output tokens

gpt-realtime

$4.00 / 1M input tokens

$0.40 / 1M cached input tokens

$16.00 / 1M output tokens

gpt-realtime

$4.00 / 1M input tokens

$0.40 / 1M cached input tokens

$16.00 / 1M output tokens

gpt-realtime-mini

$0.60 / 1M input tokens

$0.06 / 1M cached input tokens

$2.40 / 1M output tokens

gpt-realtime-mini

$0.60 / 1M input tokens

$0.06 / 1M cached input tokens

$2.40 / 1M output tokens

Sora Video API

Richly detailed, dynamic video generation and remixing with our latest generative model.

Models	Size	Price per second
sora-2	Portrait: 720 x 1280 Landscape: 1280 x 720	$0.10
sora-2-pro	Portrait: 720 x 1280 Landscape: 1280 x 720	$0.30
sora-2-pro	Portrait: 1024 x 1792 Landscape: 1792 x 1024	$0.50

Image Generation API

Precise, high-fidelity image generation and editing with our latest multimodal model.

Text

GPT-image-1.5

$5.00 / 1M input tokens

$1.25 / 1M cached input tokens

$10.00 / 1M output tokens

GPT-image-1.5

$5.00 / 1M input tokens

$1.25 / 1M cached input tokens

$10.00 / 1M output tokens

GPT-image-1

$5.00 / 1M input tokens

$1.25 / 1M cached input tokens

GPT-image-1

$5.00 / 1M input tokens

$1.25 / 1M cached input tokens

GPT-image-1-mini

$2.00 / 1M input tokens

$0.20 / 1M cached input tokens

GPT-image-1-mini

$2.00 / 1M input tokens

$0.20 / 1M cached input tokens

Prompts are billed similarly to other GPT models. Image outputs cost approximately $0.01 (low), $0.04 (medium), and $0.17 (high) for square images.

available via the Responses API
text output tokens include model reasoning tokens

For detailed token usage by image quality and size, see the docs.

Responses API

Our newest API combining the simplicity of Chat Completions with the built-in tool use of Assistants.

Price

Responses API is not priced separately. Tokens are billed at the chosen language model’s input and output rates.

Chat Completions API

Build text-based conversational experiences.

Price

Chat Completions API is not priced separately. Tokens are billed at the chosen language model's input and output rates.

Assistants API

Build assistant-like experiences with our tools.

Price

Assistants API is not priced separately. Tokens are billed at the chosen language model's input and output rates.

Built-in tools

Extend model capabilities with built-in tools in the API Platform.

Tool

Cost

Containers (Hosted Shell and Code Interpreter)

Current prices	Starting March 31, 2026
1 GB (default): $0.03 / container	1 GB (default): $0.03 / per 20 minute session
4 GB: $0.12 / container	4 GB: $0.12 / per 20 minute session
16 GB: $0.48 / container	16 GB: $0.48 / per 20 minute session
64 GB: $1.92 / container	64 GB: $1.92 / per 20 minute session

File Search Storage

$0.10 / GB of vector storage per day (first GB free)

File Search Tool Call (Responses API only)

$2.50 / 1k tool calls

Web Search Tool Call

There are two components that contribute to the cost of using the web search tool: (1) Tool calls and (2) Search content tokens.

Tool calls are billed per 1,000 calls, according to the tool version and model type.
Search content tokens are tokens retrieved from the search index and fed to the model alongside your prompt to generate an answer. These are billed at the model’s input token rate, unless otherwise specified.

Tool Version	Cost
Web search (all models)	$10.00 / 1K calls + search content tokens billed at model rates 1
Web search preview (reasoning models)	$10.00 / 1K calls + search content tokens billed at model rates
Web search preview (non-reasoning models)	$25.00 / 1K calls + search content tokens are free

1 For gpt-4o-mini and gpt-4.1-mini with the web search non-preview tool, search content tokens are charged as a fixed block of 8,000 input tokens per call.

The billing dashboard will report gpt-4.1 and gpt-4.1-mini search line items as ‘web search tool calls | gpt-4o’ and ‘web search tool calls | gpt-4o-mini’

GB refers to binary gigabytes of storage (also known as gibibyte), where 1GB is 2^30 bytes.

AgentKit

Build, deploy, and optimize production-grade agents with Agent Builder, ChatKit, and Evals.

Billing

Begins on November 1, 2025 — no charges will apply before then.

Usage meter

Storage for ChatKit File / Image Uploads.

Free tier (per account, per month)

1 GB

Price beyond free tier

$0.10 / GB-day

What’s always free

Agent Builder – design and iterate with zero cost until you hit Run. Self-hosted ChatKit – host a custom ChatKit backend and pay only normal model-token charges. Enterprise controls – SSO, RBAC, and audit logs are included at no additional fee.

We recommend that developers use our large and mini GPT models for everyday tasks. Our large GPT models generally perform better on a wide range of tasks, while our mini GPT models are fast and inexpensive for simpler tasks.

Our large and mini reasoning models are ideal for complex, multi-step tasks and STEM use cases that require deep thinking about tough problems. You can choose the mini reasoning model if you're looking for a faster, more inexpensive option.

We recommend experimenting with all of these models in the Playground to explore which models provide the best price performance trade-off for your usage.