OpenAI API Pricing Calculator

Plug in input and output token counts. Compare list pricing across GPT-4o, GPT-4o mini, and other popular chat models — handy for budget planning and prototype-to-production cost forecasting.

Estimated monthly cost (GPT-4o)

$2.75

Uses public list prices; excludes caching, batch, and volume tiers. Confirm on openai.com/api/pricing.

Chat apps resend conversation history each turn, so input tokens often grow with thread length.

About this OpenAI API pricing calculator

OpenAI prices its API per million tokens, split between input (your prompt + context) and output (the model's response). The cost difference between GPT-4o and GPT-4o mini is large, roughly 17x at list price, so picking the right model and getting your token math right can decide whether an AI feature is sustainable or bleeds money. This free calculator lets you plug in real token counts and see per-request, per-conversation, and monthly costs side-by-side.

It's built for founders sizing up an AI feature, engineers comparing model choices for a new endpoint, and ops/finance teams forecasting LLM spend before signing off on a launch. Numbers are list prices from OpenAI's public pricing page — if you're on an enterprise plan with negotiated rates, swap in your own.

When to use this tool

  • Sizing the cost of a new AI feature before you ship it.
  • Deciding between GPT-4o and GPT-4o mini for a specific use case.
  • Forecasting monthly OpenAI spend based on expected request volume.
  • Estimating per-conversation cost for a chatbot you're scoping.
  • Comparing chatbot platforms — knowing the raw API cost lets you spot inflated SaaS markups.

How it works

  1. Choose a model

    Pick from GPT-4o, GPT-4o mini, or whichever model you're using. Each has different per-million-token pricing for input vs output.

  2. Enter token counts

    Input tokens = your prompt + any context (system prompt + user message + retrieved documents). Output tokens = the model's response. As a rule of thumb, 1 token ≈ 0.75 English words, so a 1,000-word response is ~1,333 output tokens.

  3. Read the cost breakdown

    You get cost per request, cost per 1,000 requests, and projected monthly cost at a request volume you control. Compare two models side-by-side to see which makes sense for your workload.
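The three steps above can be sketched in a few lines of Python. This is a minimal illustration, not the calculator's actual implementation: the function names and the `PRICES_PER_MILLION` table are hypothetical, and the prices are the list prices quoted in the FAQ on this page, which should be confirmed before use.

```python
# USD per million tokens. Assumed values taken from the FAQ on this page;
# confirm against the official pricing page before relying on them.
PRICES_PER_MILLION = {
    "gpt-4o": {"input": 2.50, "output": 10.00},
    "gpt-4o-mini": {"input": 0.15, "output": 0.60},
}


def cost_per_request(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of one request at list price."""
    price = PRICES_PER_MILLION[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000


def cost_breakdown(model: str, input_tokens: int, output_tokens: int,
                   requests_per_month: int) -> dict:
    """Return cost per request, per 1,000 requests, and per month."""
    per_request = cost_per_request(model, input_tokens, output_tokens)
    return {
        "per_request": per_request,
        "per_1000_requests": per_request * 1_000,
        "monthly": per_request * requests_per_month,
    }


if __name__ == "__main__":
    # Compare two models side-by-side for the same workload.
    for model in PRICES_PER_MILLION:
        print(model, cost_breakdown(model, 1_000, 500, 10_000))
```

Keeping prices in a single table makes it easy to swap in negotiated rates or add other models without touching the arithmetic.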

Frequently asked questions

How much does the OpenAI API cost?

It depends on the model. As of 2026, GPT-4o is roughly $2.50 per million input tokens and $10 per million output tokens; GPT-4o mini is around $0.15 per million input and $0.60 per million output. That makes GPT-4o mini about 17x cheaper for the same volume (2.50 / 0.15 ≈ 10 / 0.60 ≈ 16.7). This calculator uses the latest published list prices.
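The ratio between the two models can be checked directly from the per-million prices quoted above. The sketch below assumes those prices are still current; the dictionary and function names are illustrative only.

```python
# Per-million-token list prices quoted in the answer above (assumed; verify before use).
GPT_4O = {"input": 2.50, "output": 10.00}
GPT_4O_MINI = {"input": 0.15, "output": 0.60}


def price_ratio(expensive: dict, cheap: dict) -> dict:
    """Return how many times more expensive one model is, per token type."""
    return {kind: expensive[kind] / cheap[kind] for kind in ("input", "output")}


ratios = price_ratio(GPT_4O, GPT_4O_MINI)
print({kind: round(value, 1) for kind, value in ratios.items()})
```

Because the input and output ratios are the same at these prices, the overall multiplier does not depend on the mix of input and output tokens in a workload.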

What is a token?

A token is roughly a piece of a word, about 0.75 English words on average. A long word such as 'extraordinary' may split into several tokens depending on the tokenizer; a short word like 'cat' is typically 1; punctuation usually gets its own token. To count tokens precisely, use OpenAI's tiktoken library; for budgeting, multiply word count by 1.33 to get a rough token count.
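For budgeting, the rule of thumb can be written as a small helper. This is a rough sketch that uses only a whitespace word count and the 0.75 words-per-token figure from the text; `estimate_tokens` is a hypothetical name, and exact counts require a real tokenizer such as tiktoken.

```python
WORDS_PER_TOKEN = 0.75  # rule of thumb from the text; varies by language and content


def estimate_tokens(text: str) -> int:
    """Rough token estimate based on a whitespace word count."""
    words = len(text.split())
    return round(words / WORDS_PER_TOKEN)


if __name__ == "__main__":
    sample = " ".join(["word"] * 1000)  # stand-in for a 1,000-word response
    print(estimate_tokens(sample))
```

The estimate tends to drift for code, non-English text, and heavily punctuated content, so treat it as a planning figure rather than a billing figure.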

How do I estimate cost for a chatbot conversation?

Each turn = (system prompt + conversation history + new user message) input tokens + the response output tokens. Conversations grow as history accumulates, so by turn 5 the input cost can dwarf the output cost. A typical RAG chatbot turn runs 1k-3k input tokens (system + retrieved docs) and 200-500 output tokens. At GPT-4o list prices that is about $0.005-$0.013 per turn: (1,000 × $2.50 + 200 × $10) / 1M ≈ $0.0045 at the low end and (3,000 × $2.50 + 500 × $10) / 1M ≈ $0.0125 at the high end.
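The growth of input tokens with thread length can be simulated with a short loop. This is a simplified sketch: it assumes the full history is resent every turn with no truncation or caching, uses the GPT-4o list prices quoted on this page, and the function and parameter names are hypothetical.

```python
INPUT_PRICE = 2.50    # USD per million input tokens (assumed GPT-4o list price)
OUTPUT_PRICE = 10.00  # USD per million output tokens (assumed GPT-4o list price)


def conversation_cost(system_tokens: int, user_tokens_per_turn: int,
                      reply_tokens_per_turn: int, turns: int) -> list:
    """Return the USD cost of each turn when full history is resent every turn."""
    history = 0
    per_turn = []
    for _ in range(turns):
        # Input = system prompt + everything said so far + the new user message.
        input_tokens = system_tokens + history + user_tokens_per_turn
        cost = (input_tokens * INPUT_PRICE
                + reply_tokens_per_turn * OUTPUT_PRICE) / 1_000_000
        per_turn.append(cost)
        # Both the user message and the reply become history for the next turn.
        history += user_tokens_per_turn + reply_tokens_per_turn
    return per_turn


if __name__ == "__main__":
    costs = conversation_cost(500, 100, 300, turns=5)
    for turn, cost in enumerate(costs, start=1):
        print(f"turn {turn}: ${cost:.4f}")
    print(f"total: ${sum(costs):.4f}")
```

Output cost stays flat per turn in this model while input cost rises linearly, which is why long threads are often trimmed or summarized.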

Is GPT-4o mini good enough for production?

For most customer-facing chatbot use cases, yes — it's noticeably weaker than GPT-4o at multi-step reasoning and code, but for grounded Q&A over your documents it's often indistinguishable in user testing. Start with GPT-4o mini, measure quality, and upgrade only the specific calls where you can prove GPT-4o is worth the price.

Are these prices current?

We update the catalog whenever OpenAI publishes a new pricing page. For real-time confirmation, check openai.com/api/pricing — and remember that enterprise customers with annual commits typically get 10-25% off list, so build that into your forecast if you'll qualify.

Does this include image, audio, or fine-tuning costs?

No. The calculator currently covers chat completion pricing only. For DALL-E, Whisper, or fine-tuning costs, check the OpenAI pricing page directly. Those products use different per-unit pricing (per image, per audio minute, per training token) that doesn't fit the same calculator.

Stop paying per token — get a flat-price chatbot

BuiltABot bundles GPT-4o, RAG, hosting, escalation, and analytics into a flat monthly price. Predictable cost, no surprise OpenAI bills, ship in 5 minutes.