GPT-4o
OpenAI’s vision-and-voice flagship from 2024 — still a great default.
At a glance
- Context window
- 128k tokens
- Max output
- 16k tokens
- Input price
- $2.50 / 1M tokens
- Output price
- $10.00 / 1M tokens
- Time to first token
- 0.4s
- Input modalities
- text, image, audio
- Output modalities
- text, audio
- License
- Proprietary
- Provider
- OpenAI
Benchmark scores
Public scores from each provider; bars compare this model against the leader in each benchmark.
- Strong multimodal
- Voice-native
- Mature ecosystem
- Weaker than GPT-5.5 on reasoning
- Smaller context vs newer models
- General-purpose chat
- Voice apps
- Vision tasks
Models you should also evaluate
OpenAI’s 2026 flagship — strongest at reasoning, coding and tool use.
Production workhorse — GPT-5.5 quality reasoning at fast-tier prices.
Cheap, fast, and still surprisingly capable — OpenAI’s budget tier.
More from OpenAI
OpenAI’s 2026 flagship — strongest at reasoning, coding and tool use.
Long chain-of-thought reasoning — unbeatable on hard math and code.
Reasoning quality at fast-tier prices — the practical o-series default.
Production workhorse — GPT-5.5 quality reasoning at fast-tier prices.
Cheap, fast, and still surprisingly capable — OpenAI’s budget tier.
OpenAI’s flagship video model — most coherent long clips on the market.
GPT-4o vs… popular head-to-heads
One-click matchups against the models people compare GPT-4o with most.
GPT-4o — frequently asked questions
Need help choosing between models?
Compare every option in one sortable table — intelligence, speed and price on a single page.