AI Models · Text-to-Speech
Best Text-to-Speech Models Text-to-Speech
Generate natural-sounding speech audio from text.
Text-to-Speech models
3 models matched. Click any column to sort.
| Notes | ||||
|---|---|---|---|---|
| Cartesia Sonic | Cartesia | Proprietary | $0.07 | Sub-100ms time-to-first-byte — built for realtime voice agents. |
| OpenAI TTS (HD) | OpenAI | Proprietary | $0.03 | Cheap, natural-sounding TTS bundled with the OpenAI API. |
| ElevenLabs Multilingual v2 | ElevenLabs | Proprietary | $0.18 | The industry standard for expressive cloned voices. |
Showing 3 of 3 models. Click any column header to sort. Prices are USD per 1M tokens unless noted otherwise. Estimates marked with *.
Browse by category
Drill into a slice of the catalog — open-source models, video models, or all models from one provider.
By License
By Purpose
Frontier
10The most capable models from each major lab.
Reasoning
7Long chain-of-thought models built for hard math, code, and planning.
Fast & Cheap
6Production-grade workhorses with the best speed and cost.
Code
4Models specialised for software engineering tasks.
Multimodal
11Models that natively understand text, images, and beyond.
By Modality
By Provider
OpenAI
9All models from OpenAI — GPT, o-series and beyond.
Anthropic
4Claude family of models from Anthropic.
Gemini family of models from Google DeepMind.
Meta
3Open-weights Llama family from Meta AI.
Mistral
2Models from Mistral AI — EU-hosted, multilingual.
DeepSeek
2Frontier-class open-weights models from DeepSeek.
xAI
1Grok models from xAI.
Frequently asked questions
Explore the full catalog
See every AI model in one place — intelligence, speed and price on a single sortable table.