Models
AI models available through OneMux.
Compare providers, context windows, cost, speed, and tags.
GPT-4o
OpenAI
A balanced multimodal model for production assistants and high-quality generation.
Context: 128K
Input: from $2.50 / 1M tokens
Output: from $10 / 1M tokens
GPT-4.1
OpenAI
A strong long-context model for coding, retrieval, and complex workflows.
Context: 1M
Input: from $2 / 1M tokens
Output: from $8 / 1M tokens
Claude Sonnet
Anthropic
A high-quality model for reasoning, coding, and careful business workflows.
Context: 200K
Input: from $3 / 1M tokens
Output: from $15 / 1M tokens
Claude Opus
Anthropic
A premium model for difficult reasoning, analysis, and agentic tasks.
Context: 200K
Input: from $15 / 1M tokens
Output: from $75 / 1M tokens
Gemini
Google
A long-context model family for multimodal and document-heavy applications.
Context: 1M+
Input: from $1.25 / 1M tokens
Output: from $5 / 1M tokens
DeepSeek
DeepSeek
A cost-efficient model option for coding, chat, and high-volume workloads.
Context: 64K
Input: from $0.14 / 1M tokens
Output: from $0.28 / 1M tokens
Qwen
Alibaba Cloud
A multilingual model family for global products and developer tools.
Context: 128K
Input: from $0.35 / 1M tokens
Output: from $1.40 / 1M tokens
Llama
Meta
A flexible open model family for scalable AI applications.
Context: 128K
Input: from $0.20 / 1M tokens
Output: from $0.80 / 1M tokens
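As a rough illustration of how the per-token prices above turn into a bill, the sketch below estimates the cost of a workload for each listed model. The `PRICES` table and the `estimate_cost` helper are hypothetical names introduced here, not part of any OneMux API. The figures are copied from the listing, and because they are "from" prices, the results should be read as lower bounds.

```python
# Hypothetical helper for comparing the listed models by estimated cost.
# Prices are (input, output) in USD per 1M tokens, copied from the listing above.
# They are "from" prices, so every estimate is a lower bound.
PRICES = {
    "GPT-4o": (2.50, 10.00),
    "GPT-4.1": (2.00, 8.00),
    "Claude Sonnet": (3.00, 15.00),
    "Claude Opus": (15.00, 75.00),
    "Gemini": (1.25, 5.00),
    "DeepSeek": (0.14, 0.28),
    "Qwen": (0.35, 1.40),
    "Llama": (0.20, 0.80),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for the given token counts."""
    input_price, output_price = PRICES[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    # Example workload (an assumption): 2M input tokens and 500K output tokens.
    workload = (2_000_000, 500_000)
    # Print the models from cheapest to most expensive for this workload.
    for name in sorted(PRICES, key=lambda m: estimate_cost(m, *workload)):
        print(f"{name:14s} ${estimate_cost(name, *workload):.2f}")
```

Output tokens cost several times more than input tokens for most of these models, so the input-to-output ratio of a workload can change the ranking as much as the headline input price does.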