Models

AI models available through OneMux.

Compare providers, context windows, cost, speed, and tags.

GPT-4o

OpenAI

Fast

A balanced multimodal model for production assistants and high-quality generation.

Context: 128K

Input: from $2.50 / 1M tokens

Output: from $10 / 1M tokens

visiongeneralreasoning

GPT-4.1

OpenAI

Fast

A strong long-context model for coding, retrieval, and complex workflows.

Context: 1M

Input: from $2 / 1M tokens

Output: from $8 / 1M tokens

codinglong context

Claude Sonnet

Anthropic

Fast

A high-quality model for reasoning, coding, and careful business workflows.

Context: 200K

Input: from $3 / 1M tokens

Output: from $15 / 1M tokens

writingcoding

Claude Opus

Anthropic

Balanced

A premium model for difficult reasoning, analysis, and agentic tasks.

Context: 200K

Input: from $15 / 1M tokens

Output: from $75 / 1M tokens

deep reasoningagents

Gemini

Google

Fast

A long-context model family for multimodal and document-heavy applications.

Context: 1M+

Input: from $1.25 / 1M tokens

Output: from $5 / 1M tokens

multimodallong context

DeepSeek

DeepSeek

Fast

A cost-efficient model option for coding, chat, and high-volume workloads.

Context: 64K

Input: from $0.14 / 1M tokens

Output: from $0.28 / 1M tokens

cost-efficientcoding

Qwen

Alibaba Cloud

Fast

A multilingual model family for global products and developer tools.

Context: 128K

Input: from $0.35 / 1M tokens

Output: from $1.40 / 1M tokens

multilingualcoding

Llama

Meta

Fast

A flexible open model family for scalable AI applications.

Context: 128K

Input: from $0.20 / 1M tokens

Output: from $0.80 / 1M tokens

open modelflexible