Important: What FlowDot Is (And Isn't)
FlowDot does not train or run any AI models. We are a workflow platform that connects you to third-party AI providers (OpenAI, Anthropic, Google, etc.) through their APIs. When you use an AI model through FlowDot, the actual computation happens on the provider's servers - not ours.
This means we cannot directly measure energy consumption. We don't have access to the providers' data centers, power meters, or infrastructure. Everything on this page is our best attempt to estimate and inform - not measured reality.
Key Terms & Definitions
- Watt-hour (Wh)
- A unit of energy equal to one watt of power sustained for one hour. Used to measure AI inference energy consumption.
- Token
- A unit of text processing in AI models. One token is approximately 4 characters or 0.75 words in English.
- Inference
- The process of running a trained AI model to generate responses. Consumes energy for each request processed.
- Parameters
- The learned weights in a neural network. GPT-4 has approximately 1.8 trillion parameters; more parameters typically means higher energy use.
- PUE (Power Usage Effectiveness)
- A ratio measuring data center efficiency. A PUE of 1.2 means 20% of energy goes to cooling and overhead.
- Carbon Footprint
- The total greenhouse gas emissions caused by AI computation, measured in CO2 equivalent (CO2e).
The Transparency Problem
Large Language Models require substantial computational resources. Most AI providers do not disclose energy data, creating challenges:
- Understand the true environmental cost of AI services
- Make informed choices about which models to use
- Hold providers accountable for their carbon footprint
- Develop more energy-efficient AI systems
- Compare energy efficiency across different providers
- Track improvements in model efficiency over time
- Set meaningful sustainability goals
- Report accurately on AI environmental impact
Key AI Energy Statistics
Training Energy Facts
- GPT-3 training: ~1,287 MWh
- Equal to ~120 US homes per year
- Training runs 24/7 for weeks
- Multiple GPUs running in parallel
- Cooling adds 10-20% overhead
- One-time cost per model version
Inference Energy Facts
- Scales with usage volume
- Can exceed training cost over time
- Optimizations reduce per-query cost
- Batching improves efficiency
- Smaller models use less per query
- Caching reduces redundant compute
Model Size Impact
- Energy scales with parameters
- GPT-4 has ~1.8T parameters
- GPT-4o-mini is ~10x more efficient
- Smaller models for simple tasks
- Larger models for complex reasoning
- Architecture affects efficiency
Efficiency Trends
- 10-100x efficiency gains per generation
- Better architectures use less energy
- Quantization reduces memory needs
- Pruning removes unnecessary weights
- Distillation creates efficient models
- Hardware improvements continue
Practical Tips for Reducing AI Energy Use
Choose the Right Model
- Use smaller models for simple tasks
- GPT-4o-mini uses ~10x less energy than GPT-4
- Claude 3 Haiku is optimized for efficiency
- Match model capability to task complexity
Optimize Your Prompts
- Write concise, clear instructions
- Avoid unnecessary context in prompts
- Request shorter responses when appropriate
- Batch multiple queries when possible
Use Local Models
- Run Ollama on your own hardware
- Power local setups with renewable energy
- Llama 3.1 8B offers good efficiency
- Full control over your energy source
Monitor with FlowDot
- Track energy estimates per workflow
- Identify high-consumption patterns
- Compare models for the same task
- Set team energy awareness goals
Our Methodology
We compile energy data from multiple sources and are fully transparent about the reliability and origin of each data point. Our goal is to provide the best available estimates while being honest about their limitations.
Data Source Types
How We Calculate Estimates
- Start with known parameter counts when available
- Apply energy-per-parameter formulas from research papers
- Factor in typical data center PUE (1.1-1.2)
- Account for inference optimization techniques
- Cross-reference with pricing tiers as sanity check
- Update estimates when new data becomes available
- Flag uncertainty with tilde (~) prefix on values
Key Research Sources
- Patterson et al. (2021) - "Carbon Emissions and Large Neural Network Training" - Google Research
- Luccioni, Viguier & Ligozat (2023) - "Estimating the Carbon Footprint of BLOOM" - Hugging Face
- Touvron et al. (2023) - "LLaMA: Open and Efficient Foundation Language Models" - Meta AI
- Strubell, Ganesh & McCallum (2019) - "Energy and Policy Considerations for Deep Learning in NLP" - ACL
- Dodge et al. (2022) - "Measuring the Carbon Intensity of AI in Cloud Instances" - ACM FAccT
Per-Model Energy Estimates
Important Disclaimer
The data below is NOT official. Unless explicitly marked as "Official" (green badge), all values are our own estimates and educated guesses based on publicly available information.
We make no claims about actual energy consumption. The true energy usage of these models is known only to their providers. Our estimates may be significantly different from reality.
| Model | Provider | Params (B) | Context | Input Wh/1M | Output Wh/1M | Capabilities | Source |
|---|---|---|---|---|---|---|---|
|
QwQ-32B-Preview
Qwen/QwQ-32B-Preview |
Together | 32.0B | 33K | ~19.20 | ~57.60 |
V
T
I
A
J
|
Estimated |
|
Qwen2.5-72B-Instruct-Turbo
Qwen/Qwen2.5-72B-Instruct-Turbo |
Together | 72.0B | 131K | ~43.20 | ~129.60 |
V
T
I
A
J
|
Estimated |
|
Qwen2.5-7B-Instruct-Turbo
Qwen/Qwen2.5-7B-Instruct-Turbo |
Together | 7.0B | 131K | ~4.20 | ~12.60 |
V
T
I
A
J
|
Estimated |
|
Qwen3-Coder-Next
Qwen/Qwen3-Coder-Next |
Together | 80.0B | 131K | ~24.00 | ~72.00 |
V
T
I
A
J
|
Estimated |
|
Qwen3-Next-80B-A3B-Instruct
Qwen/Qwen3-Next-80B-A3B-Instruct |
Together | 35.0B | 131K | ~21.00 | ~63.00 |
V
T
I
A
J
|
Inferred |
|
GLM-4.7
THUDM/GLM-4.7 |
Together | 42.0B | 131K | ~25.20 | ~75.60 |
V
T
I
A
J
|
Inferred |
|
Cogito 671b V2 P1
accounts/cogito/models/cogito-671b-v2-p1 |
Fireworks | 6.5B | 164K | ~3.90 | ~11.70 |
V
T
I
A
J
|
Inferred |
|
Deepseek R1
accounts/fireworks/models/deepseek-r1 |
Fireworks | 90.0B | 66K | ~32.40 | ~97.20 |
V
T
I
A
J
|
Inferred |
|
Deepseek V3
accounts/fireworks/models/deepseek-v3 |
Fireworks | 17.0B | 66K | ~6.12 | ~18.36 |
V
T
I
A
J
|
Inferred |
|
Deepseek V3p1
accounts/fireworks/models/deepseek-v3p1 |
Fireworks | 6.5B | 164K | ~2.34 | ~7.02 |
V
T
I
A
J
|
Inferred |
|
Deepseek V3p2
accounts/fireworks/models/deepseek-v3p2 |
Fireworks | 6.5B | 164K | ~2.34 | ~7.02 |
V
T
I
A
J
|
Inferred |
|
Flux 1 Schnell Fp8
accounts/fireworks/models/flux-1-schnell-fp8 |
Fireworks | 5.9B | - | ~3.54 | ~10.62 |
V
T
I
A
J
|
Inferred |
|
Flux Kontext Max
accounts/fireworks/models/flux-kontext-max |
Fireworks | 5.9B | - | ~3.54 | ~10.62 |
V
T
I
A
J
|
Inferred |
|
Flux Kontext Pro
accounts/fireworks/models/flux-kontext-pro |
Fireworks | 5.9B | - | ~3.54 | ~10.62 |
V
T
I
A
J
|
Inferred |
|
Gemma2 9b It
accounts/fireworks/models/gemma2-9b-it |
Fireworks | 2.0B | 8K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
Glm 4p7
accounts/fireworks/models/glm-4p7 |
Fireworks | 7.1B | 203K | ~4.26 | ~12.78 |
V
T
I
A
J
|
Inferred |
|
Glm 5
accounts/fireworks/models/glm-5 |
Fireworks | 7.1B | 203K | ~4.26 | ~12.78 |
V
T
I
A
J
|
Inferred |
|
Gpt Oss 120b
accounts/fireworks/models/gpt-oss-120b |
Fireworks | 6.5B | 131K | ~3.90 | ~11.70 |
V
T
I
A
J
|
Inferred |
|
Kimi K2 Instruct 0905
accounts/fireworks/models/kimi-k2-instruct-0905 |
Fireworks | 7.1B | 262K | ~4.26 | ~12.78 |
V
T
I
A
J
|
Inferred |
|
Kimi K2p5
accounts/fireworks/models/kimi-k2p5 |
Fireworks | 7.1B | 262K | ~4.26 | ~12.78 |
V
T
I
A
J
|
Inferred |
|
Llama V3p1 405b Instruct
accounts/fireworks/models/llama-v3p1-405b-instruct |
Fireworks | 55.0B | 131K | ~33.00 | ~99.00 |
V
T
I
A
J
|
Inferred |
|
Llama V3p1 70b Instruct
accounts/fireworks/models/llama-v3p1-70b-instruct |
Fireworks | 19.0B | 131K | ~11.40 | ~34.20 |
V
T
I
A
J
|
Inferred |
|
Llama V3p1 8b Instruct
accounts/fireworks/models/llama-v3p1-8b-instruct |
Fireworks | 2.2B | 131K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
Llama V3p2 11b Vision Instruct
accounts/fireworks/models/llama-v3p2-11b-vision-instruct |
Fireworks | 2.2B | 131K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
Llama V3p2 1b Instruct
accounts/fireworks/models/llama-v3p2-1b-instruct |
Fireworks | 2.2B | 131K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
Llama V3p2 3b Instruct
accounts/fireworks/models/llama-v3p2-3b-instruct |
Fireworks | 2.2B | 131K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
Llama V3p2 90b Vision Instruct
accounts/fireworks/models/llama-v3p2-90b-vision-instruct |
Fireworks | 19.0B | 131K | ~11.40 | ~34.20 |
V
T
I
A
J
|
Inferred |
|
Llama V3p3 70b Instruct
accounts/fireworks/models/llama-v3p3-70b-instruct |
Fireworks | 19.0B | 131K | ~11.40 | ~34.20 |
V
T
I
A
J
|
Inferred |
|
Mixtral 8x22b Instruct
accounts/fireworks/models/mixtral-8x22b-instruct |
Fireworks | 17.0B | 66K | ~6.12 | ~18.36 |
V
T
I
A
J
|
Inferred |
|
Mixtral 8x7b Instruct
accounts/fireworks/models/mixtral-8x7b-instruct |
Fireworks | 5.9B | 33K | ~2.12 | ~6.37 |
V
T
I
A
J
|
Inferred |
|
Phi 3 Vision 128k Instruct
accounts/fireworks/models/phi-3-vision-128k-instruct |
Fireworks | 2.2B | 131K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
Qwen2 72b Instruct
accounts/fireworks/models/qwen2-72b-instruct |
Fireworks | 17.0B | 33K | ~10.20 | ~30.60 |
V
T
I
A
J
|
Inferred |
|
Qwen2p5 72b Instruct
accounts/fireworks/models/qwen2p5-72b-instruct |
Fireworks | 17.0B | 33K | ~10.20 | ~30.60 |
V
T
I
A
J
|
Inferred |
|
AionLabs: Aion-1.0-Mini
aion-labs/aion-1.0-mini |
Openrouter | 32.0B | 131K | ~16.00 | ~48.00 |
-
|
Estimated |
|
AionLabs: Aion-RP 1.0 (8B)
aion-labs/aion-rp-llama-3.1-8b |
Openrouter | 8.0B | 33K | ~4.00 | ~12.00 |
-
|
Estimated |
|
AlfredPros: CodeLLaMa 7B Instruct Solidity
alfredpros/codellama-7b-instruct-solidity |
Openrouter | 7.0B | 4K | ~3.50 | ~10.50 |
-
|
Estimated |
|
Tongyi DeepResearch 30B A3B
alibaba/tongyi-deepresearch-30b-a3b |
Openrouter | 30.0B | 131K | ~15.00 | ~45.00 |
-
|
Estimated |
|
Tongyi DeepResearch 30B A3B (free)
alibaba/tongyi-deepresearch-30b-a3b:free |
Openrouter | 30.0B | 131K | ~15.00 | ~45.00 |
-
|
Estimated |
|
Allam 2 7b
allam-2-7b |
Groq | 7.2B | 4K | ~4.32 | ~12.96 |
V
T
I
A
J
|
Inferred |
|
AllenAI: Molmo2 8B
allenai/molmo-2-8b |
Openrouter | 8.0B | 37K | ~4.80 | ~14.40 |
-
|
Estimated |
|
AllenAI: Olmo 2 32B Instruct
allenai/olmo-2-0325-32b-instruct |
Openrouter | 32.0B | 128K | ~16.00 | ~48.00 |
-
|
Estimated |
|
AllenAI: Olmo 3 32B Think
allenai/olmo-3-32b-think |
Openrouter | 32.0B | 66K | ~19.20 | ~57.60 |
-
|
Estimated |
|
AllenAI: Olmo 3 32B Think (free)
allenai/olmo-3-32b-think:free |
Openrouter | 32.0B | 66K | ~16.00 | ~48.00 |
-
|
Estimated |
|
AllenAI: Olmo 3 7B Instruct
allenai/olmo-3-7b-instruct |
Openrouter | 7.0B | 66K | ~4.20 | ~12.60 |
-
|
Estimated |
|
AllenAI: Olmo 3 7B Think
allenai/olmo-3-7b-think |
Openrouter | 7.0B | 66K | ~4.20 | ~12.60 |
-
|
Estimated |
|
AllenAI: Olmo 3.1 32B Instruct
allenai/olmo-3.1-32b-instruct |
Openrouter | 32.0B | 66K | ~16.00 | ~48.00 |
-
|
Estimated |
|
AllenAI: Olmo 3.1 32B Think
allenai/olmo-3.1-32b-think |
Openrouter | 32.0B | 66K | ~16.00 | ~48.00 |
-
|
Estimated |
|
AllenAI: Olmo 3.1 32B Think (free)
allenai/olmo-3.1-32b-think:free |
Openrouter | 32.0B | 66K | ~16.00 | ~48.00 |
-
|
Estimated |
|
Goliath 120B
alpindale/goliath-120b |
Openrouter | 70.0B | 6K | ~35.00 | ~105.00 |
-
|
Estimated |
|
Amazon: Nova Lite 1.0
amazon/nova-lite-v1 |
Openrouter | 20.0B | 300K | ~14.40 | ~43.20 |
-
|
Estimated |
|
Magnum v4 72B
anthracite-org/magnum-v4-72b |
Openrouter | 72.0B | 16K | ~36.00 | ~108.00 |
-
|
Estimated |
|
Anthropic: Claude 3 Haiku
anthropic/claude-3-haiku |
Openrouter | 20.0B | 200K | ~14.40 | ~43.20 |
-
|
Estimated |
|
Anthropic: Claude 3 Opus
anthropic/claude-3-opus |
Openrouter | 175.0B | 200K | ~126.00 | ~378.00 |
-
|
Estimated |
|
Anthropic: Claude 3.5 Haiku
anthropic/claude-3.5-haiku |
Redpill | 20.0B | 200K | ~14.40 | ~43.20 |
V
T
I
A
J
|
Estimated |
|
Anthropic: Claude 3.5 Haiku (2024-10-22)
anthropic/claude-3.5-haiku-20241022 |
Openrouter | 20.0B | 200K | ~14.40 | ~43.20 |
-
|
Estimated |
|
Anthropic: Claude 3.5 Sonnet
anthropic/claude-3.5-sonnet |
Openrouter | 175.0B | 200K | ~84.00 | ~252.00 |
-
|
Research |
|
Anthropic: Claude 3.7 Sonnet
anthropic/claude-3.7-sonnet |
Redpill | 240.0B | 200K | ~144.00 | ~432.00 |
V
T
I
A
J
|
Inferred |
|
Anthropic: Claude Haiku 4.5
anthropic/claude-haiku-4.5 |
Redpill | 76.0B | 200K | ~45.60 | ~136.80 |
V
T
I
A
J
|
Inferred |
|
Anthropic: Claude Opus 4
anthropic/claude-opus-4 |
Redpill | 800.0B | 200K | ~480.00 | ~1,440.00 |
V
T
I
A
J
|
Inferred |
|
Anthropic: Claude Opus 4.1
anthropic/claude-opus-4.1 |
Redpill | 800.0B | 200K | ~480.00 | ~1,440.00 |
V
T
I
A
J
|
Inferred |
|
Anthropic: Claude Opus 4.5
anthropic/claude-opus-4.5 |
Redpill | 400.0B | 200K | ~240.00 | ~720.00 |
V
T
I
A
J
|
Inferred |
|
Anthropic: Claude Opus 4.6
anthropic/claude-opus-4.6 |
Redpill | 540.0B | 1.0M | ~324.00 | ~972.00 |
V
T
I
A
J
|
Inferred |
|
Anthropic: Claude Sonnet 4
anthropic/claude-sonnet-4 |
Redpill | 240.0B | 1.0M | ~144.00 | ~432.00 |
V
T
I
A
J
|
Inferred |
|
Anthropic: Claude Sonnet 4.5
anthropic/claude-sonnet-4.5 |
Redpill | 240.0B | 1.0M | ~144.00 | ~432.00 |
V
T
I
A
J
|
Inferred |
|
Anthropic: Claude Sonnet 4.6
anthropic/claude-sonnet-4.6 |
Redpill | 240.0B | 1.0M | ~144.00 | ~432.00 |
V
T
I
A
J
|
Inferred |
|
Arcee AI: Trinity Large Preview (free)
arcee-ai/trinity-large-preview:free |
Openrouter | 400.0B | 131K | ~200.00 | ~600.00 |
-
|
Estimated |
|
Arcee AI: Trinity Mini
arcee-ai/trinity-mini |
Openrouter | 26.0B | 131K | ~13.00 | ~39.00 |
-
|
Estimated |
|
Arcee AI: Trinity Mini (free)
arcee-ai/trinity-mini:free |
Openrouter | 26.0B | 131K | ~13.00 | ~39.00 |
-
|
Estimated |
|
ArliAI: QwQ 32B RpR v1
arliai/qwq-32b-arliai-rpr-v1 |
Openrouter | 32.0B | 33K | ~16.00 | ~48.00 |
-
|
Estimated |
|
Baidu: ERNIE 4.5 21B A3B
baidu/ernie-4.5-21b-a3b |
Openrouter | 21.0B | 120K | ~6.30 | ~18.90 |
-
|
Estimated |
|
Baidu: ERNIE 4.5 21B A3B Thinking
baidu/ernie-4.5-21b-a3b-thinking |
Openrouter | 21.0B | 131K | ~6.30 | ~18.90 |
-
|
Estimated |
|
Baidu: ERNIE 4.5 300B A47B
baidu/ernie-4.5-300b-a47b |
Openrouter | 300.0B | 123K | ~90.00 | ~270.00 |
-
|
Estimated |
|
Baidu: ERNIE 4.5 VL 28B A3B
baidu/ernie-4.5-vl-28b-a3b |
Openrouter | 28.0B | 30K | ~10.08 | ~30.24 |
-
|
Estimated |
|
Baidu: ERNIE 4.5 VL 424B A47B
baidu/ernie-4.5-vl-424b-a47b |
Openrouter | 424.0B | 123K | ~152.64 | ~457.92 |
-
|
Estimated |
|
ByteDance Seed: Seed 1.6
bytedance-seed/seed-1.6 |
Openrouter | 1.6B | 262K | ~2.00 | ~6.00 |
-
|
Estimated |
|
ByteDance: UI-TARS 7B
bytedance/ui-tars-1.5-7b |
Openrouter | 72.0B | 128K | ~43.20 | ~129.60 |
-
|
Estimated |
|
Canopylabs/orpheus Arabic Saudi
canopylabs/orpheus-arabic-saudi |
Groq | 7.2B | 4K | ~4.32 | ~12.96 |
V
T
I
A
J
|
Inferred |
|
Canopylabs/orpheus V1 English
canopylabs/orpheus-v1-english |
Groq | 7.2B | 4K | ~4.32 | ~12.96 |
V
T
I
A
J
|
Inferred |
| chatgpt-4o-latest | Openai | 220.0B | 128K | ~132.00 | ~396.00 |
V
T
I
A
J
|
Inferred |
| chatgpt-image-latest | Openai | 220.0B | 128K | ~132.00 | ~396.00 |
V
T
I
A
J
|
Inferred |
|
Claude 2.0
claude-2.0 |
Anthropic | 380.0B | 200K | ~228.00 | ~684.00 |
V
T
I
A
J
|
Inferred |
|
Claude 2.1
claude-2.1 |
Anthropic | 380.0B | 200K | ~228.00 | ~684.00 |
V
T
I
A
J
|
Inferred |
|
Claude 3.5 Haiku
claude-3-5-haiku |
Anthropic | 68.0B | 200K | ~40.80 | ~122.40 |
V
T
I
A
J
|
Inferred |
|
Claude 3.5 Haiku
claude-3-5-haiku-20241022 |
Anthropic | 20.0B | 200K | ~14.40 | ~43.20 |
V
T
I
A
J
|
Estimated |
|
Claude 3.5 Haiku Latest
claude-3-5-haiku-latest |
Anthropic | 20.0B | 200K | ~14.40 | ~43.20 |
V
T
I
A
J
|
Estimated |
|
Claude 3.5 Sonnet
claude-3-5-sonnet |
Anthropic | 240.0B | 200K | ~144.00 | ~432.00 |
V
T
I
A
J
|
Inferred |
|
Claude 3.5 Sonnet
claude-3-5-sonnet-20240620 |
Anthropic | 240.0B | 200K | ~144.00 | ~432.00 |
V
T
I
A
J
|
Inferred |
|
Claude 3.5 Sonnet
claude-3-5-sonnet-20241022 |
Anthropic | 175.0B | 200K | ~126.00 | ~378.00 |
V
T
I
A
J
|
Estimated |
|
Claude 3.5 Sonnet Latest
claude-3-5-sonnet-latest |
Anthropic | 175.0B | 200K | ~126.00 | ~378.00 |
V
T
I
A
J
|
Estimated |
|
Claude 3.7 Sonnet
claude-3-7-sonnet |
Anthropic | 240.0B | 200K | ~144.00 | ~432.00 |
V
T
I
A
J
|
Inferred |
|
Claude 3.7 Sonnet
claude-3-7-sonnet-20250219 |
Anthropic | 175.0B | 200K | ~126.00 | ~378.00 |
V
T
I
A
J
|
Estimated |
|
Claude 3 Haiku
claude-3-haiku |
Anthropic | 31.0B | 200K | ~18.60 | ~55.80 |
V
T
I
A
J
|
Inferred |
|
Claude 3 Haiku
claude-3-haiku-20240307 |
Anthropic | 20.0B | 200K | ~14.40 | ~43.20 |
V
T
I
A
J
|
Estimated |
|
Claude 3 Opus
claude-3-opus |
Anthropic | 800.0B | 200K | ~480.00 | ~1,440.00 |
V
T
I
A
J
|
Inferred |
|
Claude 3 Opus
claude-3-opus-20240229 |
Anthropic | 175.0B | 200K | ~126.00 | ~378.00 |
V
T
I
A
J
|
Estimated |
|
Claude 3 Opus Latest
claude-3-opus-latest |
Anthropic | 175.0B | 200K | ~126.00 | ~378.00 |
V
T
I
A
J
|
Estimated |
|
Claude 3 Sonnet
claude-3-sonnet |
Anthropic | 240.0B | 200K | ~144.00 | ~432.00 |
V
T
I
A
J
|
Inferred |
|
Claude 3 Sonnet
claude-3-sonnet-20240229 |
Anthropic | 70.0B | 200K | ~50.40 | ~151.20 |
V
T
I
A
J
|
Estimated |
|
Claude Haiku 4
claude-haiku-4 |
Anthropic | 68.0B | 200K | ~40.80 | ~122.40 |
V
T
I
A
J
|
Inferred |
|
Claude Haiku 4.1
claude-haiku-4-1 |
Anthropic | 68.0B | 200K | ~40.80 | ~122.40 |
V
T
I
A
J
|
Inferred |
|
Claude Haiku 4.5
claude-haiku-4-5-20251001 |
Anthropic | 4.5B | - | ~2.70 | ~8.10 |
-
|
Estimated |
|
Claude Instant 1.2
claude-instant-1.2 |
Anthropic | 43.0B | 100K | ~25.80 | ~77.40 |
V
T
I
A
J
|
Inferred |
|
Claude Opus 4
claude-opus-4 |
Anthropic | 400.0B | 200K | ~240.00 | ~720.00 |
V
T
I
A
J
|
Inferred |
|
Claude Opus 4.1
claude-opus-4-1 |
Anthropic | 400.0B | 200K | ~240.00 | ~720.00 |
V
T
I
A
J
|
Inferred |
|
Claude Opus 4.1
claude-opus-4-1-20250805 |
Anthropic | 200.0B | - | ~120.00 | ~360.00 |
-
|
Estimated |
|
Claude Opus 4
claude-opus-4-20250514 |
Anthropic | 200.0B | 200K | ~120.00 | ~360.00 |
V
T
I
A
J
|
Estimated |
|
Claude Opus 4.5
claude-opus-4-5 |
Anthropic | 400.0B | 200K | ~240.00 | ~720.00 |
V
T
I
A
J
|
Inferred |
|
Claude Opus 4.5
claude-opus-4-5-20251101 |
Anthropic | 200.0B | 200K | ~120.00 | ~360.00 |
V
T
I
A
J
|
Estimated |
|
Claude Opus 4.6
claude-opus-4-6 |
Anthropic | 400.0B | 200K | ~240.00 | ~720.00 |
V
T
I
A
J
|
Inferred |
|
Claude Sonnet 4
claude-sonnet-4 |
Anthropic | 240.0B | 200K | ~144.00 | ~432.00 |
V
T
I
A
J
|
Inferred |
|
Claude Sonnet 4
claude-sonnet-4-20250514 |
Anthropic | 175.0B | 200K | ~105.00 | ~315.00 |
V
T
I
A
J
|
Estimated |
|
Claude Sonnet 4.5
claude-sonnet-4-5 |
Anthropic | 240.0B | 200K | ~144.00 | ~432.00 |
V
T
I
A
J
|
Inferred |
|
Claude Sonnet 4.5
claude-sonnet-4-5-20250514 |
Anthropic | 240.0B | 200K | ~144.00 | ~432.00 |
V
T
I
A
J
|
Inferred |
|
Claude Sonnet 4.5
claude-sonnet-4-5-20250929 |
Anthropic | 175.0B | - | ~105.00 | ~315.00 |
-
|
Estimated |
| codellama:13b | Ollama | 13.0B | - | ~7.80 | ~23.40 |
-
|
Estimated |
| codellama:34b | Ollama | 34.0B | - | ~20.40 | ~61.20 |
-
|
Estimated |
| codellama:7b | Ollama | 7.0B | - | ~4.20 | ~12.60 |
-
|
Estimated |
| codestral-2508 | Mistral | 20.0B | 256K | ~12.00 | ~36.00 |
V
T
I
A
J
|
Inferred |
| codestral-embed | Mistral | 20.0B | 256K | ~12.00 | ~36.00 |
V
T
I
A
J
|
Inferred |
|
codestral-embed
codestral-embed-2505 |
Mistral | 20.0B | 256K | ~12.00 | ~36.00 |
V
T
I
A
J
|
Inferred |
|
codestral-2508
codestral-latest |
Mistral | 20.0B | 256K | ~12.00 | ~36.00 |
V
T
I
A
J
|
Inferred |
|
Codestral Mamba
codestral-mamba |
Mistral | 7.0B | - | ~4.20 | ~12.60 |
-
|
Estimated |
|
Codestral Mamba Latest
codestral-mamba-latest |
Mistral | 3.7B | 256K | ~2.22 | ~6.66 |
V
T
I
A
J
|
Inferred |
| codex-mini-latest | Openai | 84.0B | 200K | ~42.00 | ~126.00 |
-
|
Inferred |
|
Venice: Uncensored (free)
cognitivecomputations/dolphin-mistral-24b-venice-edition:free |
Openrouter | 24.0B | 33K | ~12.00 | ~36.00 |
-
|
Estimated |
|
Cohere: Command A
cohere/command-a |
Openrouter | 111.0B | 256K | ~55.50 | ~166.50 |
-
|
Estimated |
|
Cohere: Command R7B (12-2024)
cohere/command-r7b-12-2024 |
Openrouter | 7.0B | 128K | ~3.50 | ~10.50 |
-
|
Estimated |
|
Command
command |
Cohere | 34.0B | 4K | ~20.40 | ~61.20 |
V
T
I
A
J
|
Inferred |
|
Command A
command-a |
Cohere | 76.0B | 256K | ~45.60 | ~136.80 |
V
T
I
A
J
|
Inferred |
|
Command A 03 2025
command-a-03-2025 |
Cohere | 76.0B | 256K | ~45.60 | ~136.80 |
V
T
I
A
J
|
Inferred |
| command-a-reasoning-08-2025 | Cohere | 76.0B | 289K | ~45.60 | ~136.80 |
V
T
I
A
J
|
Inferred |
| command-a-vision-07-2025 | Cohere | 70.0B | 128K | ~42.00 | ~126.00 |
V
T
I
A
J
|
Inferred |
|
Command Light
command-light |
Cohere | 7.2B | 4K | ~4.32 | ~12.96 |
V
T
I
A
J
|
Inferred |
|
Command Light Nightly
command-light-nightly |
Cohere | 8.0B | 8K | ~4.80 | ~14.40 |
V
T
I
A
J
|
Inferred |
|
Command Nightly
command-nightly |
Cohere | 38.0B | 8K | ~22.80 | ~68.40 |
V
T
I
A
J
|
Inferred |
|
Command R
command-r |
Cohere | 35.0B | 128K | ~21.00 | ~63.00 |
V
T
I
A
J
|
Inferred |
|
Command R 03 2024
command-r-03-2024 |
Cohere | 35.0B | 128K | ~21.00 | ~63.00 |
V
T
I
A
J
|
Inferred |
| command-r-08-2024 | Cohere | 2.2B | 132K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
Command R Plus
command-r-plus |
Cohere | 120.0B | 128K | ~72.00 | ~216.00 |
V
T
I
A
J
|
Inferred |
|
Command R Plus 04 2024
command-r-plus-04-2024 |
Cohere | 220.0B | 128K | ~132.00 | ~396.00 |
V
T
I
A
J
|
Inferred |
|
Command R Plus 08 2024
command-r-plus-08-2024 |
Cohere | 120.0B | 128K | ~72.00 | ~216.00 |
V
T
I
A
J
|
Inferred |
| dall-e-2 | Openai | 3.5B | - | ~2.52 | ~7.56 |
V
T
I
A
J
|
Estimated |
| dall-e-3 | Openai | 12.0B | - | ~8.64 | ~25.92 |
V
T
I
A
J
|
Estimated |
|
dbrx-instruct
databricks/dbrx-instruct |
Together | 25.0B | 33K | ~15.00 | ~45.00 |
V
T
I
A
J
|
Inferred |
|
Cogito V2 Preview Llama 109B
deepcogito/cogito-v2-preview-llama-109b-moe |
Openrouter | 17.0B | 33K | ~10.20 | ~30.60 |
-
|
Estimated |
|
Deep Cogito: Cogito V2 Preview Llama 405B
deepcogito/cogito-v2-preview-llama-405b |
Openrouter | 405.0B | 33K | ~202.50 | ~607.50 |
-
|
Estimated |
|
Deep Cogito: Cogito V2 Preview Llama 70B
deepcogito/cogito-v2-preview-llama-70b |
Openrouter | 70.0B | 33K | ~35.00 | ~105.00 |
-
|
Estimated |
|
Deep Cogito: Cogito v2.1 671B
deepcogito/cogito-v2.1-671b |
Openrouter | 671.0B | 128K | ~201.30 | ~603.90 |
-
|
Estimated |
|
DeepSeek-R1
deepseek-ai/DeepSeek-R1 |
Together | 80.0B | 66K | ~28.80 | ~86.40 |
V
T
I
A
J
|
Inferred |
|
DeepSeek-R1-0528
deepseek-ai/DeepSeek-R1-0528 |
Together | 80.0B | 66K | ~28.80 | ~86.40 |
V
T
I
A
J
|
Inferred |
|
DeepSeek-R1-Distill-Llama-70B
deepseek-ai/DeepSeek-R1-Distill-Llama-70B |
Together | 6.1B | 66K | ~2.20 | ~6.59 |
V
T
I
A
J
|
Inferred |
|
DeepSeek-V3
deepseek-ai/DeepSeek-V3 |
Together | 671.0B | 66K | ~201.30 | ~603.90 |
V
T
I
A
J
|
Estimated |
|
DeepSeek-V3.1
deepseek-ai/DeepSeek-V3.1 |
Together | 34.0B | 66K | ~12.24 | ~36.72 |
V
T
I
A
J
|
Inferred |
|
DeepSeek Deepseek Chat
deepseek-chat |
Deepseek | 671.0B | 64K | ~201.30 | ~603.90 |
V
T
I
A
J
|
Estimated |
|
DeepSeek Deepseek Coder
deepseek-coder |
Deepseek | 3.8B | 128K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
| deepseek-coder:33b | Ollama | 33.0B | - | ~19.80 | ~59.40 |
-
|
Estimated |
| deepseek-coder:6.7b | Ollama | 6.7B | - | ~4.02 | ~12.06 |
-
|
Estimated |
|
DeepSeek Deepseek Lite
deepseek-lite |
Deepseek | 3.4B | 32K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
Deepseek R1 Distill Llama 70b
deepseek-r1-distill-llama-70b |
Groq | 14.0B | 33K | ~5.04 | ~15.12 |
V
T
I
A
J
|
Inferred |
|
DeepSeek Deepseek Reasoner
deepseek-reasoner |
Deepseek | 671.0B | 64K | ~201.30 | ~603.90 |
V
T
I
A
J
|
Estimated |
|
DeepSeek Deepseek V2
deepseek-v2 |
Deepseek | 22.0B | 64K | ~7.92 | ~23.76 |
V
T
I
A
J
|
Inferred |
|
DeepSeek Deepseek V2.5
deepseek-v2.5 |
Deepseek | 22.0B | 64K | ~7.92 | ~23.76 |
V
T
I
A
J
|
Inferred |
|
DeepSeek: DeepSeek V3
deepseek/deepseek-chat |
Openrouter | 671.0B | 164K | ~201.30 | ~603.90 |
-
|
Estimated |
|
DeepSeek: DeepSeek V3 0324
deepseek/deepseek-chat-v3-0324 |
Openrouter | 685.0B | 164K | ~342.50 | ~1,027.50 |
-
|
Estimated |
|
DeepSeek V3.1
deepseek/deepseek-chat-v3.1 |
Redpill | 48.0B | 164K | ~17.28 | ~51.84 |
V
T
I
A
J
|
Inferred |
|
DeepSeek: DeepSeek Prover V2
deepseek/deepseek-prover-v2 |
Openrouter | 671.0B | 164K | ~335.50 | ~1,006.50 |
-
|
Estimated |
|
DeepSeek: R1
deepseek/deepseek-r1 |
Openrouter | 671.0B | 164K | ~201.30 | ~603.90 |
-
|
Estimated |
|
DeepSeek: R1 0528
deepseek/deepseek-r1-0528 |
Redpill | 42.0B | 164K | ~15.12 | ~45.36 |
V
T
I
A
J
|
Inferred |
|
DeepSeek: DeepSeek R1 0528 Qwen3 8B
deepseek/deepseek-r1-0528-qwen3-8b |
Openrouter | 8.0B | 33K | ~4.00 | ~12.00 |
-
|
Estimated |
|
DeepSeek: R1 0528 (free)
deepseek/deepseek-r1-0528:free |
Openrouter | 671.0B | 164K | ~201.30 | ~603.90 |
-
|
Estimated |
|
DeepSeek: R1 Distill Llama 70B
deepseek/deepseek-r1-distill-llama-70b |
Openrouter | 70.0B | 131K | ~42.00 | ~126.00 |
-
|
Estimated |
|
DeepSeek: R1 Distill Qwen 14B
deepseek/deepseek-r1-distill-qwen-14b |
Openrouter | 14.0B | 33K | ~8.40 | ~25.20 |
-
|
Estimated |
|
DeepSeek: R1 Distill Qwen 32B
deepseek/deepseek-r1-distill-qwen-32b |
Openrouter | 32.0B | 64K | ~19.20 | ~57.60 |
-
|
Estimated |
|
DeepSeek: DeepSeek V3.1 Terminus
deepseek/deepseek-v3.1-terminus |
Openrouter | 671.0B | 164K | ~335.50 | ~1,006.50 |
-
|
Estimated |
|
DeepSeek: DeepSeek V3.1 Terminus (exacto)
deepseek/deepseek-v3.1-terminus:exacto |
Openrouter | 671.0B | 164K | ~335.50 | ~1,006.50 |
-
|
Estimated |
|
DeepSeek: DeepSeek V3.2
deepseek/deepseek-v3.2 |
Redpill | 5.3B | 164K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
Devstral 2
devstral-2 |
Mistral | 2.2B | 128K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
| devstral-2512 | Mistral | 2.2B | 128K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
devstral-2512
devstral-latest |
Mistral | 4.0B | 128K | ~2.40 | ~7.20 |
V
T
I
A
J
|
Inferred |
|
Devstral Medium
devstral-medium |
Mistral | 42.0B | 128K | ~25.20 | ~75.60 |
V
T
I
A
J
|
Inferred |
| devstral-medium-2507 | Mistral | 42.0B | 128K | ~25.20 | ~75.60 |
V
T
I
A
J
|
Inferred |
|
devstral-2512
devstral-medium-latest |
Mistral | 42.0B | 128K | ~25.20 | ~75.60 |
V
T
I
A
J
|
Inferred |
|
Devstral Small
devstral-small |
Mistral | 4.0B | 128K | ~2.40 | ~7.20 |
V
T
I
A
J
|
Inferred |
| devstral-small-2507 | Mistral | 4.0B | 128K | ~2.40 | ~7.20 |
V
T
I
A
J
|
Inferred |
|
labs-devstral-small-2512
devstral-small-latest |
Mistral | 4.0B | 128K | ~2.40 | ~7.20 |
V
T
I
A
J
|
Inferred |
|
Distil Whisper Large V3 En
distil-whisper-large-v3-en |
Groq | 2.0B | - | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
EleutherAI: Llemma 7b
eleutherai/llemma_7b |
Openrouter | 7.0B | 4K | ~3.50 | ~10.50 |
-
|
Estimated |
| embed-english-light-v3.0 | Cohere | 3.3B | 512 | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
| embed-english-v3.0 | Cohere | 3.3B | 512 | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
Embed Multilingual Light V3.0
embed-multilingual-light-v3.0 |
Cohere | 3.3B | 512 | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
Embed Multilingual V3.0
embed-multilingual-v3.0 |
Cohere | 3.3B | 512 | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
Embedding 001
embedding-001 |
1.8B | 2K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred | |
|
EssentialAI: Rnj 1 Instruct
essentialai/rnj-1-instruct |
Openrouter | 8.0B | 33K | ~4.00 | ~12.00 |
-
|
Estimated |
|
Gemini 1.0 Pro
gemini-1.0-pro |
32.0B | 32K | ~11.52 | ~34.56 |
V
T
I
A
J
|
Inferred | |
|
Gemini 1.5 Flash
gemini-1.5-flash |
4.4B | 1.0M | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred | |
|
Gemini 1.5 Flash 8b
gemini-1.5-flash-8b |
2.4B | 1.0M | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred | |
|
Gemini 1.5 Flash Latest
gemini-1.5-flash-latest |
4.4B | 1.0M | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred | |
|
Gemini 1.5 Pro
gemini-1.5-pro |
76.0B | 2.0M | ~27.36 | ~82.08 |
V
T
I
A
J
|
Inferred | |
|
Gemini 1.5 Pro Latest
gemini-1.5-pro-latest |
76.0B | 2.0M | ~27.36 | ~82.08 |
V
T
I
A
J
|
Inferred | |
|
Gemini 2.0 Flash
gemini-2.0-flash |
8.0B | 1.0M | ~4.80 | ~14.40 |
V
T
I
A
J
|
Estimated | |
|
Gemini 2.0 Flash 001
gemini-2.0-flash-001 |
8.0B | 1.0M | ~4.80 | ~14.40 |
V
T
I
A
J
|
Estimated | |
|
Gemini 2.0 Flash Exp
gemini-2.0-flash-exp |
50.0B | 1.0M | ~18.00 | ~54.00 |
V
T
I
A
J
|
Estimated | |
|
Gemini 2.0 Flash-Lite
gemini-2.0-flash-lite |
8.0B | 1.0M | ~4.80 | ~14.40 |
V
T
I
A
J
|
Estimated | |
|
Gemini 2.0 Flash-Lite 001
gemini-2.0-flash-lite-001 |
8.0B | 1.0M | ~4.80 | ~14.40 |
V
T
I
A
J
|
Estimated | |
|
Gemini 2.0 Flash Thinking Exp
gemini-2.0-flash-thinking-exp |
50.0B | - | ~18.00 | ~54.00 |
-
|
Estimated | |
|
Gemini 2.5 Flash
gemini-2.5-flash |
8.0B | 1.0M | ~4.80 | ~14.40 |
V
T
I
A
J
|
Estimated | |
|
Nano Banana
gemini-2.5-flash-image |
50.0B | 33K | ~18.00 | ~54.00 |
V
T
I
A
J
|
Inferred | |
|
Gemini 2.5 Flash-Lite
gemini-2.5-flash-lite |
8.0B | 1.0M | ~4.80 | ~14.40 |
V
T
I
A
J
|
Estimated | |
|
Gemini 2.5 Flash-Lite Preview Sep 2025
gemini-2.5-flash-lite-preview-09-2025 |
60.0B | 1.0M | ~21.60 | ~64.80 |
V
T
I
A
J
|
Inferred | |
|
Gemini 2.5 Flash Native Audio Latest
gemini-2.5-flash-native-audio-latest |
55.0B | 131K | ~19.80 | ~59.40 |
V
T
I
A
J
|
Inferred | |
|
Gemini 2.5 Flash Native Audio Preview 09-2025
gemini-2.5-flash-native-audio-preview-09-2025 |
55.0B | 131K | ~19.80 | ~59.40 |
V
T
I
A
J
|
Inferred | |
|
Gemini 2.5 Flash Native Audio Preview 12-2025
gemini-2.5-flash-native-audio-preview-12-2025 |
55.0B | 131K | ~19.80 | ~59.40 |
V
T
I
A
J
|
Inferred | |
|
Gemini 2.5 Flash Preview TTS
gemini-2.5-flash-preview-tts |
50.0B | 8K | ~18.00 | ~54.00 |
V
T
I
A
J
|
Inferred | |
|
Gemini 2.5 Pro
gemini-2.5-pro |
8.0B | 1.0M | ~4.80 | ~14.40 |
V
T
I
A
J
|
Estimated | |
|
Gemini 2.5 Pro Preview TTS
gemini-2.5-pro-preview-tts |
110.0B | 8K | ~39.60 | ~118.80 |
V
T
I
A
J
|
Inferred | |
|
Gemini 3 Flash Preview
gemini-3-flash-preview |
170.0B | 1.0M | ~61.20 | ~183.60 |
V
T
I
A
J
|
Inferred | |
|
Nano Banana Pro
gemini-3-pro-image-preview |
150.0B | 131K | ~54.00 | ~162.00 |
V
T
I
A
J
|
Inferred | |
|
Gemini 3 Pro Preview
gemini-3-pro-preview |
170.0B | 1.0M | ~61.20 | ~183.60 |
V
T
I
A
J
|
Inferred | |
|
Gemini Pro
gemini-pro |
32.0B | 32K | ~11.52 | ~34.56 |
V
T
I
A
J
|
Inferred | |
|
Gemini Pro Vision
gemini-pro-vision |
32.0B | 16K | ~11.52 | ~34.56 |
V
T
I
A
J
|
Inferred | |
|
Gemma 7b It
gemma-7b-it |
Groq | 2.0B | 8K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
Gemma2 9b It
gemma2-9b-it |
Groq | 9.0B | 8K | ~5.40 | ~16.20 |
V
T
I
A
J
|
Estimated |
| gemma2:27b | Ollama | 27.0B | - | ~16.20 | ~48.60 |
-
|
Estimated |
| gemma2:2b | Ollama | 2.0B | - | ~2.00 | ~6.00 |
-
|
Estimated |
| gemma2:9b | Ollama | 9.0B | - | ~5.40 | ~16.20 |
-
|
Estimated |
|
Google: Gemini 2.0 Flash Experimental (free)
google/gemini-2.0-flash-exp:free |
Openrouter | 50.0B | 1.0M | ~12.00 | ~36.00 |
-
|
Research |
|
Gemini 2.5 Flash
google/gemini-2.5-flash |
Perplexity | 50.0B | - | ~18.00 | ~54.00 |
-
|
Estimated |
|
Google: Gemini 2.5 Flash Lite
google/gemini-2.5-flash-lite |
Redpill | 20.0B | 1.0M | ~7.20 | ~21.60 |
V
T
I
A
J
|
Estimated |
|
Google: Gemini 2.5 Pro
google/gemini-2.5-pro |
Redpill | 240.0B | 1.0M | ~86.40 | ~259.20 |
V
T
I
A
J
|
Inferred |
|
Google: Gemini 3 Pro Preview
google/gemini-3-pro-preview |
Redpill | 290.0B | 1.0M | ~104.40 | ~313.20 |
V
T
I
A
J
|
Inferred |
|
Google: Gemma 2 27B
google/gemma-2-27b-it |
Openrouter | 27.0B | 8K | ~16.20 | ~48.60 |
-
|
Estimated |
|
Google: Gemma 2 9B
google/gemma-2-9b-it |
Openrouter | 9.0B | 8K | ~5.40 | ~16.20 |
-
|
Estimated |
|
Google: Gemma 3 12B
google/gemma-3-12b-it |
Openrouter | 12.0B | 131K | ~7.20 | ~21.60 |
-
|
Estimated |
|
Google: Gemma 3 12B (free)
google/gemma-3-12b-it:free |
Openrouter | 12.0B | 33K | ~7.20 | ~21.60 |
-
|
Estimated |
|
Google: Gemma 3 27B
google/gemma-3-27b-it |
Redpill | 4.8B | 54K | ~2.88 | ~8.64 |
V
T
I
A
J
|
Inferred |
|
Google: Gemma 3 27B (free)
google/gemma-3-27b-it:free |
Openrouter | 27.0B | 131K | ~16.20 | ~48.60 |
-
|
Estimated |
|
Google: Gemma 3 4B
google/gemma-3-4b-it |
Openrouter | 4.0B | 96K | ~2.40 | ~7.20 |
-
|
Estimated |
|
Google: Gemma 3 4B (free)
google/gemma-3-4b-it:free |
Openrouter | 4.0B | 33K | ~2.40 | ~7.20 |
-
|
Estimated |
|
Google: Gemma 3n 2B (free)
google/gemma-3n-e2b-it:free |
Openrouter | 2.0B | 8K | ~2.00 | ~6.00 |
-
|
Estimated |
|
Google: Gemma 3n 4B
google/gemma-3n-e4b-it |
Openrouter | 4.0B | 33K | ~2.00 | ~6.00 |
-
|
Estimated |
|
Google: Gemma 3n 4B (free)
google/gemma-3n-e4b-it:free |
Openrouter | 4.0B | 8K | ~2.00 | ~6.00 |
-
|
Estimated |
| gpt-3.5-turbo | Openai | 20.0B | 16K | ~12.00 | ~36.00 |
V
T
I
A
J
|
Estimated |
| gpt-3.5-turbo-0125 | Openai | 20.0B | 16K | ~12.00 | ~36.00 |
V
T
I
A
J
|
Estimated |
| gpt-3.5-turbo-1106 | Openai | 20.0B | 16K | ~12.00 | ~36.00 |
V
T
I
A
J
|
Estimated |
| gpt-3.5-turbo-16k | Openai | 20.0B | 16K | ~12.00 | ~36.00 |
V
T
I
A
J
|
Estimated |
| gpt-3.5-turbo-instruct | Openai | 20.0B | 16K | ~12.00 | ~36.00 |
V
T
I
A
J
|
Estimated |
| gpt-3.5-turbo-instruct-0914 | Openai | 20.0B | 16K | ~12.00 | ~36.00 |
V
T
I
A
J
|
Estimated |
| gpt-4 | Openai | 200.0B | 8K | ~120.00 | ~360.00 |
V
T
I
A
J
|
Estimated |
| gpt-4-0125-preview | Openai | 200.0B | 8K | ~120.00 | ~360.00 |
V
T
I
A
J
|
Estimated |
| gpt-4-0613 | Openai | 0.0B | 8K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Estimated |
| gpt-4-1106-preview | Openai | 600.0B | 8K | ~360.00 | ~1,080.00 |
V
T
I
A
J
|
Inferred |
|
Gpt 4 32k
gpt-4-32k |
Openai | 220.0B | - | ~66.00 | ~198.00 |
-
|
Estimated |
|
Gpt 4 32k 0613
gpt-4-32k-0613 |
Openai | 220.0B | - | ~66.00 | ~198.00 |
-
|
Estimated |
| gpt-4-turbo | Openai | 200.0B | 128K | ~144.00 | ~432.00 |
V
T
I
A
J
|
Estimated |
| gpt-4-turbo-2024-04-09 | Openai | 200.0B | 128K | ~144.00 | ~432.00 |
V
T
I
A
J
|
Estimated |
| gpt-4-turbo-preview | Openai | 200.0B | 128K | ~144.00 | ~432.00 |
V
T
I
A
J
|
Estimated |
| gpt-4.1 | Openai | 110.0B | 1.0M | ~66.00 | ~198.00 |
V
T
I
A
J
|
Inferred |
| gpt-4.1-2025-04-14 | Openai | 110.0B | 1.0M | ~66.00 | ~198.00 |
V
T
I
A
J
|
Inferred |
| gpt-4.1-mini | Openai | 8.0B | 1.0M | ~4.80 | ~14.40 |
V
T
I
A
J
|
Estimated |
| gpt-4.1-mini-2025-04-14 | Openai | 8.0B | 1.0M | ~4.80 | ~14.40 |
V
T
I
A
J
|
Estimated |
| gpt-4.1-nano | Openai | 4.0B | 200K | ~2.40 | ~7.20 |
V
T
I
A
J
|
Estimated |
| gpt-4.1-nano-2025-04-14 | Openai | 4.0B | 1.0M | ~2.40 | ~7.20 |
V
T
I
A
J
|
Estimated |
| gpt-4o | Openai | 200.0B | 128K | ~144.00 | ~432.00 |
V
T
I
A
J
|
Estimated |
| gpt-4o-2024-05-13 | Openai | 200.0B | 128K | ~144.00 | ~432.00 |
V
T
I
A
J
|
Estimated |
| gpt-4o-2024-08-06 | Openai | 200.0B | 128K | ~144.00 | ~432.00 |
V
T
I
A
J
|
Estimated |
| gpt-4o-2024-11-20 | Openai | 200.0B | 128K | ~144.00 | ~432.00 |
V
T
I
A
J
|
Estimated |
| gpt-4o-audio-preview | Openai | 120.0B | 128K | ~72.00 | ~216.00 |
V
T
I
A
J
|
Inferred |
| gpt-4o-audio-preview-2024-12-17 | Openai | 120.0B | 128K | ~72.00 | ~216.00 |
V
T
I
A
J
|
Inferred |
| gpt-4o-audio-preview-2025-06-03 | Openai | 120.0B | 128K | ~72.00 | ~216.00 |
V
T
I
A
J
|
Inferred |
| gpt-4o-mini | Openai | 8.0B | 128K | ~5.76 | ~17.28 |
V
T
I
A
J
|
Estimated |
| gpt-4o-mini-2024-07-18 | Openai | 8.0B | 128K | ~5.76 | ~17.28 |
V
T
I
A
J
|
Estimated |
| gpt-4o-mini-audio-preview | Openai | 8.0B | 128K | ~4.80 | ~14.40 |
V
T
I
A
J
|
Estimated |
| gpt-4o-mini-audio-preview-2024-12-17 | Openai | 8.0B | 128K | ~4.80 | ~14.40 |
V
T
I
A
J
|
Estimated |
| gpt-4o-mini-realtime-preview | Openai | 8.0B | 128K | ~4.80 | ~14.40 |
V
T
I
A
J
|
Estimated |
| gpt-4o-mini-realtime-preview-2024-12-17 | Openai | 8.0B | 128K | ~4.80 | ~14.40 |
V
T
I
A
J
|
Estimated |
| gpt-4o-mini-search-preview | Openai | 8.0B | 128K | ~4.80 | ~14.40 |
V
T
I
A
J
|
Estimated |
| gpt-4o-mini-search-preview-2025-03-11 | Openai | 8.0B | 128K | ~4.80 | ~14.40 |
V
T
I
A
J
|
Estimated |
| gpt-4o-mini-transcribe | Openai | 8.0B | 128K | ~4.80 | ~14.40 |
V
T
I
A
J
|
Estimated |
| gpt-4o-mini-transcribe-2025-03-20 | Openai | 8.0B | 128K | ~4.80 | ~14.40 |
V
T
I
A
J
|
Estimated |
| gpt-4o-mini-transcribe-2025-12-15 | Openai | 8.0B | 128K | ~4.80 | ~14.40 |
V
T
I
A
J
|
Estimated |
| gpt-4o-mini-tts | Openai | 8.0B | 128K | ~4.80 | ~14.40 |
V
T
I
A
J
|
Estimated |
| gpt-4o-mini-tts-2025-03-20 | Openai | 8.0B | 128K | ~4.80 | ~14.40 |
V
T
I
A
J
|
Estimated |
| gpt-4o-mini-tts-2025-12-15 | Openai | 8.0B | 128K | ~4.80 | ~14.40 |
V
T
I
A
J
|
Estimated |
| gpt-4o-realtime-preview | Openai | 200.0B | 128K | ~144.00 | ~432.00 |
V
T
I
A
J
|
Estimated |
| gpt-4o-realtime-preview-2024-12-17 | Openai | 200.0B | 128K | ~144.00 | ~432.00 |
V
T
I
A
J
|
Estimated |
| gpt-4o-realtime-preview-2025-06-03 | Openai | 200.0B | 128K | ~144.00 | ~432.00 |
V
T
I
A
J
|
Estimated |
| gpt-4o-search-preview | Openai | 120.0B | 128K | ~72.00 | ~216.00 |
V
T
I
A
J
|
Inferred |
| gpt-4o-search-preview-2025-03-11 | Openai | 120.0B | 128K | ~72.00 | ~216.00 |
V
T
I
A
J
|
Inferred |
| gpt-4o-transcribe | Openai | 200.0B | 128K | ~144.00 | ~432.00 |
V
T
I
A
J
|
Estimated |
| gpt-4o-transcribe-diarize | Openai | 200.0B | 128K | ~144.00 | ~432.00 |
V
T
I
A
J
|
Estimated |
| gpt-5 | Openai | 0.0B | 200K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Estimated |
| gpt-5-2025-08-07 | Openai | 0.0B | 200K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Estimated |
| gpt-5-chat-latest | Openai | 240.0B | 200K | ~144.00 | ~432.00 |
V
T
I
A
J
|
Inferred |
| gpt-5-codex | Openai | 240.0B | 200K | ~144.00 | ~432.00 |
V
T
I
A
J
|
Inferred |
| gpt-5-mini | Openai | 8.0B | 200K | ~4.80 | ~14.40 |
V
T
I
A
J
|
Estimated |
| gpt-5-mini-2025-08-07 | Openai | 8.0B | 200K | ~4.80 | ~14.40 |
V
T
I
A
J
|
Estimated |
| gpt-5-nano | Openai | 4.0B | 200K | ~2.40 | ~7.20 |
V
T
I
A
J
|
Estimated |
| gpt-5-nano-2025-08-07 | Openai | 4.0B | 200K | ~2.40 | ~7.20 |
V
T
I
A
J
|
Estimated |
| gpt-5-pro | Openai | 50.0B | 200K | ~30.00 | ~90.00 |
V
T
I
A
J
|
Estimated |
| gpt-5-pro-2025-10-06 | Openai | 50.0B | 200K | ~30.00 | ~90.00 |
V
T
I
A
J
|
Estimated |
| gpt-5-search-api | Openai | 300.0B | 200K | ~180.00 | ~540.00 |
V
T
I
A
J
|
Estimated |
| gpt-5-search-api-2025-10-14 | Openai | 300.0B | 200K | ~180.00 | ~540.00 |
V
T
I
A
J
|
Estimated |
| gpt-5.1 | Openai | 240.0B | 200K | ~144.00 | ~432.00 |
V
T
I
A
J
|
Inferred |
| gpt-5.1-2025-11-13 | Openai | 240.0B | 200K | ~144.00 | ~432.00 |
V
T
I
A
J
|
Inferred |
| gpt-5.1-chat-latest | Openai | 240.0B | 200K | ~144.00 | ~432.00 |
V
T
I
A
J
|
Inferred |
| gpt-5.1-codex | Openai | 240.0B | 200K | ~144.00 | ~432.00 |
V
T
I
A
J
|
Inferred |
| gpt-5.1-codex-max | Openai | 300.0B | 200K | ~180.00 | ~540.00 |
V
T
I
A
J
|
Estimated |
| gpt-5.1-codex-mini | Openai | 8.0B | 200K | ~4.80 | ~14.40 |
V
T
I
A
J
|
Estimated |
| gpt-5.2 | Openai | 8.0B | 200K | ~4.80 | ~14.40 |
V
T
I
A
J
|
Estimated |
| gpt-5.2-2025-12-11 | Openai | 240.0B | 200K | ~144.00 | ~432.00 |
V
T
I
A
J
|
Inferred |
| gpt-5.2-chat-latest | Openai | 240.0B | 200K | ~144.00 | ~432.00 |
V
T
I
A
J
|
Inferred |
| gpt-5.2-codex | Openai | 240.0B | 200K | ~144.00 | ~432.00 |
V
T
I
A
J
|
Inferred |
| gpt-5.2-pro | Openai | 50.0B | 200K | ~30.00 | ~90.00 |
V
T
I
A
J
|
Estimated |
| gpt-5.2-pro-2025-12-11 | Openai | 50.0B | 200K | ~30.00 | ~90.00 |
V
T
I
A
J
|
Estimated |
| gpt-audio | Openai | 200.0B | - | ~144.00 | ~432.00 |
-
|
Estimated |
| gpt-audio-2025-08-28 | Openai | 200.0B | - | ~144.00 | ~432.00 |
-
|
Estimated |
| gpt-audio-mini | Openai | 8.0B | - | ~4.80 | ~14.40 |
-
|
Estimated |
| gpt-audio-mini-2025-10-06 | Openai | 8.0B | - | ~4.80 | ~14.40 |
-
|
Estimated |
| gpt-audio-mini-2025-12-15 | Openai | 8.0B | - | ~4.80 | ~14.40 |
-
|
Estimated |
| gpt-image-1 | Openai | 12.0B | - | ~8.64 | ~25.92 |
-
|
Estimated |
| gpt-image-1-mini | Openai | 8.0B | - | ~5.76 | ~17.28 |
-
|
Estimated |
|
gpt-oss-120B
gpt-oss/gpt-oss-120B |
Together | 8.8B | 128K | ~5.28 | ~15.84 |
V
T
I
A
J
|
Inferred |
| gpt-realtime | Openai | 200.0B | - | ~144.00 | ~432.00 |
-
|
Estimated |
| gpt-realtime-2025-08-28 | Openai | 200.0B | - | ~144.00 | ~432.00 |
-
|
Estimated |
| gpt-realtime-mini | Openai | 8.0B | - | ~4.80 | ~14.40 |
-
|
Estimated |
| gpt-realtime-mini-2025-10-06 | Openai | 8.0B | - | ~4.80 | ~14.40 |
-
|
Estimated |
| gpt-realtime-mini-2025-12-15 | Openai | 8.0B | - | ~4.80 | ~14.40 |
-
|
Estimated |
|
Grok 2
grok-2 |
Grok | 314.0B | - | ~94.20 | ~282.60 |
-
|
Estimated |
|
Grok 2 1212
grok-2-1212 |
Xai | 314.0B | 131K | ~94.20 | ~282.60 |
V
T
I
A
J
|
Estimated |
| grok-2-image-1212 | Grok | 314.0B | - | ~113.04 | ~339.12 |
-
|
Estimated |
|
Grok 2 Latest
grok-2-latest |
Xai | 120.0B | 131K | ~43.20 | ~129.60 |
V
T
I
A
J
|
Inferred |
|
Grok 2 Mini
grok-2-mini |
Grok | 70.0B | - | ~21.00 | ~63.00 |
-
|
Estimated |
|
Grok 2 Vision 1212
grok-2-vision-1212 |
Xai | 314.0B | 33K | ~113.04 | ~339.12 |
V
T
I
A
J
|
Estimated |
|
Grok 3
grok-3 |
Xai | 220.0B | 131K | ~79.20 | ~237.60 |
V
T
I
A
J
|
Inferred |
|
Grok 3 Fast
grok-3-fast |
Xai | 48.0B | 131K | ~17.28 | ~51.84 |
V
T
I
A
J
|
Inferred |
|
Grok 3 Mini
grok-3-mini |
Xai | 100.0B | 131K | ~30.00 | ~90.00 |
V
T
I
A
J
|
Estimated |
|
Grok 4
grok-4 |
Xai | 220.0B | 131K | ~79.20 | ~237.60 |
V
T
I
A
J
|
Inferred |
|
Grok 4 0709
grok-4-0709 |
Xai | 220.0B | 131K | ~79.20 | ~237.60 |
V
T
I
A
J
|
Inferred |
|
Grok 4 1 Fast Non Reasoning
grok-4-1-fast-non-reasoning |
Xai | 400.0B | 131K | ~120.00 | ~360.00 |
V
T
I
A
J
|
Estimated |
|
Grok 4 1 Fast Reasoning
grok-4-1-fast-reasoning |
Xai | 400.0B | 131K | ~120.00 | ~360.00 |
V
T
I
A
J
|
Estimated |
|
Grok 4 Fast Non Reasoning
grok-4-fast-non-reasoning |
Xai | 400.0B | 131K | ~120.00 | ~360.00 |
V
T
I
A
J
|
Estimated |
|
Grok 4 Fast Reasoning
grok-4-fast-reasoning |
Xai | 400.0B | 131K | ~120.00 | ~360.00 |
V
T
I
A
J
|
Estimated |
|
Grok 4.1
grok-4.1 |
Xai | 140.0B | 2.0M | ~50.40 | ~151.20 |
V
T
I
A
J
|
Inferred |
|
Grok 4.1 Fast
grok-4.1-fast |
Xai | 7.1B | 2.0M | ~2.56 | ~7.67 |
V
T
I
A
J
|
Inferred |
|
Grok 4.1 Mini
grok-4.1-mini |
Xai | 4.4B | 2.0M | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
Grok Beta
grok-beta |
Grok | 314.0B | - | ~94.20 | ~282.60 |
-
|
Estimated |
|
Grok Vision Beta
grok-vision-beta |
Xai | 200.0B | 8K | ~72.00 | ~216.00 |
V
T
I
A
J
|
Inferred |
|
Groq/compound
groq/compound |
Groq | 8.8B | 131K | ~5.28 | ~15.84 |
V
T
I
A
J
|
Inferred |
|
Groq/compound Mini
groq/compound-mini |
Groq | 8.8B | 131K | ~5.28 | ~15.84 |
V
T
I
A
J
|
Inferred |
|
MythoMax 13B
gryphe/mythomax-l2-13b |
Openrouter | 13.0B | 4K | ~6.50 | ~19.50 |
-
|
Estimated |
|
IBM: Granite 4.0 Micro
ibm-granite/granite-4.0-h-micro |
Openrouter | 3.0B | 131K | ~2.00 | ~6.00 |
-
|
Estimated |
|
Inception: Mercury 2
inception/mercury-2 |
Openrouter | 0.0B | 128K | ~2.00 | ~6.00 |
-
|
Estimated |
|
J2 Light
j2-light |
Ai21 | 50.0B | 8K | ~30.00 | ~90.00 |
V
T
I
A
J
|
Inferred |
|
J2 Mid
j2-mid |
Ai21 | 110.0B | 8K | ~66.00 | ~198.00 |
V
T
I
A
J
|
Inferred |
|
J2 Ultra
j2-ultra |
Ai21 | 200.0B | 8K | ~120.00 | ~360.00 |
V
T
I
A
J
|
Inferred |
|
Jamba 1.5 Large
jamba-1.5-large |
Ai21 | 110.0B | 256K | ~66.00 | ~198.00 |
V
T
I
A
J
|
Inferred |
|
Jamba 1.5 Mini
jamba-1.5-mini |
Ai21 | 5.7B | 256K | ~3.42 | ~10.26 |
V
T
I
A
J
|
Inferred |
|
Jamba Instruct
jamba-instruct |
Ai21 | 13.0B | 256K | ~7.80 | ~23.40 |
V
T
I
A
J
|
Inferred |
|
Jamba Instruct Preview
jamba-instruct-preview |
Ai21 | 13.0B | 256K | ~7.80 | ~23.40 |
V
T
I
A
J
|
Inferred |
|
Kimi K2
kimi-k2 |
Moonshot | 48.0B | 131K | ~28.80 | ~86.40 |
V
T
I
A
J
|
Inferred |
|
Kimi K2 0711
kimi-k2-0711 |
Moonshot | 47.0B | 131K | ~28.20 | ~84.60 |
V
T
I
A
J
|
Inferred |
|
Kimi K2 0711 Preview
kimi-k2-0711-preview |
Moonshot | 47.0B | 131K | ~28.20 | ~84.60 |
V
T
I
A
J
|
Inferred |
|
Kimi K2 0905
kimi-k2-0905 |
Moonshot | 44.0B | 262K | ~26.40 | ~79.20 |
V
T
I
A
J
|
Inferred |
|
Kimi K2 0905 Preview
kimi-k2-0905-preview |
Moonshot | 44.0B | 262K | ~26.40 | ~79.20 |
V
T
I
A
J
|
Inferred |
|
Kimi K2 Exacto
kimi-k2-exacto |
Moonshot | 44.0B | 262K | ~26.40 | ~79.20 |
V
T
I
A
J
|
Inferred |
|
Kimi K2 Thinking
kimi-k2-thinking |
Moonshot | 53.0B | 262K | ~31.80 | ~95.40 |
V
T
I
A
J
|
Inferred |
|
Kimi K2 Thinking Turbo
kimi-k2-thinking-turbo |
Moonshot | 53.0B | 262K | ~31.80 | ~95.40 |
V
T
I
A
J
|
Inferred |
|
Kimi K2 Turbo
kimi-k2-turbo |
Moonshot | 99.0B | 131K | ~59.40 | ~178.20 |
V
T
I
A
J
|
Inferred |
|
Kimi K2 Turbo Preview
kimi-k2-turbo-preview |
Moonshot | 110.0B | 262K | ~66.00 | ~198.00 |
V
T
I
A
J
|
Inferred |
|
Kimi K2.5
kimi-k2.5 |
Moonshot | 57.0B | 262K | ~34.20 | ~102.60 |
V
T
I
A
J
|
Inferred |
|
Kimi Latest
kimi-latest |
Moonshot | 52.0B | 131K | ~31.20 | ~93.60 |
V
T
I
A
J
|
Inferred |
|
Kimi Long
kimi-long |
Moonshot | 170.0B | 1.0M | ~102.00 | ~306.00 |
V
T
I
A
J
|
Inferred |
|
LiquidAI: LFM2-24B-A2B
liquid/lfm-2-24b-a2b |
Openrouter | 24.0B | 33K | ~12.00 | ~36.00 |
-
|
Estimated |
|
LiquidAI/LFM2-2.6B
liquid/lfm-2.2-6b |
Openrouter | 2.6B | 33K | ~2.00 | ~6.00 |
-
|
Estimated |
|
LiquidAI: LFM2.5-1.2B-Instruct (free)
liquid/lfm-2.5-1.2b-instruct:free |
Openrouter | 1.2B | 33K | ~2.00 | ~6.00 |
-
|
Estimated |
|
LiquidAI: LFM2.5-1.2B-Thinking (free)
liquid/lfm-2.5-1.2b-thinking:free |
Openrouter | 1.2B | 33K | ~2.00 | ~6.00 |
-
|
Estimated |
|
LiquidAI/LFM2-8B-A1B
liquid/lfm2-8b-a1b |
Openrouter | 8.0B | 33K | ~4.00 | ~12.00 |
-
|
Estimated |
|
Llama 3.1 405b Reasoning
llama-3.1-405b-reasoning |
Groq | 20.0B | 33K | ~12.00 | ~36.00 |
V
T
I
A
J
|
Inferred |
|
Llama 3.1 70b Versatile
llama-3.1-70b-versatile |
Groq | 70.0B | 131K | ~42.00 | ~126.00 |
V
T
I
A
J
|
Estimated |
|
Llama 3.1 8b Instant
llama-3.1-8b-instant |
Groq | 8.0B | 131K | ~4.80 | ~14.40 |
V
T
I
A
J
|
Estimated |
|
Llama 3.1 Sonar Huge 128k Online
llama-3.1-sonar-huge-128k-online |
Perplexity | 70.0B | 128K | ~42.00 | ~126.00 |
V
T
I
A
J
|
Inferred |
|
Llama 3.1 Sonar Large 128k Chat
llama-3.1-sonar-large-128k-chat |
Perplexity | 70.0B | - | ~42.00 | ~126.00 |
-
|
Estimated |
|
Llama 3.1 Sonar Large 128k Online
llama-3.1-sonar-large-128k-online |
Perplexity | 22.0B | 128K | ~13.20 | ~39.60 |
V
T
I
A
J
|
Inferred |
|
Llama 3.1 Sonar Small 128k Chat
llama-3.1-sonar-small-128k-chat |
Perplexity | 8.0B | - | ~4.80 | ~14.40 |
-
|
Estimated |
|
Llama 3.1 Sonar Small 128k Online
llama-3.1-sonar-small-128k-online |
Perplexity | 2.2B | 128K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
Llama 3.2 11b Vision Preview
llama-3.2-11b-vision-preview |
Groq | 2.0B | 8K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
Llama 3.2 1b Preview
llama-3.2-1b-preview |
Groq | 2.0B | 8K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
Llama 3.2 3b Preview
llama-3.2-3b-preview |
Groq | 2.0B | 8K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
Llama 3.2 90b Vision Preview
llama-3.2-90b-vision-preview |
Groq | 17.0B | 8K | ~10.20 | ~30.60 |
V
T
I
A
J
|
Inferred |
|
Llama 3.3 70b Specdec
llama-3.3-70b-specdec |
Groq | 70.0B | - | ~42.00 | ~126.00 |
-
|
Estimated |
|
Llama 3.3 70b Versatile
llama-3.3-70b-versatile |
Groq | 70.0B | 131K | ~42.00 | ~126.00 |
V
T
I
A
J
|
Estimated |
|
Llama3 70b 8192
llama3-70b-8192 |
Groq | 70.0B | 8K | ~42.00 | ~126.00 |
V
T
I
A
J
|
Estimated |
|
Llama3 8b 8192
llama3-8b-8192 |
Groq | 8.0B | 8K | ~4.80 | ~14.40 |
V
T
I
A
J
|
Estimated |
|
Llama3 1 8b
llama3.1-8b |
Cerebras | 2.2B | 128K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
| llama3.1:70b | Ollama | 70.0B | - | ~42.00 | ~126.00 |
-
|
Estimated |
| llama3.1:8b | Ollama | 8.0B | - | ~4.80 | ~14.40 |
-
|
Estimated |
| llama3.2:1b | Ollama | 1.0B | - | ~2.00 | ~6.00 |
-
|
Estimated |
| llama3.2:3b | Ollama | 3.0B | - | ~2.00 | ~6.00 |
-
|
Estimated |
| llama3:70b | Ollama | 70.0B | - | ~42.00 | ~126.00 |
-
|
Estimated |
| llama3:8b | Ollama | 8.0B | - | ~4.80 | ~14.40 |
-
|
Estimated |
| llava:13b | Ollama | 13.0B | - | ~9.36 | ~28.08 |
-
|
Estimated |
| llava:34b | Ollama | 34.0B | - | ~24.48 | ~73.44 |
-
|
Estimated |
| llava:7b | Ollama | 7.0B | - | ~5.04 | ~15.12 |
-
|
Estimated |
|
Magistral Medium 2506
magistral-medium-2506 |
Mistral | 63.0B | 40K | ~37.80 | ~113.40 |
V
T
I
A
J
|
Inferred |
|
Magistral Medium 2507
magistral-medium-2507 |
Mistral | 63.0B | 40K | ~37.80 | ~113.40 |
V
T
I
A
J
|
Inferred |
| magistral-medium-2509 | Mistral | 63.0B | 40K | ~37.80 | ~113.40 |
V
T
I
A
J
|
Inferred |
|
magistral-medium-2509
magistral-medium-latest |
Mistral | 63.0B | 40K | ~37.80 | ~113.40 |
V
T
I
A
J
|
Inferred |
|
Magistral Small 2507
magistral-small-2507 |
Mistral | 32.0B | 40K | ~19.20 | ~57.60 |
V
T
I
A
J
|
Inferred |
| magistral-small-2509 | Mistral | 32.0B | 40K | ~19.20 | ~57.60 |
V
T
I
A
J
|
Inferred |
|
magistral-small-2509
magistral-small-latest |
Mistral | 32.0B | 40K | ~19.20 | ~57.60 |
V
T
I
A
J
|
Inferred |
|
Meituan: LongCat Flash Chat
meituan/longcat-flash-chat |
Openrouter | 560.0B | 131K | ~168.00 | ~504.00 |
-
|
Estimated |
|
Meituan: LongCat Flash Chat (free)
meituan/longcat-flash-chat:free |
Openrouter | 560.0B | 131K | ~168.00 | ~504.00 |
-
|
Estimated |
|
Llama-3-8B-Instruct-Lite
meta-llama/Llama-3-8B-Instruct-Lite |
Together | 2.0B | 8K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
Llama-3.2-11B-Vision-Instruct-Turbo
meta-llama/Llama-3.2-11B-Vision-Instruct-Turbo |
Together | 2.2B | 131K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
Llama-3.2-3B-Instruct-Turbo
meta-llama/Llama-3.2-3B-Instruct-Turbo |
Together | 2.2B | 131K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
Llama-3.2-90B-Vision-Instruct-Turbo
meta-llama/Llama-3.2-90B-Vision-Instruct-Turbo |
Together | 27.0B | 131K | ~16.20 | ~48.60 |
V
T
I
A
J
|
Inferred |
|
Llama-3.3-70B-Instruct-Turbo
meta-llama/Llama-3.3-70B-Instruct-Turbo |
Together | 70.0B | 131K | ~42.00 | ~126.00 |
V
T
I
A
J
|
Estimated |
|
Meta-Llama-3.1-405B-Instruct-Turbo
meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo |
Together | 405.0B | 131K | ~243.00 | ~729.00 |
V
T
I
A
J
|
Estimated |
|
Meta-Llama-3.1-70B-Instruct-Turbo
meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo |
Together | 70.0B | 131K | ~42.00 | ~126.00 |
V
T
I
A
J
|
Estimated |
|
Meta-Llama-3.1-8B-Instruct-Turbo
meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo |
Together | 8.0B | 131K | ~4.80 | ~14.40 |
V
T
I
A
J
|
Estimated |
|
Meta: Llama 3 70B Instruct
meta-llama/llama-3-70b-instruct |
Openrouter | 70.0B | 8K | ~28.00 | ~84.00 |
-
|
Research |
|
Meta: Llama 3 8B Instruct
meta-llama/llama-3-8b-instruct |
Openrouter | 8.0B | 8K | ~3.20 | ~9.60 |
-
|
Research |
|
Meta: Llama 3.1 405B (base)
meta-llama/llama-3.1-405b |
Openrouter | 405.0B | 33K | ~202.50 | ~607.50 |
-
|
Estimated |
|
Meta: Llama 3.1 405B Instruct
meta-llama/llama-3.1-405b-instruct |
Openrouter | 405.0B | 131K | ~162.00 | ~486.00 |
-
|
Research |
|
Meta: Llama 3.1 405B Instruct (free)
meta-llama/llama-3.1-405b-instruct:free |
Openrouter | 400.0B | 131K | ~200.00 | ~600.00 |
-
|
Estimated |
|
Meta: Llama 3.1 70B Instruct
meta-llama/llama-3.1-70b-instruct |
Openrouter | 70.0B | 131K | ~28.00 | ~84.00 |
-
|
Research |
|
Meta: Llama 3.1 8B Instruct
meta-llama/llama-3.1-8b-instruct |
Openrouter | 8.0B | 131K | ~3.20 | ~9.60 |
-
|
Research |
|
Meta: Llama 3.2 11B Vision Instruct
meta-llama/llama-3.2-11b-vision-instruct |
Openrouter | 11.0B | 131K | ~7.92 | ~23.76 |
-
|
Estimated |
|
Meta: Llama 3.2 1B Instruct
meta-llama/llama-3.2-1b-instruct |
Openrouter | 1.0B | 60K | ~2.00 | ~6.00 |
-
|
Estimated |
|
Meta: Llama 3.2 3B Instruct
meta-llama/llama-3.2-3b-instruct |
Openrouter | 3.0B | 131K | ~2.00 | ~6.00 |
-
|
Estimated |
|
Meta: Llama 3.2 3B Instruct (free)
meta-llama/llama-3.2-3b-instruct:free |
Openrouter | 3.0B | 131K | ~2.00 | ~6.00 |
-
|
Estimated |
|
Meta: Llama 3.2 90B Vision Instruct
meta-llama/llama-3.2-90b-vision-instruct |
Openrouter | 90.0B | 33K | ~64.80 | ~194.40 |
-
|
Estimated |
|
Meta: Llama 3.3 70B Instruct
meta-llama/llama-3.3-70b-instruct |
Redpill | 70.0B | 131K | ~28.00 | ~84.00 |
V
T
I
A
J
|
Research |
|
Meta: Llama 3.3 70B Instruct (free)
meta-llama/llama-3.3-70b-instruct:free |
Openrouter | 70.0B | 131K | ~35.00 | ~105.00 |
-
|
Estimated |
|
Meta: Llama 4 Maverick
meta-llama/llama-4-maverick |
Openrouter | 17.0B | 1.0M | ~6.12 | ~18.36 |
-
|
Estimated |
|
Meta: Llama 4 Scout
meta-llama/llama-4-scout |
Openrouter | 17.0B | 328K | ~6.12 | ~18.36 |
-
|
Estimated |
|
Meta Llama/llama 4 Scout 17b 16e Instruct
meta-llama/llama-4-scout-17b-16e-instruct |
Groq | 4.5B | 131K | ~2.70 | ~8.10 |
V
T
I
A
J
|
Inferred |
|
Meta: LlamaGuard 2 8B
meta-llama/llama-guard-2-8b |
Openrouter | 8.0B | 8K | ~4.00 | ~12.00 |
-
|
Estimated |
|
Llama Guard 3 8B
meta-llama/llama-guard-3-8b |
Openrouter | 8.0B | 131K | ~4.00 | ~12.00 |
-
|
Estimated |
|
Meta: Llama Guard 4 12B
meta-llama/llama-guard-4-12b |
Openrouter | 12.0B | 164K | ~7.20 | ~21.60 |
-
|
Estimated |
|
Meta Llama/llama Prompt Guard 2 22m
meta-llama/llama-prompt-guard-2-22m |
Groq | 1.8B | 512 | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
Meta Llama/llama Prompt Guard 2 86m
meta-llama/llama-prompt-guard-2-86m |
Groq | 1.8B | 512 | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
Llama 2 13b Chat
meta/llama-2-13b-chat |
Replicate | 5.3B | 4K | ~3.18 | ~9.54 |
V
T
I
A
J
|
Inferred |
|
Llama 2 70b Chat
meta/llama-2-70b-chat |
Replicate | 42.0B | 4K | ~25.20 | ~75.60 |
V
T
I
A
J
|
Inferred |
|
Llama 2 7b Chat
meta/llama-2-7b-chat |
Replicate | 2.8B | 4K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
Meta Llama 3 70b Instruct
meta/meta-llama-3-70b-instruct |
Replicate | 47.0B | 8K | ~28.20 | ~84.60 |
V
T
I
A
J
|
Inferred |
|
Meta Llama 3 8b Instruct
meta/meta-llama-3-8b-instruct |
Replicate | 3.1B | 8K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
Meta Llama 3.1 405b Instruct
meta/meta-llama-3.1-405b-instruct |
Replicate | 120.0B | 131K | ~72.00 | ~216.00 |
V
T
I
A
J
|
Inferred |
|
Microsoft: Phi-3 Medium 128K Instruct
microsoft/phi-3-medium-128k-instruct |
Openrouter | 14.0B | 128K | ~7.00 | ~21.00 |
-
|
Estimated |
|
Microsoft: Phi-3 Mini 128K Instruct
microsoft/phi-3-mini-128k-instruct |
Openrouter | 3.8B | 128K | ~2.00 | ~6.00 |
-
|
Estimated |
|
Microsoft: Phi-3.5 Mini 128K Instruct
microsoft/phi-3.5-mini-128k-instruct |
Openrouter | 3.8B | 128K | ~2.00 | ~6.00 |
-
|
Estimated |
|
Microsoft: Phi 4
microsoft/phi-4 |
Openrouter | 14.0B | 16K | ~7.00 | ~21.00 |
-
|
Estimated |
|
Microsoft: Phi 4 Multimodal Instruct
microsoft/phi-4-multimodal-instruct |
Openrouter | 5.6B | 131K | ~3.36 | ~10.08 |
-
|
Estimated |
|
Microsoft: Phi 4 Reasoning Plus
microsoft/phi-4-reasoning-plus |
Openrouter | 14.0B | 33K | ~7.00 | ~21.00 |
-
|
Estimated |
|
WizardLM-2 8x22B
microsoft/wizardlm-2-8x22b |
Openrouter | 176.0B | 66K | ~52.80 | ~158.40 |
-
|
Estimated |
|
MiniMax: MiniMax-01
minimax/minimax-01 |
Openrouter | 456.0B | 1.0M | ~164.16 | ~492.48 |
-
|
Estimated |
|
MiniMax: MiniMax M1
minimax/minimax-m1 |
Openrouter | 456.0B | 1.0M | ~136.80 | ~410.40 |
-
|
Estimated |
|
MiniMax: MiniMax M2
minimax/minimax-m2 |
Openrouter | 10.0B | 262K | ~5.00 | ~15.00 |
-
|
Estimated |
|
MiniMax: MiniMax M2.1
minimax/minimax-m2.1 |
Openrouter | 10.0B | 205K | ~5.00 | ~15.00 |
-
|
Estimated |
|
MiniMax: MiniMax M2.5
minimax/minimax-m2.5 |
Openrouter | 2.5B | 197K | ~2.00 | ~6.00 |
-
|
Estimated |
| ministral-14b-2512 | Mistral | 2.2B | 128K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
ministral-14b-2512
ministral-14b-latest |
Mistral | 2.2B | 128K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
| ministral-3b-2512 | Mistral | 2.2B | 128K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
ministral-3b-2512
ministral-3b-latest |
Mistral | 3.0B | 128K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Estimated |
| ministral-8b-2512 | Mistral | 2.2B | 128K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
ministral-8b-2512
ministral-8b-latest |
Mistral | 8.0B | 128K | ~4.80 | ~14.40 |
V
T
I
A
J
|
Estimated |
|
Mistral 7b Instruct
mistral-7b-instruct |
Mistral | 2.0B | 32K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
mistral-embed-2312
mistral-embed |
Mistral | 3.7B | 8K | ~2.22 | ~6.66 |
V
T
I
A
J
|
Inferred |
|
Mistral Large
mistral-large |
Mistral | 38.0B | 256K | ~22.80 | ~68.40 |
V
T
I
A
J
|
Inferred |
|
Mistral Large 2407
mistral-large-2407 |
Mistral | 77.0B | 128K | ~46.20 | ~138.60 |
V
T
I
A
J
|
Inferred |
| mistral-large-2411 | Mistral | 123.0B | 128K | ~73.80 | ~221.40 |
V
T
I
A
J
|
Estimated |
| mistral-large-2512 | Mistral | 38.0B | 256K | ~22.80 | ~68.40 |
V
T
I
A
J
|
Inferred |
|
mistral-large-2512
mistral-large-latest |
Mistral | 38.0B | 256K | ~22.80 | ~68.40 |
V
T
I
A
J
|
Inferred |
|
pixtral-large-2411
mistral-large-pixtral-2411 |
Mistral | 38.0B | 256K | ~22.80 | ~68.40 |
V
T
I
A
J
|
Inferred |
|
mistral-medium-2508
mistral-medium |
Mistral | 42.0B | 128K | ~25.20 | ~75.60 |
V
T
I
A
J
|
Inferred |
|
Mistral Medium 3
mistral-medium-3 |
Mistral | 42.0B | 128K | ~25.20 | ~75.60 |
V
T
I
A
J
|
Inferred |
|
Mistral Medium 3.1
mistral-medium-3.1 |
Mistral | 42.0B | 128K | ~25.20 | ~75.60 |
V
T
I
A
J
|
Inferred |
|
mistral-medium-2508
mistral-medium-latest |
Mistral | 42.0B | 128K | ~25.20 | ~75.60 |
V
T
I
A
J
|
Inferred |
|
Mistral Nemo
mistral-nemo |
Mistral | 2.2B | 128K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
Mistral Saba
mistral-saba |
Mistral | 8.0B | 32K | ~4.80 | ~14.40 |
V
T
I
A
J
|
Inferred |
|
Mistral Small
mistral-small |
Mistral | 2.0B | 32K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
Mistral Small 2409
mistral-small-2409 |
Mistral | 22.0B | - | ~13.20 | ~39.60 |
-
|
Estimated |
|
Mistral Small 3.1
mistral-small-3.1 |
Mistral | 2.0B | 32K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
Mistral Small 3.2
mistral-small-3.2 |
Mistral | 2.0B | 32K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
Mistral Small Creative
mistral-small-creative |
Mistral | 3.7B | 32K | ~2.22 | ~6.66 |
V
T
I
A
J
|
Inferred |
|
mistral-small-2506
mistral-small-latest |
Mistral | 2.0B | 32K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
Mistral Tiny
mistral-tiny |
Mistral | 3.1B | 32K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
| mistral:7b | Ollama | 7.0B | - | ~4.20 | ~12.60 |
-
|
Estimated |
|
Mistral 7B Instruct V0.3
mistralai/Mistral-7B-Instruct-v0.3 |
Together | 7.0B | - | ~4.20 | ~12.60 |
-
|
Estimated |
|
Mistral-Small-3
mistralai/Mistral-Small-3 |
Together | 3.7B | 33K | ~2.22 | ~6.66 |
V
T
I
A
J
|
Inferred |
|
Mixtral-8x22B-Instruct-v0.1
mistralai/Mixtral-8x22B-Instruct-v0.1 |
Together | 141.0B | 66K | ~42.30 | ~126.90 |
V
T
I
A
J
|
Estimated |
|
Mixtral-8x7B-Instruct-v0.1
mistralai/Mixtral-8x7B-Instruct-v0.1 |
Together | 47.0B | 33K | ~14.10 | ~42.30 |
V
T
I
A
J
|
Estimated |
|
Mistral: Codestral 2508
mistralai/codestral-2508 |
Openrouter | 2.5B | 256K | ~2.00 | ~6.00 |
-
|
Estimated |
|
Mistral: Devstral 2 2512
mistralai/devstral-2512 |
Openrouter | 123.0B | 262K | ~73.80 | ~221.40 |
-
|
Estimated |
|
Mistral: Devstral 2 2512 (free)
mistralai/devstral-2512:free |
Openrouter | 123.0B | 262K | ~73.80 | ~221.40 |
-
|
Estimated |
|
Mistral: Devstral Small 1.1
mistralai/devstral-small |
Openrouter | 24.0B | 128K | ~12.00 | ~36.00 |
-
|
Estimated |
|
Mistral: Devstral Small 2505
mistralai/devstral-small-2505 |
Openrouter | 24.0B | 128K | ~12.00 | ~36.00 |
-
|
Estimated |
|
Mistral: Ministral 3 14B 2512
mistralai/ministral-14b-2512 |
Openrouter | 14.0B | 262K | ~8.40 | ~25.20 |
-
|
Estimated |
|
Mistral: Ministral 3B
mistralai/ministral-3b |
Openrouter | 3.0B | 131K | ~2.00 | ~6.00 |
-
|
Estimated |
|
Mistral: Ministral 3 3B 2512
mistralai/ministral-3b-2512 |
Openrouter | 3.0B | 131K | ~2.00 | ~6.00 |
-
|
Estimated |
|
Mistral: Ministral 8B
mistralai/ministral-8b |
Openrouter | 8.0B | 131K | ~4.80 | ~14.40 |
-
|
Estimated |
|
Mistral: Ministral 3 8B 2512
mistralai/ministral-8b-2512 |
Openrouter | 8.0B | 262K | ~4.80 | ~14.40 |
-
|
Estimated |
|
Mistral: Mistral 7B Instruct
mistralai/mistral-7b-instruct |
Openrouter | 7.0B | 33K | ~4.20 | ~12.60 |
-
|
Estimated |
|
Mistral: Mistral 7B Instruct v0.1
mistralai/mistral-7b-instruct-v0.1 |
Openrouter | 7.3B | 3K | ~3.65 | ~10.95 |
-
|
Estimated |
|
Mistral 7b Instruct V0.2
mistralai/mistral-7b-instruct-v0.2 |
Replicate | 3.1B | 33K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
Mistral: Mistral 7B Instruct v0.3
mistralai/mistral-7b-instruct-v0.3 |
Openrouter | 7.0B | 33K | ~4.20 | ~12.60 |
-
|
Estimated |
|
Mistral: Mistral 7B Instruct (free)
mistralai/mistral-7b-instruct:free |
Openrouter | 7.3B | 33K | ~3.65 | ~10.95 |
-
|
Estimated |
|
Mistral Large 2407
mistralai/mistral-large-2407 |
Openrouter | 2.4B | 131K | ~2.00 | ~6.00 |
-
|
Estimated |
|
Mistral Large 2411
mistralai/mistral-large-2411 |
Openrouter | 123.0B | 131K | ~49.20 | ~147.60 |
-
|
Research |
|
Mistral: Mistral Large 3 2512
mistralai/mistral-large-2512 |
Openrouter | 41.0B | 262K | ~24.60 | ~73.80 |
-
|
Estimated |
|
Mistral: Mistral Nemo
mistralai/mistral-nemo |
Openrouter | 12.0B | 131K | ~7.20 | ~21.60 |
-
|
Estimated |
|
Mistral: Saba
mistralai/mistral-saba |
Openrouter | 24.0B | 33K | ~12.00 | ~36.00 |
-
|
Estimated |
|
Mistral: Mistral Small 3
mistralai/mistral-small-24b-instruct-2501 |
Openrouter | 24.0B | 33K | ~12.00 | ~36.00 |
-
|
Estimated |
|
Mistral: Mistral Small 3.1 24B
mistralai/mistral-small-3.1-24b-instruct |
Openrouter | 24.0B | 131K | ~14.40 | ~43.20 |
-
|
Estimated |
|
Mistral: Mistral Small 3.1 24B (free)
mistralai/mistral-small-3.1-24b-instruct:free |
Openrouter | 24.0B | 128K | ~14.40 | ~43.20 |
-
|
Estimated |
|
Mistral: Mistral Small 3.2 24B
mistralai/mistral-small-3.2-24b-instruct |
Openrouter | 24.0B | 131K | ~14.40 | ~43.20 |
-
|
Estimated |
|
Mistral Tiny
mistralai/mistral-tiny |
Openrouter | 8.0B | 33K | ~4.00 | ~12.00 |
-
|
Estimated |
|
Mistral: Mixtral 8x22B Instruct
mistralai/mixtral-8x22b-instruct |
Openrouter | 141.0B | 66K | ~42.30 | ~126.90 |
-
|
Estimated |
|
Mistral: Mixtral 8x7B Instruct
mistralai/mixtral-8x7b-instruct |
Openrouter | 47.0B | 33K | ~14.10 | ~42.30 |
-
|
Estimated |
|
Mixtral 8x7b Instruct V0.1
mistralai/mixtral-8x7b-instruct-v0.1 |
Replicate | 20.0B | 33K | ~7.20 | ~21.60 |
V
T
I
A
J
|
Inferred |
|
Mistral: Pixtral 12B
mistralai/pixtral-12b |
Openrouter | 12.0B | 33K | ~8.64 | ~25.92 |
-
|
Estimated |
|
Mistral: Pixtral Large 2411
mistralai/pixtral-large-2411 |
Openrouter | 124.0B | 131K | ~74.40 | ~223.20 |
-
|
Estimated |
|
Mistral: Voxtral Small 24B 2507
mistralai/voxtral-small-24b-2507 |
Openrouter | 24.0B | 32K | ~14.40 | ~43.20 |
-
|
Estimated |
|
Mixtral 8x22b Instruct
mixtral-8x22b-instruct |
Mistral | 70.0B | 64K | ~25.20 | ~75.60 |
V
T
I
A
J
|
Inferred |
|
Mixtral 8x7b 32768
mixtral-8x7b-32768 |
Groq | 47.0B | 33K | ~14.10 | ~42.30 |
V
T
I
A
J
|
Estimated |
|
Mixtral 8x7b Instruct
mixtral-8x7b-instruct |
Mistral | 6.3B | 32K | ~2.27 | ~6.80 |
V
T
I
A
J
|
Inferred |
| mixtral:8x22b | Ollama | 141.0B | - | ~42.30 | ~126.90 |
-
|
Estimated |
| mixtral:8x7b | Ollama | 47.0B | - | ~14.10 | ~42.30 |
-
|
Estimated |
|
Moonshot V1 128k
moonshot-v1-128k |
Moonshot | 77.0B | 131K | ~46.20 | ~138.60 |
V
T
I
A
J
|
Inferred |
|
Moonshot V1 128k Vision Preview
moonshot-v1-128k-vision-preview |
Moonshot | 77.0B | 131K | ~46.20 | ~138.60 |
V
T
I
A
J
|
Inferred |
|
Moonshot V1 32k
moonshot-v1-32k |
Moonshot | 43.0B | 33K | ~25.80 | ~77.40 |
V
T
I
A
J
|
Inferred |
|
Moonshot V1 32k Vision Preview
moonshot-v1-32k-vision-preview |
Moonshot | 43.0B | 33K | ~25.80 | ~77.40 |
V
T
I
A
J
|
Inferred |
|
Moonshot V1 8k
moonshot-v1-8k |
Moonshot | 25.0B | 8K | ~15.00 | ~45.00 |
V
T
I
A
J
|
Inferred |
|
Moonshot V1 8k Vision Preview
moonshot-v1-8k-vision-preview |
Moonshot | 25.0B | 8K | ~15.00 | ~45.00 |
V
T
I
A
J
|
Inferred |
|
Kimi-K2.5
moonshotai/Kimi-K2.5 |
Together | 57.0B | 256K | ~34.20 | ~102.60 |
V
T
I
A
J
|
Inferred |
|
MoonshotAI: Kimi Dev 72B
moonshotai/kimi-dev-72b |
Openrouter | 72.0B | 131K | ~36.00 | ~108.00 |
-
|
Estimated |
|
MoonshotAI: Kimi K2 0711
moonshotai/kimi-k2 |
Openrouter | 32.0B | 131K | ~9.60 | ~28.80 |
-
|
Estimated |
|
MoonshotAI: Kimi K2 0905
moonshotai/kimi-k2-0905 |
Openrouter | 32.0B | 262K | ~9.60 | ~28.80 |
-
|
Estimated |
|
MoonshotAI: Kimi K2 0905 (exacto)
moonshotai/kimi-k2-0905:exacto |
Openrouter | 32.0B | 262K | ~9.60 | ~28.80 |
-
|
Estimated |
|
Moonshotai/kimi K2 Instruct
moonshotai/kimi-k2-instruct |
Groq | 55.0B | 131K | ~33.00 | ~99.00 |
V
T
I
A
J
|
Inferred |
|
Moonshotai/kimi K2 Instruct 0905
moonshotai/kimi-k2-instruct-0905 |
Groq | 60.0B | 262K | ~36.00 | ~108.00 |
V
T
I
A
J
|
Inferred |
|
MoonshotAI: Kimi K2 Thinking
moonshotai/kimi-k2-thinking |
Redpill | 46.0B | 262K | ~27.60 | ~82.80 |
V
T
I
A
J
|
Inferred |
|
MoonshotAI: Kimi K2.5
moonshotai/kimi-k2.5 |
Redpill | 60.0B | 262K | ~36.00 | ~108.00 |
V
T
I
A
J
|
Inferred |
|
MoonshotAI: Kimi K2 0711 (free)
moonshotai/kimi-k2:free |
Openrouter | 32.0B | 33K | ~9.60 | ~28.80 |
-
|
Estimated |
|
MoonshotAI: Kimi Linear 48B A3B Instruct
moonshotai/kimi-linear-48b-a3b-instruct |
Openrouter | 48.0B | 1.0M | ~24.00 | ~72.00 |
-
|
Estimated |
|
NeverSleep: Lumimaid v0.2 8B
neversleep/llama-3.1-lumimaid-8b |
Openrouter | 8.0B | 33K | ~4.00 | ~12.00 |
-
|
Estimated |
|
Noromaid 20B
neversleep/noromaid-20b |
Openrouter | 20.0B | 4K | ~10.00 | ~30.00 |
-
|
Estimated |
|
Nous: DeepHermes 3 Mistral 24B Preview
nousresearch/deephermes-3-mistral-24b-preview |
Openrouter | 24.0B | 33K | ~12.00 | ~36.00 |
-
|
Estimated |
|
NousResearch: Hermes 2 Pro - Llama-3 8B
nousresearch/hermes-2-pro-llama-3-8b |
Openrouter | 8.0B | 8K | ~4.00 | ~12.00 |
-
|
Estimated |
|
Nous: Hermes 3 405B Instruct
nousresearch/hermes-3-llama-3.1-405b |
Redpill | 22.0B | 131K | ~13.20 | ~39.60 |
V
T
I
A
J
|
Inferred |
|
Nous: Hermes 3 405B Instruct (free)
nousresearch/hermes-3-llama-3.1-405b:free |
Openrouter | 405.0B | 131K | ~202.50 | ~607.50 |
-
|
Estimated |
|
Nous: Hermes 3 70B Instruct
nousresearch/hermes-3-llama-3.1-70b |
Openrouter | 7.0B | 66K | ~3.50 | ~10.50 |
-
|
Estimated |
|
Nous: Hermes 4 405B
nousresearch/hermes-4-405b |
Openrouter | 405.0B | 131K | ~202.50 | ~607.50 |
-
|
Estimated |
|
Nous: Hermes 4 70B
nousresearch/hermes-4-70b |
Openrouter | 70.0B | 131K | ~35.00 | ~105.00 |
-
|
Estimated |
|
NVIDIA: Llama 3.1 Nemotron 70B Instruct
nvidia/llama-3.1-nemotron-70b-instruct |
Openrouter | 70.0B | 131K | ~42.00 | ~126.00 |
-
|
Estimated |
|
NVIDIA: Llama 3.1 Nemotron Ultra 253B v1
nvidia/llama-3.1-nemotron-ultra-253b-v1 |
Openrouter | 253.0B | 131K | ~126.50 | ~379.50 |
-
|
Estimated |
|
NVIDIA: Llama 3.3 Nemotron Super 49B V1.5
nvidia/llama-3.3-nemotron-super-49b-v1.5 |
Openrouter | 49.0B | 131K | ~24.50 | ~73.50 |
-
|
Estimated |
|
NVIDIA: Nemotron 3 Nano 30B A3B
nvidia/nemotron-3-nano-30b-a3b |
Openrouter | 30.0B | 262K | ~9.00 | ~27.00 |
-
|
Estimated |
|
NVIDIA: Nemotron 3 Nano 30B A3B (free)
nvidia/nemotron-3-nano-30b-a3b:free |
Openrouter | 30.0B | 256K | ~9.00 | ~27.00 |
-
|
Estimated |
|
NVIDIA: Nemotron 3 Super (free)
nvidia/nemotron-3-super-120b-a12b:free |
Openrouter | 120.0B | 262K | ~36.00 | ~108.00 |
-
|
Estimated |
|
NVIDIA: Nemotron Nano 12B 2 VL
nvidia/nemotron-nano-12b-v2-vl |
Openrouter | 12.0B | 131K | ~7.20 | ~21.60 |
-
|
Estimated |
|
NVIDIA: Nemotron Nano 12B 2 VL (free)
nvidia/nemotron-nano-12b-v2-vl:free |
Openrouter | 12.0B | 128K | ~7.20 | ~21.60 |
-
|
Estimated |
|
NVIDIA: Nemotron Nano 9B V2
nvidia/nemotron-nano-9b-v2 |
Openrouter | 9.0B | 131K | ~4.50 | ~13.50 |
-
|
Estimated |
|
NVIDIA: Nemotron Nano 9B V2 (free)
nvidia/nemotron-nano-9b-v2:free |
Openrouter | 9.0B | 128K | ~4.50 | ~13.50 |
-
|
Estimated |
| o1 | Openai | 200.0B | 200K | ~120.00 | ~360.00 |
V
T
I
A
J
|
Estimated |
| o1-2024-12-17 | Openai | 8.0B | 200K | ~4.80 | ~14.40 |
V
T
I
A
J
|
Estimated |
| o1-mini | Openai | 150.0B | 128K | ~90.00 | ~270.00 |
V
T
I
A
J
|
Inferred |
|
O1 Mini 2024 09 12
o1-mini-2024-09-12 |
Openai | 100.0B | - | ~30.00 | ~90.00 |
-
|
Estimated |
| o1-preview | Openai | 660.0B | 128K | ~396.00 | ~1,188.00 |
V
T
I
A
J
|
Inferred |
|
O1 Preview 2024 09 12
o1-preview-2024-09-12 |
Openai | 300.0B | - | ~90.00 | ~270.00 |
-
|
Estimated |
| o1-pro | Openai | 50.0B | 200K | ~30.00 | ~90.00 |
V
T
I
A
J
|
Estimated |
| o1-pro-2025-03-19 | Openai | 50.0B | 200K | ~30.00 | ~90.00 |
V
T
I
A
J
|
Estimated |
| o3 | Openai | 560.0B | 200K | ~336.00 | ~1,008.00 |
V
T
I
A
J
|
Inferred |
| o3-2025-04-16 | Openai | 560.0B | 200K | ~336.00 | ~1,008.00 |
V
T
I
A
J
|
Inferred |
| o3-mini | Openai | 71.0B | 200K | ~42.60 | ~127.80 |
V
T
I
A
J
|
Inferred |
| o3-mini-2025-01-31 | Openai | 71.0B | 200K | ~42.60 | ~127.80 |
V
T
I
A
J
|
Inferred |
| o4-mini | Openai | 71.0B | 200K | ~42.60 | ~127.80 |
V
T
I
A
J
|
Inferred |
| o4-mini-2025-04-16 | Openai | 71.0B | 200K | ~42.60 | ~127.80 |
V
T
I
A
J
|
Inferred |
| o4-mini-deep-research | Openai | 71.0B | 200K | ~42.60 | ~127.80 |
V
T
I
A
J
|
Inferred |
| o4-mini-deep-research-2025-06-26 | Openai | 71.0B | 200K | ~42.60 | ~127.80 |
V
T
I
A
J
|
Inferred |
|
Mistral: Mistral 7B Instruct
open-mistral-7b |
Mistral | 7.0B | - | ~4.20 | ~12.60 |
-
|
Estimated |
| open-mistral-nemo | Mistral | 12.0B | 128K | ~7.20 | ~21.60 |
V
T
I
A
J
|
Estimated |
|
open-mistral-nemo
open-mistral-nemo-2407 |
Mistral | 2.2B | 128K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
Open Mixtral 8x22b
open-mixtral-8x22b |
Mistral | 141.0B | 64K | ~42.30 | ~126.90 |
V
T
I
A
J
|
Estimated |
|
Open Mixtral 8x7b
open-mixtral-8x7b |
Mistral | 47.0B | 32K | ~14.10 | ~42.30 |
V
T
I
A
J
|
Estimated |
|
OpenAI: GPT-3.5 Turbo
openai/gpt-3.5-turbo |
Redpill | 20.0B | 16K | ~12.00 | ~36.00 |
V
T
I
A
J
|
Estimated |
|
OpenAI: GPT-4
openai/gpt-4 |
Redpill | 200.0B | 8K | ~120.00 | ~360.00 |
V
T
I
A
J
|
Estimated |
|
OpenAI: GPT-4 Turbo
openai/gpt-4-turbo |
Openrouter | 200.0B | 128K | ~144.00 | ~432.00 |
-
|
Estimated |
|
OpenAI: GPT-4 Turbo Preview
openai/gpt-4-turbo-preview |
Openrouter | 200.0B | 128K | ~144.00 | ~432.00 |
-
|
Estimated |
|
OpenAI: GPT-4.1
openai/gpt-4.1 |
Redpill | 110.0B | 1.0M | ~66.00 | ~198.00 |
V
T
I
A
J
|
Inferred |
|
OpenAI: GPT-4.1 Mini
openai/gpt-4.1-mini |
Redpill | 8.0B | 1.0M | ~5.76 | ~17.28 |
V
T
I
A
J
|
Estimated |
|
OpenAI: GPT-4.1 Nano
openai/gpt-4.1-nano |
Redpill | 4.0B | 1.0M | ~2.88 | ~8.64 |
V
T
I
A
J
|
Estimated |
|
OpenAI: GPT-4o
openai/gpt-4o |
Redpill | 200.0B | 128K | ~96.00 | ~288.00 |
V
T
I
A
J
|
Research |
|
OpenAI: GPT-4o (2024-05-13)
openai/gpt-4o-2024-05-13 |
Openrouter | 200.0B | 128K | ~144.00 | ~432.00 |
-
|
Estimated |
|
OpenAI: GPT-4o (2024-08-06)
openai/gpt-4o-2024-08-06 |
Openrouter | 200.0B | 128K | ~144.00 | ~432.00 |
-
|
Estimated |
|
OpenAI: GPT-4o (2024-11-20)
openai/gpt-4o-2024-11-20 |
Openrouter | 200.0B | 128K | ~144.00 | ~432.00 |
-
|
Estimated |
|
OpenAI: GPT-4o-mini
openai/gpt-4o-mini |
Redpill | 8.0B | 128K | ~5.76 | ~17.28 |
V
T
I
A
J
|
Estimated |
|
OpenAI: GPT-4o-mini (2024-07-18)
openai/gpt-4o-mini-2024-07-18 |
Openrouter | 8.0B | 128K | ~5.76 | ~17.28 |
-
|
Estimated |
|
OpenAI: GPT-5
openai/gpt-5 |
Redpill | 140.0B | 400K | ~84.00 | ~252.00 |
V
T
I
A
J
|
Inferred |
|
OpenAI: GPT-5 Image
openai/gpt-5-image |
Openrouter | 0.0B | 400K | ~2.00 | ~6.00 |
-
|
Estimated |
|
OpenAI: GPT-5 Image Mini
openai/gpt-5-image-mini |
Openrouter | 0.0B | 400K | ~2.00 | ~6.00 |
-
|
Estimated |
|
OpenAI: GPT-5 Mini
openai/gpt-5-mini |
Redpill | 46.0B | 400K | ~27.60 | ~82.80 |
V
T
I
A
J
|
Inferred |
|
OpenAI: GPT-5 Nano
openai/gpt-5-nano |
Redpill | 10.0B | 400K | ~7.20 | ~21.60 |
V
T
I
A
J
|
Estimated |
|
OpenAI: GPT-5.1
openai/gpt-5.1 |
Redpill | 140.0B | 400K | ~84.00 | ~252.00 |
V
T
I
A
J
|
Inferred |
|
OpenAI: GPT-5.2
openai/gpt-5.2 |
Redpill | 200.0B | 400K | ~120.00 | ~360.00 |
V
T
I
A
J
|
Inferred |
|
OpenAI: GPT OSS 120B
openai/gpt-oss-120b |
Redpill | 6.4B | 131K | ~3.84 | ~11.52 |
V
T
I
A
J
|
Inferred |
|
OpenAI: gpt-oss-120b (exacto)
openai/gpt-oss-120b:exacto |
Openrouter | 120.0B | 131K | ~36.00 | ~108.00 |
-
|
Estimated |
|
OpenAI: gpt-oss-120b (free)
openai/gpt-oss-120b:free |
Openrouter | 120.0B | 131K | ~36.00 | ~108.00 |
-
|
Estimated |
|
OpenAI: GPT OSS 20B
openai/gpt-oss-20b |
Redpill | 2.2B | 131K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
OpenAI: gpt-oss-20b (free)
openai/gpt-oss-20b:free |
Openrouter | 20.0B | 131K | ~6.00 | ~18.00 |
-
|
Estimated |
|
Openai/gpt Oss Safeguard 20b
openai/gpt-oss-safeguard-20b |
Groq | 4.0B | 131K | ~2.40 | ~7.20 |
V
T
I
A
J
|
Inferred |
|
OpenAI: o1
openai/o1 |
Openrouter | 200.0B | 200K | ~120.00 | ~360.00 |
-
|
Estimated |
|
OpenAI: o3
openai/o3 |
Redpill | 110.0B | 200K | ~66.00 | ~198.00 |
V
T
I
A
J
|
Inferred |
|
OpenAI: o3 Mini
openai/o3-mini |
Openrouter | 50.0B | 200K | ~30.00 | ~90.00 |
-
|
Estimated |
|
OpenAI: o4 Mini
openai/o4-mini |
Redpill | 71.0B | 200K | ~42.60 | ~127.80 |
V
T
I
A
J
|
Inferred |
|
Whisper
openai/whisper |
Replicate | 17.0B | - | ~10.20 | ~30.60 |
V
T
I
A
J
|
Inferred |
|
OpenGVLab: InternVL3 78B
opengvlab/internvl3-78b |
Openrouter | 78.0B | 33K | ~46.80 | ~140.40 |
-
|
Estimated |
|
Auto Router
openrouter/auto |
Openrouter | 176.0B | 2.0M | ~52.80 | ~158.40 |
-
|
Estimated |
|
DeepSeek V3.1
phala/deepseek-chat-v3.1 |
Redpill | 48.0B | 164K | ~17.28 | ~51.84 |
V
T
I
A
J
|
Inferred |
|
DeepSeek: DeepSeek V3.2
phala/deepseek-v3.2 |
Redpill | 5.3B | 164K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
Google: Gemma 3 27B
phala/gemma-3-27b-it |
Redpill | 4.8B | 54K | ~2.88 | ~8.64 |
V
T
I
A
J
|
Inferred |
|
Z.AI: GLM 4.7
phala/glm-4.7 |
Redpill | 57.0B | 131K | ~34.20 | ~102.60 |
V
T
I
A
J
|
Inferred |
|
Z.AI: GLM 4.7 Flash
phala/glm-4.7-flash |
Redpill | 6.1B | 203K | ~3.66 | ~10.98 |
V
T
I
A
J
|
Inferred |
|
Z.AI: GLM 5
phala/glm-5 |
Redpill | 64.0B | 203K | ~38.40 | ~115.20 |
V
T
I
A
J
|
Inferred |
|
OpenAI: GPT OSS 120B
phala/gpt-oss-120b |
Redpill | 6.4B | 131K | ~3.84 | ~11.52 |
V
T
I
A
J
|
Inferred |
|
OpenAI: GPT OSS 20B
phala/gpt-oss-20b |
Redpill | 2.2B | 131K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
MoonshotAI: Kimi K2.5
phala/kimi-k2.5 |
Redpill | 60.0B | 262K | ~36.00 | ~108.00 |
V
T
I
A
J
|
Inferred |
|
Qwen2.5 7B Instruct
phala/qwen-2.5-7b-instruct |
Redpill | 2.0B | 33K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
Qwen: Qwen3 VL 30B A3B Instruct
phala/qwen2.5-vl-72b-instruct |
Redpill | 12.0B | 128K | ~7.20 | ~21.60 |
V
T
I
A
J
|
Inferred |
|
Qwen3 30B A3B Instruct 2507
phala/qwen3-30b-a3b-instruct-2507 |
Redpill | 6.4B | 262K | ~3.84 | ~11.52 |
V
T
I
A
J
|
Inferred |
|
Qwen: Qwen3 VL 30B A3B Instruct
phala/qwen3-vl-30b-a3b-instruct |
Redpill | 12.0B | 128K | ~7.20 | ~21.60 |
V
T
I
A
J
|
Inferred |
|
Qwen: Qwen3.5-27B
phala/qwen3.5-27b |
Redpill | 51.0B | 262K | ~30.60 | ~91.80 |
V
T
I
A
J
|
Inferred |
|
Phala: Venice Uncensored 24B
phala/uncensored-24b |
Redpill | 17.0B | 33K | ~10.20 | ~30.60 |
V
T
I
A
J
|
Inferred |
| phi3:medium | Ollama | 14.0B | - | ~8.40 | ~25.20 |
-
|
Estimated |
| phi3:mini | Ollama | 3.8B | - | ~2.28 | ~6.84 |
-
|
Estimated |
|
Mistral: Pixtral 12B
pixtral-12b-2409 |
Mistral | 12.0B | - | ~8.64 | ~25.92 |
-
|
Estimated |
|
Pixtral 12b Latest
pixtral-12b-latest |
Mistral | 2.2B | 128K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
| pixtral-large-2411 | Mistral | 77.0B | 128K | ~46.20 | ~138.60 |
V
T
I
A
J
|
Inferred |
|
pixtral-large-2411
pixtral-large-latest |
Mistral | 77.0B | 128K | ~46.20 | ~138.60 |
V
T
I
A
J
|
Inferred |
|
Prime Intellect: INTELLECT-3
prime-intellect/intellect-3 |
Openrouter | 106.0B | 131K | ~31.80 | ~95.40 |
-
|
Estimated |
|
Qwen 2.5 32b
qwen-2.5-32b |
Groq | 4.7B | 33K | ~2.82 | ~8.46 |
V
T
I
A
J
|
Inferred |
|
Qwen 2.5 Coder 32b
qwen-2.5-coder-32b |
Groq | 4.7B | 33K | ~2.82 | ~8.46 |
V
T
I
A
J
|
Inferred |
|
Qwen 3 235b A22b Instruct 2507
qwen-3-235b-a22b-instruct-2507 |
Cerebras | 25.0B | 33K | ~15.00 | ~45.00 |
V
T
I
A
J
|
Inferred |
|
Qwen Qwq 32b
qwen-qwq-32b |
Groq | 7.6B | 131K | ~4.56 | ~13.68 |
V
T
I
A
J
|
Inferred |
|
Qwen2.5 72B Instruct
qwen/qwen-2.5-72b-instruct |
Openrouter | 72.0B | 33K | ~43.20 | ~129.60 |
-
|
Estimated |
|
Qwen2.5 7B Instruct
qwen/qwen-2.5-7b-instruct |
Redpill | 7.0B | 33K | ~4.20 | ~12.60 |
V
T
I
A
J
|
Estimated |
|
Qwen2.5 Coder 32B Instruct
qwen/qwen-2.5-coder-32b-instruct |
Openrouter | 32.0B | 33K | ~19.20 | ~57.60 |
-
|
Estimated |
|
Qwen: Qwen2.5-VL 7B Instruct
qwen/qwen-2.5-vl-7b-instruct |
Openrouter | 7.0B | 33K | ~4.20 | ~12.60 |
-
|
Estimated |
|
Qwen: Qwen2.5-VL 7B Instruct (free)
qwen/qwen-2.5-vl-7b-instruct:free |
Openrouter | 7.0B | 33K | ~4.20 | ~12.60 |
-
|
Estimated |
|
Qwen: Qwen2.5 Coder 7B Instruct
qwen/qwen2.5-coder-7b-instruct |
Openrouter | 7.0B | 33K | ~3.50 | ~10.50 |
-
|
Estimated |
|
Qwen: Qwen2.5 VL 32B Instruct
qwen/qwen2.5-vl-32b-instruct |
Openrouter | 32.0B | 16K | ~19.20 | ~57.60 |
-
|
Estimated |
|
Qwen: Qwen2.5 VL 72B Instruct
qwen/qwen2.5-vl-72b-instruct |
Openrouter | 72.0B | 33K | ~43.20 | ~129.60 |
-
|
Estimated |
|
Qwen: Qwen3 14B
qwen/qwen3-14b |
Openrouter | 14.0B | 41K | ~7.00 | ~21.00 |
-
|
Estimated |
|
Qwen: Qwen3 235B A22B
qwen/qwen3-235b-a22b |
Openrouter | 235.0B | 41K | ~70.50 | ~211.50 |
-
|
Estimated |
|
Qwen: Qwen3 235B A22B Instruct 2507
qwen/qwen3-235b-a22b-2507 |
Openrouter | 235.0B | 262K | ~117.50 | ~352.50 |
-
|
Estimated |
|
Qwen: Qwen3 235B A22B Thinking 2507
qwen/qwen3-235b-a22b-thinking-2507 |
Openrouter | 235.0B | 262K | ~70.50 | ~211.50 |
-
|
Estimated |
|
Qwen: Qwen3 235B A22B (free)
qwen/qwen3-235b-a22b:free |
Openrouter | 235.0B | 131K | ~70.50 | ~211.50 |
-
|
Estimated |
|
Qwen: Qwen3 30B A3B
qwen/qwen3-30b-a3b |
Openrouter | 30.0B | 41K | ~9.00 | ~27.00 |
-
|
Estimated |
|
Qwen3 30B A3B Instruct 2507
qwen/qwen3-30b-a3b-instruct-2507 |
Redpill | 6.4B | 262K | ~3.84 | ~11.52 |
V
T
I
A
J
|
Inferred |
|
Qwen: Qwen3 30B A3B Thinking 2507
qwen/qwen3-30b-a3b-thinking-2507 |
Openrouter | 30.0B | 33K | ~15.00 | ~45.00 |
-
|
Estimated |
|
Qwen/qwen3 32b
qwen/qwen3-32b |
Groq | 7.6B | 131K | ~4.56 | ~13.68 |
V
T
I
A
J
|
Inferred |
|
Qwen: Qwen3 4B (free)
qwen/qwen3-4b:free |
Openrouter | 4.0B | 41K | ~2.00 | ~6.00 |
-
|
Estimated |
|
Qwen: Qwen3 8B
qwen/qwen3-8b |
Openrouter | 8.0B | 128K | ~4.00 | ~12.00 |
-
|
Estimated |
|
Qwen: Qwen3 Coder 480B A35B
qwen/qwen3-coder |
Openrouter | 480.0B | 262K | ~144.00 | ~432.00 |
-
|
Estimated |
|
Qwen: Qwen3 Coder 30B A3B Instruct
qwen/qwen3-coder-30b-a3b-instruct |
Openrouter | 30.0B | 262K | ~9.00 | ~27.00 |
-
|
Estimated |
|
Qwen: Qwen3 Coder 480B A35B
qwen/qwen3-coder-480b-a35b-instruct |
Redpill | 46.0B | 262K | ~27.60 | ~82.80 |
V
T
I
A
J
|
Inferred |
|
Qwen: Qwen3 Coder Plus
qwen/qwen3-coder-plus |
Openrouter | 480.0B | 128K | ~240.00 | ~720.00 |
-
|
Estimated |
|
Qwen: Qwen3 Coder 480B A35B (exacto)
qwen/qwen3-coder:exacto |
Openrouter | 480.0B | 262K | ~144.00 | ~432.00 |
-
|
Estimated |
|
Qwen: Qwen3 Coder 480B A35B (free)
qwen/qwen3-coder:free |
Openrouter | 480.0B | 262K | ~144.00 | ~432.00 |
-
|
Estimated |
|
Qwen: Qwen3 Next 80B A3B Instruct
qwen/qwen3-next-80b-a3b-instruct |
Openrouter | 80.0B | 262K | ~40.00 | ~120.00 |
-
|
Estimated |
|
Qwen: Qwen3 Next 80B A3B Instruct (free)
qwen/qwen3-next-80b-a3b-instruct:free |
Openrouter | 80.0B | 262K | ~40.00 | ~120.00 |
-
|
Estimated |
|
Qwen: Qwen3 Next 80B A3B Thinking
qwen/qwen3-next-80b-a3b-thinking |
Openrouter | 80.0B | 131K | ~40.00 | ~120.00 |
-
|
Estimated |
|
Qwen: Qwen3 VL 235B A22B Instruct
qwen/qwen3-vl-235b-a22b-instruct |
Openrouter | 235.0B | 262K | ~141.00 | ~423.00 |
-
|
Estimated |
|
Qwen: Qwen3 VL 235B A22B Thinking
qwen/qwen3-vl-235b-a22b-thinking |
Openrouter | 235.0B | 262K | ~141.00 | ~423.00 |
-
|
Estimated |
|
Qwen: Qwen3 VL 30B A3B Instruct
qwen/qwen3-vl-30b-a3b-instruct |
Redpill | 12.0B | 128K | ~7.20 | ~21.60 |
V
T
I
A
J
|
Inferred |
|
Qwen: Qwen3 VL 30B A3B Thinking
qwen/qwen3-vl-30b-a3b-thinking |
Openrouter | 30.0B | 131K | ~18.00 | ~54.00 |
-
|
Estimated |
|
Qwen: Qwen3 VL 32B Instruct
qwen/qwen3-vl-32b-instruct |
Openrouter | 32.0B | 262K | ~19.20 | ~57.60 |
-
|
Estimated |
|
Qwen: Qwen3 VL 8B Instruct
qwen/qwen3-vl-8b-instruct |
Openrouter | 8.0B | 131K | ~4.80 | ~14.40 |
-
|
Estimated |
|
Qwen: Qwen3 VL 8B Thinking
qwen/qwen3-vl-8b-thinking |
Openrouter | 8.0B | 256K | ~4.80 | ~14.40 |
-
|
Estimated |
|
Qwen: Qwen3.5-122B-A10B
qwen/qwen3.5-122b-a10b |
Openrouter | 122.0B | 262K | ~73.20 | ~219.60 |
-
|
Estimated |
|
Qwen: Qwen3.5-27B
qwen/qwen3.5-27b |
Redpill | 27.0B | 262K | ~19.44 | ~58.32 |
V
T
I
A
J
|
Estimated |
|
Qwen: Qwen3.5-35B-A3B
qwen/qwen3.5-35b-a3b |
Openrouter | 35.0B | 262K | ~21.00 | ~63.00 |
-
|
Estimated |
|
Qwen: Qwen3.5 397B A17B
qwen/qwen3.5-397b-a17b |
Redpill | 397.0B | 262K | ~238.20 | ~714.60 |
V
T
I
A
J
|
Estimated |
|
Qwen: Qwen3.5-9B
qwen/qwen3.5-9b |
Openrouter | 9.0B | 262K | ~5.40 | ~16.20 |
-
|
Estimated |
|
Qwen: QwQ 32B
qwen/qwq-32b |
Openrouter | 32.0B | 33K | ~16.00 | ~48.00 |
-
|
Estimated |
| qwen2.5:14b | Ollama | 14.0B | - | ~8.40 | ~25.20 |
-
|
Estimated |
| qwen2.5:32b | Ollama | 32.0B | - | ~19.20 | ~57.60 |
-
|
Estimated |
| qwen2.5:72b | Ollama | 72.0B | - | ~43.20 | ~129.60 |
-
|
Estimated |
| qwen2.5:7b | Ollama | 7.0B | - | ~4.20 | ~12.60 |
-
|
Estimated |
|
SorcererLM 8x22B
raifle/sorcererlm-8x22b |
Openrouter | 176.0B | 16K | ~52.80 | ~158.40 |
-
|
Estimated |
|
Rerank English V2.0
rerank-english-v2.0 |
Cohere | 50.0B | - | ~25.00 | ~75.00 |
-
|
Inferred |
|
Rerank English V3.0
rerank-english-v3.0 |
Cohere | 70.0B | - | ~35.00 | ~105.00 |
-
|
Inferred |
|
Rerank Multilingual V2.0
rerank-multilingual-v2.0 |
Cohere | 50.0B | - | ~25.00 | ~75.00 |
-
|
Inferred |
| rerank-multilingual-v3.0 | Cohere | 63.0B | 4K | ~31.50 | ~94.50 |
-
|
Inferred |
|
Sao10k: Llama 3 Euryale 70B v2.1
sao10k/l3-euryale-70b |
Openrouter | 70.0B | 8K | ~35.00 | ~105.00 |
-
|
Estimated |
|
Sao10K: Llama 3 8B Lunaris
sao10k/l3-lunaris-8b |
Openrouter | 8.0B | 8K | ~4.00 | ~12.00 |
-
|
Estimated |
|
Sao10K: Llama 3.1 70B Hanami x1
sao10k/l3.1-70b-hanami-x1 |
Openrouter | 70.0B | 16K | ~35.00 | ~105.00 |
-
|
Estimated |
|
Sao10K: Llama 3.1 Euryale 70B v2.2
sao10k/l3.1-euryale-70b |
Openrouter | 70.0B | 33K | ~35.00 | ~105.00 |
-
|
Estimated |
|
Sao10K: Llama 3.3 Euryale 70B
sao10k/l3.3-euryale-70b |
Openrouter | 70.0B | 131K | ~35.00 | ~105.00 |
-
|
Estimated |
|
Sonar
sonar |
Perplexity | 22.0B | 128K | ~13.20 | ~39.60 |
V
T
I
A
J
|
Inferred |
|
Sonar Medium Chat
sonar-medium-chat |
Perplexity | 8.0B | 16K | ~4.80 | ~14.40 |
V
T
I
A
J
|
Inferred |
|
Sonar Medium Online
sonar-medium-online |
Perplexity | 8.0B | 12K | ~4.80 | ~14.40 |
V
T
I
A
J
|
Inferred |
|
Sonar Pro
sonar-pro |
Perplexity | 240.0B | 200K | ~144.00 | ~432.00 |
V
T
I
A
J
|
Inferred |
|
Sonar Reasoning
sonar-reasoning |
Perplexity | 70.0B | 128K | ~42.00 | ~126.00 |
V
T
I
A
J
|
Inferred |
|
Sonar Reasoning Pro
sonar-reasoning-pro |
Perplexity | 99.0B | 128K | ~59.40 | ~178.20 |
V
T
I
A
J
|
Inferred |
|
Sonar Small Chat
sonar-small-chat |
Perplexity | 2.0B | 16K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
Sonar Small Online
sonar-small-online |
Perplexity | 2.0B | 12K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
StepFun: Step3
stepfun-ai/step3 |
Openrouter | 321.0B | 66K | ~192.60 | ~577.80 |
-
|
Estimated |
|
StepFun: Step 3.5 Flash
stepfun/step-3.5-flash |
Openrouter | 11.0B | 256K | ~3.30 | ~9.90 |
-
|
Estimated |
|
StepFun: Step 3.5 Flash (free)
stepfun/step-3.5-flash:free |
Openrouter | 11.0B | 256K | ~3.30 | ~9.90 |
-
|
Estimated |
|
Tencent: Hunyuan A13B Instruct
tencent/hunyuan-a13b-instruct |
Openrouter | 13.0B | 131K | ~3.90 | ~11.70 |
-
|
Estimated |
|
Text Embedding 004
text-embedding-004 |
1.8B | 2K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred | |
| text-embedding-3-large | Openai | 4.7B | 8K | ~2.82 | ~8.46 |
V
T
I
A
J
|
Inferred |
| text-embedding-3-small | Openai | 2.0B | 8K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
| text-embedding-ada-002 | Openai | 3.7B | 8K | ~2.22 | ~6.66 |
V
T
I
A
J
|
Inferred |
|
TheDrummer: Anubis 70B V1.1
thedrummer/anubis-70b-v1.1 |
Openrouter | 70.0B | 131K | ~35.00 | ~105.00 |
-
|
Estimated |
|
TheDrummer: Cydonia 24B V4.1
thedrummer/cydonia-24b-v4.1 |
Openrouter | 24.0B | 131K | ~12.00 | ~36.00 |
-
|
Estimated |
|
TheDrummer: Rocinante 12B
thedrummer/rocinante-12b |
Openrouter | 12.0B | 33K | ~6.00 | ~18.00 |
-
|
Estimated |
|
TheDrummer: Skyfall 36B V2
thedrummer/skyfall-36b-v2 |
Openrouter | 36.0B | 33K | ~18.00 | ~54.00 |
-
|
Estimated |
|
TheDrummer: UnslopNemo 12B
thedrummer/unslopnemo-12b |
Openrouter | 12.0B | 33K | ~6.00 | ~18.00 |
-
|
Estimated |
|
THUDM: GLM 4.1V 9B Thinking
thudm/glm-4.1v-9b-thinking |
Openrouter | 9.0B | 66K | ~5.40 | ~16.20 |
-
|
Estimated |
|
TNG: DeepSeek R1T2 Chimera
tngtech/deepseek-r1t2-chimera |
Openrouter | 671.0B | 164K | ~335.50 | ~1,006.50 |
-
|
Estimated |
|
TNG: DeepSeek R1T2 Chimera (free)
tngtech/deepseek-r1t2-chimera:free |
Openrouter | 671.0B | 164K | ~335.50 | ~1,006.50 |
-
|
Estimated |
|
ReMM SLERP 13B
undi95/remm-slerp-l2-13b |
Openrouter | 2.0B | 6K | ~2.00 | ~6.00 |
-
|
Estimated |
|
Upstage: Solar Pro 3
upstage/solar-pro-3 |
Openrouter | 102.0B | 128K | ~30.60 | ~91.80 |
-
|
Estimated |
|
voxtral-mini-transcribe-2507
voxtral-mini-2507 |
Mistral | 3.7B | 32K | ~2.22 | ~6.66 |
V
T
I
A
J
|
Inferred |
| voxtral-mini-2602 | Mistral | 3.7B | 32K | ~2.22 | ~6.66 |
V
T
I
A
J
|
Inferred |
|
voxtral-mini-2602
voxtral-mini-latest |
Mistral | 3.7B | 32K | ~2.22 | ~6.66 |
V
T
I
A
J
|
Inferred |
|
voxtral-mini-transcribe-realtime-2602
voxtral-mini-realtime-2602 |
Mistral | 3.7B | 32K | ~2.22 | ~6.66 |
V
T
I
A
J
|
Inferred |
|
voxtral-mini-transcribe-realtime-2602
voxtral-mini-realtime-latest |
Mistral | 3.7B | 32K | ~2.22 | ~6.66 |
V
T
I
A
J
|
Inferred |
| voxtral-mini-transcribe-2507 | Mistral | 3.7B | 32K | ~2.22 | ~6.66 |
V
T
I
A
J
|
Inferred |
| voxtral-mini-transcribe-realtime-2602 | Mistral | 3.7B | 32K | ~2.22 | ~6.66 |
V
T
I
A
J
|
Inferred |
|
Voxtral Small
voxtral-small |
Mistral | 3.7B | 32K | ~2.22 | ~6.66 |
V
T
I
A
J
|
Inferred |
| voxtral-small-2507 | Mistral | 3.7B | 32K | ~2.22 | ~6.66 |
V
T
I
A
J
|
Inferred |
|
voxtral-small-2507
voxtral-small-latest |
Mistral | 3.7B | 32K | ~2.22 | ~6.66 |
V
T
I
A
J
|
Inferred |
|
Whisper Large V3
whisper-large-v3 |
Groq | 3.6B | 448 | ~2.16 | ~6.48 |
V
T
I
A
J
|
Inferred |
|
Whisper Large V3 Turbo
whisper-large-v3-turbo |
Groq | 1.8B | 448 | ~2.00 | ~6.00 |
V
T
I
A
J
|
Inferred |
|
xAI: Grok 3 Beta
x-ai/grok-3-beta |
Openrouter | 400.0B | 131K | ~120.00 | ~360.00 |
-
|
Estimated |
|
xAI: Grok 3 Mini Beta
x-ai/grok-3-mini-beta |
Openrouter | 100.0B | 131K | ~30.00 | ~90.00 |
-
|
Estimated |
|
xAI: Grok 4
x-ai/grok-4 |
Redpill | 240.0B | 256K | ~86.40 | ~259.20 |
V
T
I
A
J
|
Inferred |
|
xAI: Grok 4.1 Fast
x-ai/grok-4.1-fast |
Redpill | 400.0B | 2.0M | ~120.00 | ~360.00 |
V
T
I
A
J
|
Estimated |
|
xAI: Grok 4.20 Beta
x-ai/grok-4.20-beta |
Openrouter | 4.2B | 2.0M | ~2.52 | ~7.56 |
-
|
Estimated |
|
xAI: Grok Code Fast 1
x-ai/grok-code-fast-1 |
Redpill | 200.0B | 256K | ~60.00 | ~180.00 |
V
T
I
A
J
|
Estimated |
|
Xiaomi: MiMo-V2-Flash
xiaomi/mimo-v2-flash |
Openrouter | 309.0B | 262K | ~154.50 | ~463.50 |
-
|
Estimated |
|
Xiaomi: MiMo-V2-Flash (free)
xiaomi/mimo-v2-flash:free |
Openrouter | 309.0B | 262K | ~154.50 | ~463.50 |
-
|
Estimated |
|
Z.AI: GLM 4 32B
z-ai/glm-4-32b |
Openrouter | 32.0B | 128K | ~16.00 | ~48.00 |
-
|
Estimated |
|
Z.AI: GLM 4.5V
z-ai/glm-4.5v |
Openrouter | 106.0B | 66K | ~38.16 | ~114.48 |
-
|
Estimated |
|
Z.AI: GLM 4.7
z-ai/glm-4.7 |
Redpill | 57.0B | 131K | ~34.20 | ~102.60 |
V
T
I
A
J
|
Inferred |
|
Z.AI: GLM 4.7 Flash
z-ai/glm-4.7-flash |
Redpill | 30.0B | 203K | ~15.00 | ~45.00 |
V
T
I
A
J
|
Estimated |
|
Z.AI: GLM 5
z-ai/glm-5 |
Redpill | 0.0B | 203K | ~2.00 | ~6.00 |
V
T
I
A
J
|
Estimated |
Frequently Asked Questions
How much energy does AI use?
- Small models: 0.1-0.3 Wh per request
- Large models: 1-5 Wh per request
- GPT-4o-mini: optimized for efficiency
- GPT-4/Claude Opus: highest consumption
- FlowDot shows estimates per model
Why does FlowDot track AI energy usage?
- We believe in transparency
- Most platforms hide energy costs
- Helps users make informed choices
- Encourages efficient model selection
- Shows true workflow cost
Can I reduce my AI carbon footprint?
- Choose smaller models for simple tasks
- Use GPT-4o-mini instead of GPT-4 when possible
- Run local AI with Ollama on renewable power
- Batch requests to reduce overhead
- Track consumption with FlowDot estimates
- Write concise prompts for fewer tokens
- Cache repeated queries
- Choose efficient providers
What is the most energy-efficient AI model?
- GPT-4o-mini: OpenAI's efficient option
- Claude 3 Haiku: Anthropic's fast model
- Gemini 1.5 Flash: Google's efficient choice
- Llama 3.1 8B: excellent local option
- Mistral 7B: lightweight and capable
- Phi-3 Mini: Microsoft's small model
How does AI energy compare to other activities?
- GPT-4 query: 1-5 Wh (60W bulb for 1-5 min)
- GPT-3 training: ~1,287 MWh total
- Equivalent to ~120 US homes per year
- Efficiency improves 10-100x per generation
- Modern data centers: PUE of 1.1-1.2
- Cooling adds 10-20% energy overhead
Quick Reference Guide
Low Energy Models (Best for Simple Tasks)
- GPT-4o-mini: fast and efficient
- Claude 3 Haiku: optimized for speed
- Gemini 1.5 Flash: designed for efficiency
- Llama 3.1 8B: great for local use
- Mistral 7B: excellent efficiency ratio
- Phi-3 Mini: Microsoft's efficient option
When to Use Larger Models
- Complex reasoning tasks
- Multi-step analysis
- Creative writing requiring nuance
- Code generation for large projects
- Tasks requiring broad knowledge
- High-stakes decisions needing accuracy
Call for Transparency
We encourage all AI providers to publicly disclose energy consumption data. Transparency is essential for building a sustainable AI future.
Share Energy DataAre you an AI provider with official energy data? We'd love to hear from you.