About Workflows Compare Docs FAQs API Tutorials MCP

Environmental Impact of AI

Understanding the energy consumption of AI models is important for sustainability. We provide estimates to help inform your choices - but these are not measurements.

TL;DR - What You Need to Know

FlowDot is an AI workflow platform that provides energy transparency for AI model usage. Here's the key information:

Have questions? Check our FAQs page for more answers.

Important: What FlowDot Is (And Isn't)

FlowDot does not train or run any AI models. We are a workflow platform that connects you to third-party AI providers (OpenAI, Anthropic, Google, etc.) through their APIs. When you use an AI model through FlowDot, the actual computation happens on the provider's servers - not ours.

This means we cannot directly measure energy consumption. We don't have access to the providers' data centers, power meters, or infrastructure. Everything on this page is our best attempt to estimate and inform - not measured reality.

Key Terms & Definitions

Watt-hour (Wh)
A unit of energy equal to one watt of power sustained for one hour. Used to measure AI inference energy consumption.
Token
A unit of text processing in AI models. One token is approximately 4 characters or 0.75 words in English.
Inference
The process of running a trained AI model to generate responses. Consumes energy for each request processed.
Parameters
The learned weights in a neural network. GPT-4 has approximately 1.8 trillion parameters; more parameters typically means higher energy use.
PUE (Power Usage Effectiveness)
A ratio measuring data center efficiency. A PUE of 1.2 means 20% of energy goes to cooling and overhead.
Carbon Footprint
The total greenhouse gas emissions caused by AI computation, measured in CO2 equivalent (CO2e).

The Transparency Problem

Large Language Models require substantial computational resources. Most AI providers do not disclose energy data, creating challenges:

  • Understand the true environmental cost of AI services
  • Make informed choices about which models to use
  • Hold providers accountable for their carbon footprint
  • Develop more energy-efficient AI systems
  • Compare energy efficiency across different providers
  • Track improvements in model efficiency over time
  • Set meaningful sustainability goals
  • Report accurately on AI environmental impact

Key AI Energy Statistics

Training Energy Facts

  • GPT-3 training: ~1,287 MWh
  • Equal to ~120 US homes per year
  • Training runs 24/7 for weeks
  • Multiple GPUs running in parallel
  • Cooling adds 10-20% overhead
  • One-time cost per model version

Inference Energy Facts

  • Scales with usage volume
  • Can exceed training cost over time
  • Optimizations reduce per-query cost
  • Batching improves efficiency
  • Smaller models use less per query
  • Caching reduces redundant compute

Model Size Impact

  • Energy scales with parameters
  • GPT-4 has ~1.8T parameters
  • GPT-4o-mini is ~10x more efficient
  • Smaller models for simple tasks
  • Larger models for complex reasoning
  • Architecture affects efficiency

Efficiency Trends

  • 10-100x efficiency gains per generation
  • Better architectures use less energy
  • Quantization reduces memory needs
  • Pruning removes unnecessary weights
  • Distillation creates efficient models
  • Hardware improvements continue

Practical Tips for Reducing AI Energy Use

Choose the Right Model

  • Use smaller models for simple tasks
  • GPT-4o-mini uses ~10x less energy than GPT-4
  • Claude 3 Haiku is optimized for efficiency
  • Match model capability to task complexity

Optimize Your Prompts

  • Write concise, clear instructions
  • Avoid unnecessary context in prompts
  • Request shorter responses when appropriate
  • Batch multiple queries when possible

Use Local Models

  • Run Ollama on your own hardware
  • Power local setups with renewable energy
  • Llama 3.1 8B offers good efficiency
  • Full control over your energy source

Monitor with FlowDot

  • Track energy estimates per workflow
  • Identify high-consumption patterns
  • Compare models for the same task
  • Set team energy awareness goals

Our Methodology

We compile energy data from multiple sources and are fully transparent about the reliability and origin of each data point. Our goal is to provide the best available estimates while being honest about their limitations.

Data Source Types

Official
Provider Published Data - Data published directly by the AI provider in official documentation or sustainability reports. Most reliable but very rare.
Research
Academic Research - Data from peer-reviewed academic research and third-party studies.
Estimated
Estimated from Parameters - When we know a model's parameter count, we estimate energy using research-derived formulas.
Inferred
Inferred from Pricing - For models where providers don't disclose parameters, we infer approximate size from pricing. Least reliable.

How We Calculate Estimates

  • Start with known parameter counts when available
  • Apply energy-per-parameter formulas from research papers
  • Factor in typical data center PUE (1.1-1.2)
  • Account for inference optimization techniques
  • Cross-reference with pricing tiers as sanity check
  • Update estimates when new data becomes available
  • Flag uncertainty with tilde (~) prefix on values

Key Research Sources

Per-Model Energy Estimates

Important Disclaimer

The data below is NOT official. Unless explicitly marked as "Official" (green badge), all values are our own estimates and educated guesses based on publicly available information.

We make no claims about actual energy consumption. The true energy usage of these models is known only to their providers. Our estimates may be significantly different from reality.

750 models
Model ▴▾ Provider ▴▾ Params (B) ▴▾ Context ▴▾ Input Wh/1M ▴▾ Output Wh/1M ▴▾ Capabilities Source ▴▾
QwQ-32B-Preview
Qwen/QwQ-32B-Preview
Together 32.0B 33K ~19.20 ~57.60
V T I A J
Estimated
Qwen2.5-72B-Instruct-Turbo
Qwen/Qwen2.5-72B-Instruct-Turbo
Together 72.0B 131K ~43.20 ~129.60
V T I A J
Estimated
Qwen2.5-7B-Instruct-Turbo
Qwen/Qwen2.5-7B-Instruct-Turbo
Together 7.0B 131K ~4.20 ~12.60
V T I A J
Estimated
Qwen3-Coder-Next
Qwen/Qwen3-Coder-Next
Together 80.0B 131K ~24.00 ~72.00
V T I A J
Estimated
Qwen3-Next-80B-A3B-Instruct
Qwen/Qwen3-Next-80B-A3B-Instruct
Together 35.0B 131K ~21.00 ~63.00
V T I A J
Inferred
GLM-4.7
THUDM/GLM-4.7
Together 42.0B 131K ~25.20 ~75.60
V T I A J
Inferred
Cogito 671b V2 P1
accounts/cogito/models/cogito-671b-v2-p1
Fireworks 6.5B 164K ~3.90 ~11.70
V T I A J
Inferred
Deepseek R1
accounts/fireworks/models/deepseek-r1
Fireworks 90.0B 66K ~32.40 ~97.20
V T I A J
Inferred
Deepseek V3
accounts/fireworks/models/deepseek-v3
Fireworks 17.0B 66K ~6.12 ~18.36
V T I A J
Inferred
Deepseek V3p1
accounts/fireworks/models/deepseek-v3p1
Fireworks 6.5B 164K ~2.34 ~7.02
V T I A J
Inferred
Deepseek V3p2
accounts/fireworks/models/deepseek-v3p2
Fireworks 6.5B 164K ~2.34 ~7.02
V T I A J
Inferred
Flux 1 Schnell Fp8
accounts/fireworks/models/flux-1-schnell-fp8
Fireworks 5.9B - ~3.54 ~10.62
V T I A J
Inferred
Flux Kontext Max
accounts/fireworks/models/flux-kontext-max
Fireworks 5.9B - ~3.54 ~10.62
V T I A J
Inferred
Flux Kontext Pro
accounts/fireworks/models/flux-kontext-pro
Fireworks 5.9B - ~3.54 ~10.62
V T I A J
Inferred
Gemma2 9b It
accounts/fireworks/models/gemma2-9b-it
Fireworks 2.0B 8K ~2.00 ~6.00
V T I A J
Inferred
Glm 4p7
accounts/fireworks/models/glm-4p7
Fireworks 7.1B 203K ~4.26 ~12.78
V T I A J
Inferred
Glm 5
accounts/fireworks/models/glm-5
Fireworks 7.1B 203K ~4.26 ~12.78
V T I A J
Inferred
Gpt Oss 120b
accounts/fireworks/models/gpt-oss-120b
Fireworks 6.5B 131K ~3.90 ~11.70
V T I A J
Inferred
Kimi K2 Instruct 0905
accounts/fireworks/models/kimi-k2-instruct-0905
Fireworks 7.1B 262K ~4.26 ~12.78
V T I A J
Inferred
Kimi K2p5
accounts/fireworks/models/kimi-k2p5
Fireworks 7.1B 262K ~4.26 ~12.78
V T I A J
Inferred
Llama V3p1 405b Instruct
accounts/fireworks/models/llama-v3p1-405b-instruct
Fireworks 55.0B 131K ~33.00 ~99.00
V T I A J
Inferred
Llama V3p1 70b Instruct
accounts/fireworks/models/llama-v3p1-70b-instruct
Fireworks 19.0B 131K ~11.40 ~34.20
V T I A J
Inferred
Llama V3p1 8b Instruct
accounts/fireworks/models/llama-v3p1-8b-instruct
Fireworks 2.2B 131K ~2.00 ~6.00
V T I A J
Inferred
Llama V3p2 11b Vision Instruct
accounts/fireworks/models/llama-v3p2-11b-vision-instruct
Fireworks 2.2B 131K ~2.00 ~6.00
V T I A J
Inferred
Llama V3p2 1b Instruct
accounts/fireworks/models/llama-v3p2-1b-instruct
Fireworks 2.2B 131K ~2.00 ~6.00
V T I A J
Inferred
Llama V3p2 3b Instruct
accounts/fireworks/models/llama-v3p2-3b-instruct
Fireworks 2.2B 131K ~2.00 ~6.00
V T I A J
Inferred
Llama V3p2 90b Vision Instruct
accounts/fireworks/models/llama-v3p2-90b-vision-instruct
Fireworks 19.0B 131K ~11.40 ~34.20
V T I A J
Inferred
Llama V3p3 70b Instruct
accounts/fireworks/models/llama-v3p3-70b-instruct
Fireworks 19.0B 131K ~11.40 ~34.20
V T I A J
Inferred
Mixtral 8x22b Instruct
accounts/fireworks/models/mixtral-8x22b-instruct
Fireworks 17.0B 66K ~6.12 ~18.36
V T I A J
Inferred
Mixtral 8x7b Instruct
accounts/fireworks/models/mixtral-8x7b-instruct
Fireworks 5.9B 33K ~2.12 ~6.37
V T I A J
Inferred
Phi 3 Vision 128k Instruct
accounts/fireworks/models/phi-3-vision-128k-instruct
Fireworks 2.2B 131K ~2.00 ~6.00
V T I A J
Inferred
Qwen2 72b Instruct
accounts/fireworks/models/qwen2-72b-instruct
Fireworks 17.0B 33K ~10.20 ~30.60
V T I A J
Inferred
Qwen2p5 72b Instruct
accounts/fireworks/models/qwen2p5-72b-instruct
Fireworks 17.0B 33K ~10.20 ~30.60
V T I A J
Inferred
AionLabs: Aion-1.0-Mini
aion-labs/aion-1.0-mini
Openrouter 32.0B 131K ~16.00 ~48.00
-
Estimated
AionLabs: Aion-RP 1.0 (8B)
aion-labs/aion-rp-llama-3.1-8b
Openrouter 8.0B 33K ~4.00 ~12.00
-
Estimated
AlfredPros: CodeLLaMa 7B Instruct Solidity
alfredpros/codellama-7b-instruct-solidity
Openrouter 7.0B 4K ~3.50 ~10.50
-
Estimated
Tongyi DeepResearch 30B A3B
alibaba/tongyi-deepresearch-30b-a3b
Openrouter 30.0B 131K ~15.00 ~45.00
-
Estimated
Tongyi DeepResearch 30B A3B (free)
alibaba/tongyi-deepresearch-30b-a3b:free
Openrouter 30.0B 131K ~15.00 ~45.00
-
Estimated
Allam 2 7b
allam-2-7b
Groq 7.2B 4K ~4.32 ~12.96
V T I A J
Inferred
AllenAI: Molmo2 8B
allenai/molmo-2-8b
Openrouter 8.0B 37K ~4.80 ~14.40
-
Estimated
AllenAI: Olmo 2 32B Instruct
allenai/olmo-2-0325-32b-instruct
Openrouter 32.0B 128K ~16.00 ~48.00
-
Estimated
AllenAI: Olmo 3 32B Think
allenai/olmo-3-32b-think
Openrouter 32.0B 66K ~19.20 ~57.60
-
Estimated
AllenAI: Olmo 3 32B Think (free)
allenai/olmo-3-32b-think:free
Openrouter 32.0B 66K ~16.00 ~48.00
-
Estimated
AllenAI: Olmo 3 7B Instruct
allenai/olmo-3-7b-instruct
Openrouter 7.0B 66K ~4.20 ~12.60
-
Estimated
AllenAI: Olmo 3 7B Think
allenai/olmo-3-7b-think
Openrouter 7.0B 66K ~4.20 ~12.60
-
Estimated
AllenAI: Olmo 3.1 32B Instruct
allenai/olmo-3.1-32b-instruct
Openrouter 32.0B 66K ~16.00 ~48.00
-
Estimated
AllenAI: Olmo 3.1 32B Think
allenai/olmo-3.1-32b-think
Openrouter 32.0B 66K ~16.00 ~48.00
-
Estimated
AllenAI: Olmo 3.1 32B Think (free)
allenai/olmo-3.1-32b-think:free
Openrouter 32.0B 66K ~16.00 ~48.00
-
Estimated
Goliath 120B
alpindale/goliath-120b
Openrouter 70.0B 6K ~35.00 ~105.00
-
Estimated
Amazon: Nova Lite 1.0
amazon/nova-lite-v1
Openrouter 20.0B 300K ~14.40 ~43.20
-
Estimated
Magnum v4 72B
anthracite-org/magnum-v4-72b
Openrouter 72.0B 16K ~36.00 ~108.00
-
Estimated
Anthropic: Claude 3 Haiku
anthropic/claude-3-haiku
Openrouter 20.0B 200K ~14.40 ~43.20
-
Estimated
Anthropic: Claude 3 Opus
anthropic/claude-3-opus
Openrouter 175.0B 200K ~126.00 ~378.00
-
Estimated
Anthropic: Claude 3.5 Haiku
anthropic/claude-3.5-haiku
Redpill 20.0B 200K ~14.40 ~43.20
V T I A J
Estimated
Anthropic: Claude 3.5 Haiku (2024-10-22)
anthropic/claude-3.5-haiku-20241022
Openrouter 20.0B 200K ~14.40 ~43.20
-
Estimated
Anthropic: Claude 3.5 Sonnet
anthropic/claude-3.5-sonnet
Openrouter 175.0B 200K ~84.00 ~252.00
-
Research
Anthropic: Claude 3.7 Sonnet
anthropic/claude-3.7-sonnet
Redpill 240.0B 200K ~144.00 ~432.00
V T I A J
Inferred
Anthropic: Claude Haiku 4.5
anthropic/claude-haiku-4.5
Redpill 76.0B 200K ~45.60 ~136.80
V T I A J
Inferred
Anthropic: Claude Opus 4
anthropic/claude-opus-4
Redpill 800.0B 200K ~480.00 ~1,440.00
V T I A J
Inferred
Anthropic: Claude Opus 4.1
anthropic/claude-opus-4.1
Redpill 800.0B 200K ~480.00 ~1,440.00
V T I A J
Inferred
Anthropic: Claude Opus 4.5
anthropic/claude-opus-4.5
Redpill 400.0B 200K ~240.00 ~720.00
V T I A J
Inferred
Anthropic: Claude Opus 4.6
anthropic/claude-opus-4.6
Redpill 540.0B 1.0M ~324.00 ~972.00
V T I A J
Inferred
Anthropic: Claude Sonnet 4
anthropic/claude-sonnet-4
Redpill 240.0B 1.0M ~144.00 ~432.00
V T I A J
Inferred
Anthropic: Claude Sonnet 4.5
anthropic/claude-sonnet-4.5
Redpill 240.0B 1.0M ~144.00 ~432.00
V T I A J
Inferred
Anthropic: Claude Sonnet 4.6
anthropic/claude-sonnet-4.6
Redpill 240.0B 1.0M ~144.00 ~432.00
V T I A J
Inferred
Arcee AI: Trinity Large Preview (free)
arcee-ai/trinity-large-preview:free
Openrouter 400.0B 131K ~200.00 ~600.00
-
Estimated
Arcee AI: Trinity Mini
arcee-ai/trinity-mini
Openrouter 26.0B 131K ~13.00 ~39.00
-
Estimated
Arcee AI: Trinity Mini (free)
arcee-ai/trinity-mini:free
Openrouter 26.0B 131K ~13.00 ~39.00
-
Estimated
ArliAI: QwQ 32B RpR v1
arliai/qwq-32b-arliai-rpr-v1
Openrouter 32.0B 33K ~16.00 ~48.00
-
Estimated
Baidu: ERNIE 4.5 21B A3B
baidu/ernie-4.5-21b-a3b
Openrouter 21.0B 120K ~6.30 ~18.90
-
Estimated
Baidu: ERNIE 4.5 21B A3B Thinking
baidu/ernie-4.5-21b-a3b-thinking
Openrouter 21.0B 131K ~6.30 ~18.90
-
Estimated
Baidu: ERNIE 4.5 300B A47B
baidu/ernie-4.5-300b-a47b
Openrouter 300.0B 123K ~90.00 ~270.00
-
Estimated
Baidu: ERNIE 4.5 VL 28B A3B
baidu/ernie-4.5-vl-28b-a3b
Openrouter 28.0B 30K ~10.08 ~30.24
-
Estimated
Baidu: ERNIE 4.5 VL 424B A47B
baidu/ernie-4.5-vl-424b-a47b
Openrouter 424.0B 123K ~152.64 ~457.92
-
Estimated
ByteDance Seed: Seed 1.6
bytedance-seed/seed-1.6
Openrouter 1.6B 262K ~2.00 ~6.00
-
Estimated
ByteDance: UI-TARS 7B
bytedance/ui-tars-1.5-7b
Openrouter 72.0B 128K ~43.20 ~129.60
-
Estimated
Canopylabs/orpheus Arabic Saudi
canopylabs/orpheus-arabic-saudi
Groq 7.2B 4K ~4.32 ~12.96
V T I A J
Inferred
Canopylabs/orpheus V1 English
canopylabs/orpheus-v1-english
Groq 7.2B 4K ~4.32 ~12.96
V T I A J
Inferred
chatgpt-4o-latest Openai 220.0B 128K ~132.00 ~396.00
V T I A J
Inferred
chatgpt-image-latest Openai 220.0B 128K ~132.00 ~396.00
V T I A J
Inferred
Claude 2.0
claude-2.0
Anthropic 380.0B 200K ~228.00 ~684.00
V T I A J
Inferred
Claude 2.1
claude-2.1
Anthropic 380.0B 200K ~228.00 ~684.00
V T I A J
Inferred
Claude 3.5 Haiku
claude-3-5-haiku
Anthropic 68.0B 200K ~40.80 ~122.40
V T I A J
Inferred
Claude 3.5 Haiku
claude-3-5-haiku-20241022
Anthropic 20.0B 200K ~14.40 ~43.20
V T I A J
Estimated
Claude 3.5 Haiku Latest
claude-3-5-haiku-latest
Anthropic 20.0B 200K ~14.40 ~43.20
V T I A J
Estimated
Claude 3.5 Sonnet
claude-3-5-sonnet
Anthropic 240.0B 200K ~144.00 ~432.00
V T I A J
Inferred
Claude 3.5 Sonnet
claude-3-5-sonnet-20240620
Anthropic 240.0B 200K ~144.00 ~432.00
V T I A J
Inferred
Claude 3.5 Sonnet
claude-3-5-sonnet-20241022
Anthropic 175.0B 200K ~126.00 ~378.00
V T I A J
Estimated
Claude 3.5 Sonnet Latest
claude-3-5-sonnet-latest
Anthropic 175.0B 200K ~126.00 ~378.00
V T I A J
Estimated
Claude 3.7 Sonnet
claude-3-7-sonnet
Anthropic 240.0B 200K ~144.00 ~432.00
V T I A J
Inferred
Claude 3.7 Sonnet
claude-3-7-sonnet-20250219
Anthropic 175.0B 200K ~126.00 ~378.00
V T I A J
Estimated
Claude 3 Haiku
claude-3-haiku
Anthropic 31.0B 200K ~18.60 ~55.80
V T I A J
Inferred
Claude 3 Haiku
claude-3-haiku-20240307
Anthropic 20.0B 200K ~14.40 ~43.20
V T I A J
Estimated
Claude 3 Opus
claude-3-opus
Anthropic 800.0B 200K ~480.00 ~1,440.00
V T I A J
Inferred
Claude 3 Opus
claude-3-opus-20240229
Anthropic 175.0B 200K ~126.00 ~378.00
V T I A J
Estimated
Claude 3 Opus Latest
claude-3-opus-latest
Anthropic 175.0B 200K ~126.00 ~378.00
V T I A J
Estimated
Claude 3 Sonnet
claude-3-sonnet
Anthropic 240.0B 200K ~144.00 ~432.00
V T I A J
Inferred
Claude 3 Sonnet
claude-3-sonnet-20240229
Anthropic 70.0B 200K ~50.40 ~151.20
V T I A J
Estimated
Claude Haiku 4
claude-haiku-4
Anthropic 68.0B 200K ~40.80 ~122.40
V T I A J
Inferred
Claude Haiku 4.1
claude-haiku-4-1
Anthropic 68.0B 200K ~40.80 ~122.40
V T I A J
Inferred
Claude Haiku 4.5
claude-haiku-4-5-20251001
Anthropic 4.5B - ~2.70 ~8.10
-
Estimated
Claude Instant 1.2
claude-instant-1.2
Anthropic 43.0B 100K ~25.80 ~77.40
V T I A J
Inferred
Claude Opus 4
claude-opus-4
Anthropic 400.0B 200K ~240.00 ~720.00
V T I A J
Inferred
Claude Opus 4.1
claude-opus-4-1
Anthropic 400.0B 200K ~240.00 ~720.00
V T I A J
Inferred
Claude Opus 4.1
claude-opus-4-1-20250805
Anthropic 200.0B - ~120.00 ~360.00
-
Estimated
Claude Opus 4
claude-opus-4-20250514
Anthropic 200.0B 200K ~120.00 ~360.00
V T I A J
Estimated
Claude Opus 4.5
claude-opus-4-5
Anthropic 400.0B 200K ~240.00 ~720.00
V T I A J
Inferred
Claude Opus 4.5
claude-opus-4-5-20251101
Anthropic 200.0B 200K ~120.00 ~360.00
V T I A J
Estimated
Claude Opus 4.6
claude-opus-4-6
Anthropic 400.0B 200K ~240.00 ~720.00
V T I A J
Inferred
Claude Sonnet 4
claude-sonnet-4
Anthropic 240.0B 200K ~144.00 ~432.00
V T I A J
Inferred
Claude Sonnet 4
claude-sonnet-4-20250514
Anthropic 175.0B 200K ~105.00 ~315.00
V T I A J
Estimated
Claude Sonnet 4.5
claude-sonnet-4-5
Anthropic 240.0B 200K ~144.00 ~432.00
V T I A J
Inferred
Claude Sonnet 4.5
claude-sonnet-4-5-20250514
Anthropic 240.0B 200K ~144.00 ~432.00
V T I A J
Inferred
Claude Sonnet 4.5
claude-sonnet-4-5-20250929
Anthropic 175.0B - ~105.00 ~315.00
-
Estimated
codellama:13b Ollama 13.0B - ~7.80 ~23.40
-
Estimated
codellama:34b Ollama 34.0B - ~20.40 ~61.20
-
Estimated
codellama:7b Ollama 7.0B - ~4.20 ~12.60
-
Estimated
codestral-2508 Mistral 20.0B 256K ~12.00 ~36.00
V T I A J
Inferred
codestral-embed Mistral 20.0B 256K ~12.00 ~36.00
V T I A J
Inferred
codestral-embed
codestral-embed-2505
Mistral 20.0B 256K ~12.00 ~36.00
V T I A J
Inferred
codestral-2508
codestral-latest
Mistral 20.0B 256K ~12.00 ~36.00
V T I A J
Inferred
Codestral Mamba
codestral-mamba
Mistral 7.0B - ~4.20 ~12.60
-
Estimated
Codestral Mamba Latest
codestral-mamba-latest
Mistral 3.7B 256K ~2.22 ~6.66
V T I A J
Inferred
codex-mini-latest Openai 84.0B 200K ~42.00 ~126.00
-
Inferred
Venice: Uncensored (free)
cognitivecomputations/dolphin-mistral-24b-venice-edition:free
Openrouter 24.0B 33K ~12.00 ~36.00
-
Estimated
Cohere: Command A
cohere/command-a
Openrouter 111.0B 256K ~55.50 ~166.50
-
Estimated
Cohere: Command R7B (12-2024)
cohere/command-r7b-12-2024
Openrouter 7.0B 128K ~3.50 ~10.50
-
Estimated
Command
command
Cohere 34.0B 4K ~20.40 ~61.20
V T I A J
Inferred
Command A
command-a
Cohere 76.0B 256K ~45.60 ~136.80
V T I A J
Inferred
Command A 03 2025
command-a-03-2025
Cohere 76.0B 256K ~45.60 ~136.80
V T I A J
Inferred
command-a-reasoning-08-2025 Cohere 76.0B 289K ~45.60 ~136.80
V T I A J
Inferred
command-a-vision-07-2025 Cohere 70.0B 128K ~42.00 ~126.00
V T I A J
Inferred
Command Light
command-light
Cohere 7.2B 4K ~4.32 ~12.96
V T I A J
Inferred
Command Light Nightly
command-light-nightly
Cohere 8.0B 8K ~4.80 ~14.40
V T I A J
Inferred
Command Nightly
command-nightly
Cohere 38.0B 8K ~22.80 ~68.40
V T I A J
Inferred
Command R
command-r
Cohere 35.0B 128K ~21.00 ~63.00
V T I A J
Inferred
Command R 03 2024
command-r-03-2024
Cohere 35.0B 128K ~21.00 ~63.00
V T I A J
Inferred
command-r-08-2024 Cohere 2.2B 132K ~2.00 ~6.00
V T I A J
Inferred
Command R Plus
command-r-plus
Cohere 120.0B 128K ~72.00 ~216.00
V T I A J
Inferred
Command R Plus 04 2024
command-r-plus-04-2024
Cohere 220.0B 128K ~132.00 ~396.00
V T I A J
Inferred
Command R Plus 08 2024
command-r-plus-08-2024
Cohere 120.0B 128K ~72.00 ~216.00
V T I A J
Inferred
dall-e-2 Openai 3.5B - ~2.52 ~7.56
V T I A J
Estimated
dall-e-3 Openai 12.0B - ~8.64 ~25.92
V T I A J
Estimated
dbrx-instruct
databricks/dbrx-instruct
Together 25.0B 33K ~15.00 ~45.00
V T I A J
Inferred
Cogito V2 Preview Llama 109B
deepcogito/cogito-v2-preview-llama-109b-moe
Openrouter 17.0B 33K ~10.20 ~30.60
-
Estimated
Deep Cogito: Cogito V2 Preview Llama 405B
deepcogito/cogito-v2-preview-llama-405b
Openrouter 405.0B 33K ~202.50 ~607.50
-
Estimated
Deep Cogito: Cogito V2 Preview Llama 70B
deepcogito/cogito-v2-preview-llama-70b
Openrouter 70.0B 33K ~35.00 ~105.00
-
Estimated
Deep Cogito: Cogito v2.1 671B
deepcogito/cogito-v2.1-671b
Openrouter 671.0B 128K ~201.30 ~603.90
-
Estimated
DeepSeek-R1
deepseek-ai/DeepSeek-R1
Together 80.0B 66K ~28.80 ~86.40
V T I A J
Inferred
DeepSeek-R1-0528
deepseek-ai/DeepSeek-R1-0528
Together 80.0B 66K ~28.80 ~86.40
V T I A J
Inferred
DeepSeek-R1-Distill-Llama-70B
deepseek-ai/DeepSeek-R1-Distill-Llama-70B
Together 6.1B 66K ~2.20 ~6.59
V T I A J
Inferred
DeepSeek-V3
deepseek-ai/DeepSeek-V3
Together 671.0B 66K ~201.30 ~603.90
V T I A J
Estimated
DeepSeek-V3.1
deepseek-ai/DeepSeek-V3.1
Together 34.0B 66K ~12.24 ~36.72
V T I A J
Inferred
DeepSeek Deepseek Chat
deepseek-chat
Deepseek 671.0B 64K ~201.30 ~603.90
V T I A J
Estimated
DeepSeek Deepseek Coder
deepseek-coder
Deepseek 3.8B 128K ~2.00 ~6.00
V T I A J
Inferred
deepseek-coder:33b Ollama 33.0B - ~19.80 ~59.40
-
Estimated
deepseek-coder:6.7b Ollama 6.7B - ~4.02 ~12.06
-
Estimated
DeepSeek Deepseek Lite
deepseek-lite
Deepseek 3.4B 32K ~2.00 ~6.00
V T I A J
Inferred
Deepseek R1 Distill Llama 70b
deepseek-r1-distill-llama-70b
Groq 14.0B 33K ~5.04 ~15.12
V T I A J
Inferred
DeepSeek Deepseek Reasoner
deepseek-reasoner
Deepseek 671.0B 64K ~201.30 ~603.90
V T I A J
Estimated
DeepSeek Deepseek V2
deepseek-v2
Deepseek 22.0B 64K ~7.92 ~23.76
V T I A J
Inferred
DeepSeek Deepseek V2.5
deepseek-v2.5
Deepseek 22.0B 64K ~7.92 ~23.76
V T I A J
Inferred
DeepSeek: DeepSeek V3
deepseek/deepseek-chat
Openrouter 671.0B 164K ~201.30 ~603.90
-
Estimated
DeepSeek: DeepSeek V3 0324
deepseek/deepseek-chat-v3-0324
Openrouter 685.0B 164K ~342.50 ~1,027.50
-
Estimated
DeepSeek V3.1
deepseek/deepseek-chat-v3.1
Redpill 48.0B 164K ~17.28 ~51.84
V T I A J
Inferred
DeepSeek: DeepSeek Prover V2
deepseek/deepseek-prover-v2
Openrouter 671.0B 164K ~335.50 ~1,006.50
-
Estimated
DeepSeek: R1
deepseek/deepseek-r1
Openrouter 671.0B 164K ~201.30 ~603.90
-
Estimated
DeepSeek: R1 0528
deepseek/deepseek-r1-0528
Redpill 42.0B 164K ~15.12 ~45.36
V T I A J
Inferred
DeepSeek: DeepSeek R1 0528 Qwen3 8B
deepseek/deepseek-r1-0528-qwen3-8b
Openrouter 8.0B 33K ~4.00 ~12.00
-
Estimated
DeepSeek: R1 0528 (free)
deepseek/deepseek-r1-0528:free
Openrouter 671.0B 164K ~201.30 ~603.90
-
Estimated
DeepSeek: R1 Distill Llama 70B
deepseek/deepseek-r1-distill-llama-70b
Openrouter 70.0B 131K ~42.00 ~126.00
-
Estimated
DeepSeek: R1 Distill Qwen 14B
deepseek/deepseek-r1-distill-qwen-14b
Openrouter 14.0B 33K ~8.40 ~25.20
-
Estimated
DeepSeek: R1 Distill Qwen 32B
deepseek/deepseek-r1-distill-qwen-32b
Openrouter 32.0B 64K ~19.20 ~57.60
-
Estimated
DeepSeek: DeepSeek V3.1 Terminus
deepseek/deepseek-v3.1-terminus
Openrouter 671.0B 164K ~335.50 ~1,006.50
-
Estimated
DeepSeek: DeepSeek V3.1 Terminus (exacto)
deepseek/deepseek-v3.1-terminus:exacto
Openrouter 671.0B 164K ~335.50 ~1,006.50
-
Estimated
DeepSeek: DeepSeek V3.2
deepseek/deepseek-v3.2
Redpill 5.3B 164K ~2.00 ~6.00
V T I A J
Inferred
Devstral 2
devstral-2
Mistral 2.2B 128K ~2.00 ~6.00
V T I A J
Inferred
devstral-2512 Mistral 2.2B 128K ~2.00 ~6.00
V T I A J
Inferred
devstral-2512
devstral-latest
Mistral 4.0B 128K ~2.40 ~7.20
V T I A J
Inferred
Devstral Medium
devstral-medium
Mistral 42.0B 128K ~25.20 ~75.60
V T I A J
Inferred
devstral-medium-2507 Mistral 42.0B 128K ~25.20 ~75.60
V T I A J
Inferred
devstral-2512
devstral-medium-latest
Mistral 42.0B 128K ~25.20 ~75.60
V T I A J
Inferred
Devstral Small
devstral-small
Mistral 4.0B 128K ~2.40 ~7.20
V T I A J
Inferred
devstral-small-2507 Mistral 4.0B 128K ~2.40 ~7.20
V T I A J
Inferred
labs-devstral-small-2512
devstral-small-latest
Mistral 4.0B 128K ~2.40 ~7.20
V T I A J
Inferred
Distil Whisper Large V3 En
distil-whisper-large-v3-en
Groq 2.0B - ~2.00 ~6.00
V T I A J
Inferred
EleutherAI: Llemma 7b
eleutherai/llemma_7b
Openrouter 7.0B 4K ~3.50 ~10.50
-
Estimated
embed-english-light-v3.0 Cohere 3.3B 512 ~2.00 ~6.00
V T I A J
Inferred
embed-english-v3.0 Cohere 3.3B 512 ~2.00 ~6.00
V T I A J
Inferred
Embed Multilingual Light V3.0
embed-multilingual-light-v3.0
Cohere 3.3B 512 ~2.00 ~6.00
V T I A J
Inferred
Embed Multilingual V3.0
embed-multilingual-v3.0
Cohere 3.3B 512 ~2.00 ~6.00
V T I A J
Inferred
Embedding 001
embedding-001
Google 1.8B 2K ~2.00 ~6.00
V T I A J
Inferred
EssentialAI: Rnj 1 Instruct
essentialai/rnj-1-instruct
Openrouter 8.0B 33K ~4.00 ~12.00
-
Estimated
Gemini 1.0 Pro
gemini-1.0-pro
Google 32.0B 32K ~11.52 ~34.56
V T I A J
Inferred
Gemini 1.5 Flash
gemini-1.5-flash
Google 4.4B 1.0M ~2.00 ~6.00
V T I A J
Inferred
Gemini 1.5 Flash 8b
gemini-1.5-flash-8b
Google 2.4B 1.0M ~2.00 ~6.00
V T I A J
Inferred
Gemini 1.5 Flash Latest
gemini-1.5-flash-latest
Google 4.4B 1.0M ~2.00 ~6.00
V T I A J
Inferred
Gemini 1.5 Pro
gemini-1.5-pro
Google 76.0B 2.0M ~27.36 ~82.08
V T I A J
Inferred
Gemini 1.5 Pro Latest
gemini-1.5-pro-latest
Google 76.0B 2.0M ~27.36 ~82.08
V T I A J
Inferred
Gemini 2.0 Flash
gemini-2.0-flash
Google 8.0B 1.0M ~4.80 ~14.40
V T I A J
Estimated
Gemini 2.0 Flash 001
gemini-2.0-flash-001
Google 8.0B 1.0M ~4.80 ~14.40
V T I A J
Estimated
Gemini 2.0 Flash Exp
gemini-2.0-flash-exp
Google 50.0B 1.0M ~18.00 ~54.00
V T I A J
Estimated
Gemini 2.0 Flash-Lite
gemini-2.0-flash-lite
Google 8.0B 1.0M ~4.80 ~14.40
V T I A J
Estimated
Gemini 2.0 Flash-Lite 001
gemini-2.0-flash-lite-001
Google 8.0B 1.0M ~4.80 ~14.40
V T I A J
Estimated
Gemini 2.0 Flash Thinking Exp
gemini-2.0-flash-thinking-exp
Google 50.0B - ~18.00 ~54.00
-
Estimated
Gemini 2.5 Flash
gemini-2.5-flash
Google 8.0B 1.0M ~4.80 ~14.40
V T I A J
Estimated
Nano Banana
gemini-2.5-flash-image
Google 50.0B 33K ~18.00 ~54.00
V T I A J
Inferred
Gemini 2.5 Flash-Lite
gemini-2.5-flash-lite
Google 8.0B 1.0M ~4.80 ~14.40
V T I A J
Estimated
Gemini 2.5 Flash-Lite Preview Sep 2025
gemini-2.5-flash-lite-preview-09-2025
Google 60.0B 1.0M ~21.60 ~64.80
V T I A J
Inferred
Gemini 2.5 Flash Native Audio Latest
gemini-2.5-flash-native-audio-latest
Google 55.0B 131K ~19.80 ~59.40
V T I A J
Inferred
Gemini 2.5 Flash Native Audio Preview 09-2025
gemini-2.5-flash-native-audio-preview-09-2025
Google 55.0B 131K ~19.80 ~59.40
V T I A J
Inferred
Gemini 2.5 Flash Native Audio Preview 12-2025
gemini-2.5-flash-native-audio-preview-12-2025
Google 55.0B 131K ~19.80 ~59.40
V T I A J
Inferred
Gemini 2.5 Flash Preview TTS
gemini-2.5-flash-preview-tts
Google 50.0B 8K ~18.00 ~54.00
V T I A J
Inferred
Gemini 2.5 Pro
gemini-2.5-pro
Google 8.0B 1.0M ~4.80 ~14.40
V T I A J
Estimated
Gemini 2.5 Pro Preview TTS
gemini-2.5-pro-preview-tts
Google 110.0B 8K ~39.60 ~118.80
V T I A J
Inferred
Gemini 3 Flash Preview
gemini-3-flash-preview
Google 170.0B 1.0M ~61.20 ~183.60
V T I A J
Inferred
Nano Banana Pro
gemini-3-pro-image-preview
Google 150.0B 131K ~54.00 ~162.00
V T I A J
Inferred
Gemini 3 Pro Preview
gemini-3-pro-preview
Google 170.0B 1.0M ~61.20 ~183.60
V T I A J
Inferred
Gemini Pro
gemini-pro
Google 32.0B 32K ~11.52 ~34.56
V T I A J
Inferred
Gemini Pro Vision
gemini-pro-vision
Google 32.0B 16K ~11.52 ~34.56
V T I A J
Inferred
Gemma 7b It
gemma-7b-it
Groq 2.0B 8K ~2.00 ~6.00
V T I A J
Inferred
Gemma2 9b It
gemma2-9b-it
Groq 9.0B 8K ~5.40 ~16.20
V T I A J
Estimated
gemma2:27b Ollama 27.0B - ~16.20 ~48.60
-
Estimated
gemma2:2b Ollama 2.0B - ~2.00 ~6.00
-
Estimated
gemma2:9b Ollama 9.0B - ~5.40 ~16.20
-
Estimated
Google: Gemini 2.0 Flash Experimental (free)
google/gemini-2.0-flash-exp:free
Openrouter 50.0B 1.0M ~12.00 ~36.00
-
Research
Gemini 2.5 Flash
google/gemini-2.5-flash
Perplexity 50.0B - ~18.00 ~54.00
-
Estimated
Google: Gemini 2.5 Flash Lite
google/gemini-2.5-flash-lite
Redpill 20.0B 1.0M ~7.20 ~21.60
V T I A J
Estimated
Google: Gemini 2.5 Pro
google/gemini-2.5-pro
Redpill 240.0B 1.0M ~86.40 ~259.20
V T I A J
Inferred
Google: Gemini 3 Pro Preview
google/gemini-3-pro-preview
Redpill 290.0B 1.0M ~104.40 ~313.20
V T I A J
Inferred
Google: Gemma 2 27B
google/gemma-2-27b-it
Openrouter 27.0B 8K ~16.20 ~48.60
-
Estimated
Google: Gemma 2 9B
google/gemma-2-9b-it
Openrouter 9.0B 8K ~5.40 ~16.20
-
Estimated
Google: Gemma 3 12B
google/gemma-3-12b-it
Openrouter 12.0B 131K ~7.20 ~21.60
-
Estimated
Google: Gemma 3 12B (free)
google/gemma-3-12b-it:free
Openrouter 12.0B 33K ~7.20 ~21.60
-
Estimated
Google: Gemma 3 27B
google/gemma-3-27b-it
Redpill 4.8B 54K ~2.88 ~8.64
V T I A J
Inferred
Google: Gemma 3 27B (free)
google/gemma-3-27b-it:free
Openrouter 27.0B 131K ~16.20 ~48.60
-
Estimated
Google: Gemma 3 4B
google/gemma-3-4b-it
Openrouter 4.0B 96K ~2.40 ~7.20
-
Estimated
Google: Gemma 3 4B (free)
google/gemma-3-4b-it:free
Openrouter 4.0B 33K ~2.40 ~7.20
-
Estimated
Google: Gemma 3n 2B (free)
google/gemma-3n-e2b-it:free
Openrouter 2.0B 8K ~2.00 ~6.00
-
Estimated
Google: Gemma 3n 4B
google/gemma-3n-e4b-it
Openrouter 4.0B 33K ~2.00 ~6.00
-
Estimated
Google: Gemma 3n 4B (free)
google/gemma-3n-e4b-it:free
Openrouter 4.0B 8K ~2.00 ~6.00
-
Estimated
gpt-3.5-turbo Openai 20.0B 16K ~12.00 ~36.00
V T I A J
Estimated
gpt-3.5-turbo-0125 Openai 20.0B 16K ~12.00 ~36.00
V T I A J
Estimated
gpt-3.5-turbo-1106 Openai 20.0B 16K ~12.00 ~36.00
V T I A J
Estimated
gpt-3.5-turbo-16k Openai 20.0B 16K ~12.00 ~36.00
V T I A J
Estimated
gpt-3.5-turbo-instruct Openai 20.0B 16K ~12.00 ~36.00
V T I A J
Estimated
gpt-3.5-turbo-instruct-0914 Openai 20.0B 16K ~12.00 ~36.00
V T I A J
Estimated
gpt-4 Openai 200.0B 8K ~120.00 ~360.00
V T I A J
Estimated
gpt-4-0125-preview Openai 200.0B 8K ~120.00 ~360.00
V T I A J
Estimated
gpt-4-0613 Openai 0.0B 8K ~2.00 ~6.00
V T I A J
Estimated
gpt-4-1106-preview Openai 600.0B 8K ~360.00 ~1,080.00
V T I A J
Inferred
Gpt 4 32k
gpt-4-32k
Openai 220.0B - ~66.00 ~198.00
-
Estimated
Gpt 4 32k 0613
gpt-4-32k-0613
Openai 220.0B - ~66.00 ~198.00
-
Estimated
gpt-4-turbo Openai 200.0B 128K ~144.00 ~432.00
V T I A J
Estimated
gpt-4-turbo-2024-04-09 Openai 200.0B 128K ~144.00 ~432.00
V T I A J
Estimated
gpt-4-turbo-preview Openai 200.0B 128K ~144.00 ~432.00
V T I A J
Estimated
gpt-4.1 Openai 110.0B 1.0M ~66.00 ~198.00
V T I A J
Inferred
gpt-4.1-2025-04-14 Openai 110.0B 1.0M ~66.00 ~198.00
V T I A J
Inferred
gpt-4.1-mini Openai 8.0B 1.0M ~4.80 ~14.40
V T I A J
Estimated
gpt-4.1-mini-2025-04-14 Openai 8.0B 1.0M ~4.80 ~14.40
V T I A J
Estimated
gpt-4.1-nano Openai 4.0B 200K ~2.40 ~7.20
V T I A J
Estimated
gpt-4.1-nano-2025-04-14 Openai 4.0B 1.0M ~2.40 ~7.20
V T I A J
Estimated
gpt-4o Openai 200.0B 128K ~144.00 ~432.00
V T I A J
Estimated
gpt-4o-2024-05-13 Openai 200.0B 128K ~144.00 ~432.00
V T I A J
Estimated
gpt-4o-2024-08-06 Openai 200.0B 128K ~144.00 ~432.00
V T I A J
Estimated
gpt-4o-2024-11-20 Openai 200.0B 128K ~144.00 ~432.00
V T I A J
Estimated
gpt-4o-audio-preview Openai 120.0B 128K ~72.00 ~216.00
V T I A J
Inferred
gpt-4o-audio-preview-2024-12-17 Openai 120.0B 128K ~72.00 ~216.00
V T I A J
Inferred
gpt-4o-audio-preview-2025-06-03 Openai 120.0B 128K ~72.00 ~216.00
V T I A J
Inferred
gpt-4o-mini Openai 8.0B 128K ~5.76 ~17.28
V T I A J
Estimated
gpt-4o-mini-2024-07-18 Openai 8.0B 128K ~5.76 ~17.28
V T I A J
Estimated
gpt-4o-mini-audio-preview Openai 8.0B 128K ~4.80 ~14.40
V T I A J
Estimated
gpt-4o-mini-audio-preview-2024-12-17 Openai 8.0B 128K ~4.80 ~14.40
V T I A J
Estimated
gpt-4o-mini-realtime-preview Openai 8.0B 128K ~4.80 ~14.40
V T I A J
Estimated
gpt-4o-mini-realtime-preview-2024-12-17 Openai 8.0B 128K ~4.80 ~14.40
V T I A J
Estimated
gpt-4o-mini-search-preview Openai 8.0B 128K ~4.80 ~14.40
V T I A J
Estimated
gpt-4o-mini-search-preview-2025-03-11 Openai 8.0B 128K ~4.80 ~14.40
V T I A J
Estimated
gpt-4o-mini-transcribe Openai 8.0B 128K ~4.80 ~14.40
V T I A J
Estimated
gpt-4o-mini-transcribe-2025-03-20 Openai 8.0B 128K ~4.80 ~14.40
V T I A J
Estimated
gpt-4o-mini-transcribe-2025-12-15 Openai 8.0B 128K ~4.80 ~14.40
V T I A J
Estimated
gpt-4o-mini-tts Openai 8.0B 128K ~4.80 ~14.40
V T I A J
Estimated
gpt-4o-mini-tts-2025-03-20 Openai 8.0B 128K ~4.80 ~14.40
V T I A J
Estimated
gpt-4o-mini-tts-2025-12-15 Openai 8.0B 128K ~4.80 ~14.40
V T I A J
Estimated
gpt-4o-realtime-preview Openai 200.0B 128K ~144.00 ~432.00
V T I A J
Estimated
gpt-4o-realtime-preview-2024-12-17 Openai 200.0B 128K ~144.00 ~432.00
V T I A J
Estimated
gpt-4o-realtime-preview-2025-06-03 Openai 200.0B 128K ~144.00 ~432.00
V T I A J
Estimated
gpt-4o-search-preview Openai 120.0B 128K ~72.00 ~216.00
V T I A J
Inferred
gpt-4o-search-preview-2025-03-11 Openai 120.0B 128K ~72.00 ~216.00
V T I A J
Inferred
gpt-4o-transcribe Openai 200.0B 128K ~144.00 ~432.00
V T I A J
Estimated
gpt-4o-transcribe-diarize Openai 200.0B 128K ~144.00 ~432.00
V T I A J
Estimated
gpt-5 Openai 0.0B 200K ~2.00 ~6.00
V T I A J
Estimated
gpt-5-2025-08-07 Openai 0.0B 200K ~2.00 ~6.00
V T I A J
Estimated
gpt-5-chat-latest Openai 240.0B 200K ~144.00 ~432.00
V T I A J
Inferred
gpt-5-codex Openai 240.0B 200K ~144.00 ~432.00
V T I A J
Inferred
gpt-5-mini Openai 8.0B 200K ~4.80 ~14.40
V T I A J
Estimated
gpt-5-mini-2025-08-07 Openai 8.0B 200K ~4.80 ~14.40
V T I A J
Estimated
gpt-5-nano Openai 4.0B 200K ~2.40 ~7.20
V T I A J
Estimated
gpt-5-nano-2025-08-07 Openai 4.0B 200K ~2.40 ~7.20
V T I A J
Estimated
gpt-5-pro Openai 50.0B 200K ~30.00 ~90.00
V T I A J
Estimated
gpt-5-pro-2025-10-06 Openai 50.0B 200K ~30.00 ~90.00
V T I A J
Estimated
gpt-5-search-api Openai 300.0B 200K ~180.00 ~540.00
V T I A J
Estimated
gpt-5-search-api-2025-10-14 Openai 300.0B 200K ~180.00 ~540.00
V T I A J
Estimated
gpt-5.1 Openai 240.0B 200K ~144.00 ~432.00
V T I A J
Inferred
gpt-5.1-2025-11-13 Openai 240.0B 200K ~144.00 ~432.00
V T I A J
Inferred
gpt-5.1-chat-latest Openai 240.0B 200K ~144.00 ~432.00
V T I A J
Inferred
gpt-5.1-codex Openai 240.0B 200K ~144.00 ~432.00
V T I A J
Inferred
gpt-5.1-codex-max Openai 300.0B 200K ~180.00 ~540.00
V T I A J
Estimated
gpt-5.1-codex-mini Openai 8.0B 200K ~4.80 ~14.40
V T I A J
Estimated
gpt-5.2 Openai 8.0B 200K ~4.80 ~14.40
V T I A J
Estimated
gpt-5.2-2025-12-11 Openai 240.0B 200K ~144.00 ~432.00
V T I A J
Inferred
gpt-5.2-chat-latest Openai 240.0B 200K ~144.00 ~432.00
V T I A J
Inferred
gpt-5.2-codex Openai 240.0B 200K ~144.00 ~432.00
V T I A J
Inferred
gpt-5.2-pro Openai 50.0B 200K ~30.00 ~90.00
V T I A J
Estimated
gpt-5.2-pro-2025-12-11 Openai 50.0B 200K ~30.00 ~90.00
V T I A J
Estimated
gpt-audio Openai 200.0B - ~144.00 ~432.00
-
Estimated
gpt-audio-2025-08-28 Openai 200.0B - ~144.00 ~432.00
-
Estimated
gpt-audio-mini Openai 8.0B - ~4.80 ~14.40
-
Estimated
gpt-audio-mini-2025-10-06 Openai 8.0B - ~4.80 ~14.40
-
Estimated
gpt-audio-mini-2025-12-15 Openai 8.0B - ~4.80 ~14.40
-
Estimated
gpt-image-1 Openai 12.0B - ~8.64 ~25.92
-
Estimated
gpt-image-1-mini Openai 8.0B - ~5.76 ~17.28
-
Estimated
gpt-oss-120B
gpt-oss/gpt-oss-120B
Together 8.8B 128K ~5.28 ~15.84
V T I A J
Inferred
gpt-realtime Openai 200.0B - ~144.00 ~432.00
-
Estimated
gpt-realtime-2025-08-28 Openai 200.0B - ~144.00 ~432.00
-
Estimated
gpt-realtime-mini Openai 8.0B - ~4.80 ~14.40
-
Estimated
gpt-realtime-mini-2025-10-06 Openai 8.0B - ~4.80 ~14.40
-
Estimated
gpt-realtime-mini-2025-12-15 Openai 8.0B - ~4.80 ~14.40
-
Estimated
Grok 2
grok-2
Grok 314.0B - ~94.20 ~282.60
-
Estimated
Grok 2 1212
grok-2-1212
Xai 314.0B 131K ~94.20 ~282.60
V T I A J
Estimated
grok-2-image-1212 Grok 314.0B - ~113.04 ~339.12
-
Estimated
Grok 2 Latest
grok-2-latest
Xai 120.0B 131K ~43.20 ~129.60
V T I A J
Inferred
Grok 2 Mini
grok-2-mini
Grok 70.0B - ~21.00 ~63.00
-
Estimated
Grok 2 Vision 1212
grok-2-vision-1212
Xai 314.0B 33K ~113.04 ~339.12
V T I A J
Estimated
Grok 3
grok-3
Xai 220.0B 131K ~79.20 ~237.60
V T I A J
Inferred
Grok 3 Fast
grok-3-fast
Xai 48.0B 131K ~17.28 ~51.84
V T I A J
Inferred
Grok 3 Mini
grok-3-mini
Xai 100.0B 131K ~30.00 ~90.00
V T I A J
Estimated
Grok 4
grok-4
Xai 220.0B 131K ~79.20 ~237.60
V T I A J
Inferred
Grok 4 0709
grok-4-0709
Xai 220.0B 131K ~79.20 ~237.60
V T I A J
Inferred
Grok 4 1 Fast Non Reasoning
grok-4-1-fast-non-reasoning
Xai 400.0B 131K ~120.00 ~360.00
V T I A J
Estimated
Grok 4 1 Fast Reasoning
grok-4-1-fast-reasoning
Xai 400.0B 131K ~120.00 ~360.00
V T I A J
Estimated
Grok 4 Fast Non Reasoning
grok-4-fast-non-reasoning
Xai 400.0B 131K ~120.00 ~360.00
V T I A J
Estimated
Grok 4 Fast Reasoning
grok-4-fast-reasoning
Xai 400.0B 131K ~120.00 ~360.00
V T I A J
Estimated
Grok 4.1
grok-4.1
Xai 140.0B 2.0M ~50.40 ~151.20
V T I A J
Inferred
Grok 4.1 Fast
grok-4.1-fast
Xai 7.1B 2.0M ~2.56 ~7.67
V T I A J
Inferred
Grok 4.1 Mini
grok-4.1-mini
Xai 4.4B 2.0M ~2.00 ~6.00
V T I A J
Inferred
Grok Beta
grok-beta
Grok 314.0B - ~94.20 ~282.60
-
Estimated
Grok Vision Beta
grok-vision-beta
Xai 200.0B 8K ~72.00 ~216.00
V T I A J
Inferred
Groq/compound
groq/compound
Groq 8.8B 131K ~5.28 ~15.84
V T I A J
Inferred
Groq/compound Mini
groq/compound-mini
Groq 8.8B 131K ~5.28 ~15.84
V T I A J
Inferred
MythoMax 13B
gryphe/mythomax-l2-13b
Openrouter 13.0B 4K ~6.50 ~19.50
-
Estimated
IBM: Granite 4.0 Micro
ibm-granite/granite-4.0-h-micro
Openrouter 3.0B 131K ~2.00 ~6.00
-
Estimated
Inception: Mercury 2
inception/mercury-2
Openrouter 0.0B 128K ~2.00 ~6.00
-
Estimated
J2 Light
j2-light
Ai21 50.0B 8K ~30.00 ~90.00
V T I A J
Inferred
J2 Mid
j2-mid
Ai21 110.0B 8K ~66.00 ~198.00
V T I A J
Inferred
J2 Ultra
j2-ultra
Ai21 200.0B 8K ~120.00 ~360.00
V T I A J
Inferred
Jamba 1.5 Large
jamba-1.5-large
Ai21 110.0B 256K ~66.00 ~198.00
V T I A J
Inferred
Jamba 1.5 Mini
jamba-1.5-mini
Ai21 5.7B 256K ~3.42 ~10.26
V T I A J
Inferred
Jamba Instruct
jamba-instruct
Ai21 13.0B 256K ~7.80 ~23.40
V T I A J
Inferred
Jamba Instruct Preview
jamba-instruct-preview
Ai21 13.0B 256K ~7.80 ~23.40
V T I A J
Inferred
Kimi K2
kimi-k2
Moonshot 48.0B 131K ~28.80 ~86.40
V T I A J
Inferred
Kimi K2 0711
kimi-k2-0711
Moonshot 47.0B 131K ~28.20 ~84.60
V T I A J
Inferred
Kimi K2 0711 Preview
kimi-k2-0711-preview
Moonshot 47.0B 131K ~28.20 ~84.60
V T I A J
Inferred
Kimi K2 0905
kimi-k2-0905
Moonshot 44.0B 262K ~26.40 ~79.20
V T I A J
Inferred
Kimi K2 0905 Preview
kimi-k2-0905-preview
Moonshot 44.0B 262K ~26.40 ~79.20
V T I A J
Inferred
Kimi K2 Exacto
kimi-k2-exacto
Moonshot 44.0B 262K ~26.40 ~79.20
V T I A J
Inferred
Kimi K2 Thinking
kimi-k2-thinking
Moonshot 53.0B 262K ~31.80 ~95.40
V T I A J
Inferred
Kimi K2 Thinking Turbo
kimi-k2-thinking-turbo
Moonshot 53.0B 262K ~31.80 ~95.40
V T I A J
Inferred
Kimi K2 Turbo
kimi-k2-turbo
Moonshot 99.0B 131K ~59.40 ~178.20
V T I A J
Inferred
Kimi K2 Turbo Preview
kimi-k2-turbo-preview
Moonshot 110.0B 262K ~66.00 ~198.00
V T I A J
Inferred
Kimi K2.5
kimi-k2.5
Moonshot 57.0B 262K ~34.20 ~102.60
V T I A J
Inferred
Kimi Latest
kimi-latest
Moonshot 52.0B 131K ~31.20 ~93.60
V T I A J
Inferred
Kimi Long
kimi-long
Moonshot 170.0B 1.0M ~102.00 ~306.00
V T I A J
Inferred
LiquidAI: LFM2-24B-A2B
liquid/lfm-2-24b-a2b
Openrouter 24.0B 33K ~12.00 ~36.00
-
Estimated
LiquidAI/LFM2-2.6B
liquid/lfm-2.2-6b
Openrouter 2.6B 33K ~2.00 ~6.00
-
Estimated
LiquidAI: LFM2.5-1.2B-Instruct (free)
liquid/lfm-2.5-1.2b-instruct:free
Openrouter 1.2B 33K ~2.00 ~6.00
-
Estimated
LiquidAI: LFM2.5-1.2B-Thinking (free)
liquid/lfm-2.5-1.2b-thinking:free
Openrouter 1.2B 33K ~2.00 ~6.00
-
Estimated
LiquidAI/LFM2-8B-A1B
liquid/lfm2-8b-a1b
Openrouter 8.0B 33K ~4.00 ~12.00
-
Estimated
Llama 3.1 405b Reasoning
llama-3.1-405b-reasoning
Groq 20.0B 33K ~12.00 ~36.00
V T I A J
Inferred
Llama 3.1 70b Versatile
llama-3.1-70b-versatile
Groq 70.0B 131K ~42.00 ~126.00
V T I A J
Estimated
Llama 3.1 8b Instant
llama-3.1-8b-instant
Groq 8.0B 131K ~4.80 ~14.40
V T I A J
Estimated
Llama 3.1 Sonar Huge 128k Online
llama-3.1-sonar-huge-128k-online
Perplexity 70.0B 128K ~42.00 ~126.00
V T I A J
Inferred
Llama 3.1 Sonar Large 128k Chat
llama-3.1-sonar-large-128k-chat
Perplexity 70.0B - ~42.00 ~126.00
-
Estimated
Llama 3.1 Sonar Large 128k Online
llama-3.1-sonar-large-128k-online
Perplexity 22.0B 128K ~13.20 ~39.60
V T I A J
Inferred
Llama 3.1 Sonar Small 128k Chat
llama-3.1-sonar-small-128k-chat
Perplexity 8.0B - ~4.80 ~14.40
-
Estimated
Llama 3.1 Sonar Small 128k Online
llama-3.1-sonar-small-128k-online
Perplexity 2.2B 128K ~2.00 ~6.00
V T I A J
Inferred
Llama 3.2 11b Vision Preview
llama-3.2-11b-vision-preview
Groq 2.0B 8K ~2.00 ~6.00
V T I A J
Inferred
Llama 3.2 1b Preview
llama-3.2-1b-preview
Groq 2.0B 8K ~2.00 ~6.00
V T I A J
Inferred
Llama 3.2 3b Preview
llama-3.2-3b-preview
Groq 2.0B 8K ~2.00 ~6.00
V T I A J
Inferred
Llama 3.2 90b Vision Preview
llama-3.2-90b-vision-preview
Groq 17.0B 8K ~10.20 ~30.60
V T I A J
Inferred
Llama 3.3 70b Specdec
llama-3.3-70b-specdec
Groq 70.0B - ~42.00 ~126.00
-
Estimated
Llama 3.3 70b Versatile
llama-3.3-70b-versatile
Groq 70.0B 131K ~42.00 ~126.00
V T I A J
Estimated
Llama3 70b 8192
llama3-70b-8192
Groq 70.0B 8K ~42.00 ~126.00
V T I A J
Estimated
Llama3 8b 8192
llama3-8b-8192
Groq 8.0B 8K ~4.80 ~14.40
V T I A J
Estimated
Llama3 1 8b
llama3.1-8b
Cerebras 2.2B 128K ~2.00 ~6.00
V T I A J
Inferred
llama3.1:70b Ollama 70.0B - ~42.00 ~126.00
-
Estimated
llama3.1:8b Ollama 8.0B - ~4.80 ~14.40
-
Estimated
llama3.2:1b Ollama 1.0B - ~2.00 ~6.00
-
Estimated
llama3.2:3b Ollama 3.0B - ~2.00 ~6.00
-
Estimated
llama3:70b Ollama 70.0B - ~42.00 ~126.00
-
Estimated
llama3:8b Ollama 8.0B - ~4.80 ~14.40
-
Estimated
llava:13b Ollama 13.0B - ~9.36 ~28.08
-
Estimated
llava:34b Ollama 34.0B - ~24.48 ~73.44
-
Estimated
llava:7b Ollama 7.0B - ~5.04 ~15.12
-
Estimated
Magistral Medium 2506
magistral-medium-2506
Mistral 63.0B 40K ~37.80 ~113.40
V T I A J
Inferred
Magistral Medium 2507
magistral-medium-2507
Mistral 63.0B 40K ~37.80 ~113.40
V T I A J
Inferred
magistral-medium-2509 Mistral 63.0B 40K ~37.80 ~113.40
V T I A J
Inferred
magistral-medium-2509
magistral-medium-latest
Mistral 63.0B 40K ~37.80 ~113.40
V T I A J
Inferred
Magistral Small 2507
magistral-small-2507
Mistral 32.0B 40K ~19.20 ~57.60
V T I A J
Inferred
magistral-small-2509 Mistral 32.0B 40K ~19.20 ~57.60
V T I A J
Inferred
magistral-small-2509
magistral-small-latest
Mistral 32.0B 40K ~19.20 ~57.60
V T I A J
Inferred
Meituan: LongCat Flash Chat
meituan/longcat-flash-chat
Openrouter 560.0B 131K ~168.00 ~504.00
-
Estimated
Meituan: LongCat Flash Chat (free)
meituan/longcat-flash-chat:free
Openrouter 560.0B 131K ~168.00 ~504.00
-
Estimated
Llama-3-8B-Instruct-Lite
meta-llama/Llama-3-8B-Instruct-Lite
Together 2.0B 8K ~2.00 ~6.00
V T I A J
Inferred
Llama-3.2-11B-Vision-Instruct-Turbo
meta-llama/Llama-3.2-11B-Vision-Instruct-Turbo
Together 2.2B 131K ~2.00 ~6.00
V T I A J
Inferred
Llama-3.2-3B-Instruct-Turbo
meta-llama/Llama-3.2-3B-Instruct-Turbo
Together 2.2B 131K ~2.00 ~6.00
V T I A J
Inferred
Llama-3.2-90B-Vision-Instruct-Turbo
meta-llama/Llama-3.2-90B-Vision-Instruct-Turbo
Together 27.0B 131K ~16.20 ~48.60
V T I A J
Inferred
Llama-3.3-70B-Instruct-Turbo
meta-llama/Llama-3.3-70B-Instruct-Turbo
Together 70.0B 131K ~42.00 ~126.00
V T I A J
Estimated
Meta-Llama-3.1-405B-Instruct-Turbo
meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo
Together 405.0B 131K ~243.00 ~729.00
V T I A J
Estimated
Meta-Llama-3.1-70B-Instruct-Turbo
meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo
Together 70.0B 131K ~42.00 ~126.00
V T I A J
Estimated
Meta-Llama-3.1-8B-Instruct-Turbo
meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo
Together 8.0B 131K ~4.80 ~14.40
V T I A J
Estimated
Meta: Llama 3 70B Instruct
meta-llama/llama-3-70b-instruct
Openrouter 70.0B 8K ~28.00 ~84.00
-
Research
Meta: Llama 3 8B Instruct
meta-llama/llama-3-8b-instruct
Openrouter 8.0B 8K ~3.20 ~9.60
-
Research
Meta: Llama 3.1 405B (base)
meta-llama/llama-3.1-405b
Openrouter 405.0B 33K ~202.50 ~607.50
-
Estimated
Meta: Llama 3.1 405B Instruct
meta-llama/llama-3.1-405b-instruct
Openrouter 405.0B 131K ~162.00 ~486.00
-
Research
Meta: Llama 3.1 405B Instruct (free)
meta-llama/llama-3.1-405b-instruct:free
Openrouter 400.0B 131K ~200.00 ~600.00
-
Estimated
Meta: Llama 3.1 70B Instruct
meta-llama/llama-3.1-70b-instruct
Openrouter 70.0B 131K ~28.00 ~84.00
-
Research
Meta: Llama 3.1 8B Instruct
meta-llama/llama-3.1-8b-instruct
Openrouter 8.0B 131K ~3.20 ~9.60
-
Research
Meta: Llama 3.2 11B Vision Instruct
meta-llama/llama-3.2-11b-vision-instruct
Openrouter 11.0B 131K ~7.92 ~23.76
-
Estimated
Meta: Llama 3.2 1B Instruct
meta-llama/llama-3.2-1b-instruct
Openrouter 1.0B 60K ~2.00 ~6.00
-
Estimated
Meta: Llama 3.2 3B Instruct
meta-llama/llama-3.2-3b-instruct
Openrouter 3.0B 131K ~2.00 ~6.00
-
Estimated
Meta: Llama 3.2 3B Instruct (free)
meta-llama/llama-3.2-3b-instruct:free
Openrouter 3.0B 131K ~2.00 ~6.00
-
Estimated
Meta: Llama 3.2 90B Vision Instruct
meta-llama/llama-3.2-90b-vision-instruct
Openrouter 90.0B 33K ~64.80 ~194.40
-
Estimated
Meta: Llama 3.3 70B Instruct
meta-llama/llama-3.3-70b-instruct
Redpill 70.0B 131K ~28.00 ~84.00
V T I A J
Research
Meta: Llama 3.3 70B Instruct (free)
meta-llama/llama-3.3-70b-instruct:free
Openrouter 70.0B 131K ~35.00 ~105.00
-
Estimated
Meta: Llama 4 Maverick
meta-llama/llama-4-maverick
Openrouter 17.0B 1.0M ~6.12 ~18.36
-
Estimated
Meta: Llama 4 Scout
meta-llama/llama-4-scout
Openrouter 17.0B 328K ~6.12 ~18.36
-
Estimated
Meta Llama/llama 4 Scout 17b 16e Instruct
meta-llama/llama-4-scout-17b-16e-instruct
Groq 4.5B 131K ~2.70 ~8.10
V T I A J
Inferred
Meta: LlamaGuard 2 8B
meta-llama/llama-guard-2-8b
Openrouter 8.0B 8K ~4.00 ~12.00
-
Estimated
Llama Guard 3 8B
meta-llama/llama-guard-3-8b
Openrouter 8.0B 131K ~4.00 ~12.00
-
Estimated
Meta: Llama Guard 4 12B
meta-llama/llama-guard-4-12b
Openrouter 12.0B 164K ~7.20 ~21.60
-
Estimated
Meta Llama/llama Prompt Guard 2 22m
meta-llama/llama-prompt-guard-2-22m
Groq 1.8B 512 ~2.00 ~6.00
V T I A J
Inferred
Meta Llama/llama Prompt Guard 2 86m
meta-llama/llama-prompt-guard-2-86m
Groq 1.8B 512 ~2.00 ~6.00
V T I A J
Inferred
Llama 2 13b Chat
meta/llama-2-13b-chat
Replicate 5.3B 4K ~3.18 ~9.54
V T I A J
Inferred
Llama 2 70b Chat
meta/llama-2-70b-chat
Replicate 42.0B 4K ~25.20 ~75.60
V T I A J
Inferred
Llama 2 7b Chat
meta/llama-2-7b-chat
Replicate 2.8B 4K ~2.00 ~6.00
V T I A J
Inferred
Meta Llama 3 70b Instruct
meta/meta-llama-3-70b-instruct
Replicate 47.0B 8K ~28.20 ~84.60
V T I A J
Inferred
Meta Llama 3 8b Instruct
meta/meta-llama-3-8b-instruct
Replicate 3.1B 8K ~2.00 ~6.00
V T I A J
Inferred
Meta Llama 3.1 405b Instruct
meta/meta-llama-3.1-405b-instruct
Replicate 120.0B 131K ~72.00 ~216.00
V T I A J
Inferred
Microsoft: Phi-3 Medium 128K Instruct
microsoft/phi-3-medium-128k-instruct
Openrouter 14.0B 128K ~7.00 ~21.00
-
Estimated
Microsoft: Phi-3 Mini 128K Instruct
microsoft/phi-3-mini-128k-instruct
Openrouter 3.8B 128K ~2.00 ~6.00
-
Estimated
Microsoft: Phi-3.5 Mini 128K Instruct
microsoft/phi-3.5-mini-128k-instruct
Openrouter 3.8B 128K ~2.00 ~6.00
-
Estimated
Microsoft: Phi 4
microsoft/phi-4
Openrouter 14.0B 16K ~7.00 ~21.00
-
Estimated
Microsoft: Phi 4 Multimodal Instruct
microsoft/phi-4-multimodal-instruct
Openrouter 5.6B 131K ~3.36 ~10.08
-
Estimated
Microsoft: Phi 4 Reasoning Plus
microsoft/phi-4-reasoning-plus
Openrouter 14.0B 33K ~7.00 ~21.00
-
Estimated
WizardLM-2 8x22B
microsoft/wizardlm-2-8x22b
Openrouter 176.0B 66K ~52.80 ~158.40
-
Estimated
MiniMax: MiniMax-01
minimax/minimax-01
Openrouter 456.0B 1.0M ~164.16 ~492.48
-
Estimated
MiniMax: MiniMax M1
minimax/minimax-m1
Openrouter 456.0B 1.0M ~136.80 ~410.40
-
Estimated
MiniMax: MiniMax M2
minimax/minimax-m2
Openrouter 10.0B 262K ~5.00 ~15.00
-
Estimated
MiniMax: MiniMax M2.1
minimax/minimax-m2.1
Openrouter 10.0B 205K ~5.00 ~15.00
-
Estimated
MiniMax: MiniMax M2.5
minimax/minimax-m2.5
Openrouter 2.5B 197K ~2.00 ~6.00
-
Estimated
ministral-14b-2512 Mistral 2.2B 128K ~2.00 ~6.00
V T I A J
Inferred
ministral-14b-2512
ministral-14b-latest
Mistral 2.2B 128K ~2.00 ~6.00
V T I A J
Inferred
ministral-3b-2512 Mistral 2.2B 128K ~2.00 ~6.00
V T I A J
Inferred
ministral-3b-2512
ministral-3b-latest
Mistral 3.0B 128K ~2.00 ~6.00
V T I A J
Estimated
ministral-8b-2512 Mistral 2.2B 128K ~2.00 ~6.00
V T I A J
Inferred
ministral-8b-2512
ministral-8b-latest
Mistral 8.0B 128K ~4.80 ~14.40
V T I A J
Estimated
Mistral 7b Instruct
mistral-7b-instruct
Mistral 2.0B 32K ~2.00 ~6.00
V T I A J
Inferred
mistral-embed-2312
mistral-embed
Mistral 3.7B 8K ~2.22 ~6.66
V T I A J
Inferred
Mistral Large
mistral-large
Mistral 38.0B 256K ~22.80 ~68.40
V T I A J
Inferred
Mistral Large 2407
mistral-large-2407
Mistral 77.0B 128K ~46.20 ~138.60
V T I A J
Inferred
mistral-large-2411 Mistral 123.0B 128K ~73.80 ~221.40
V T I A J
Estimated
mistral-large-2512 Mistral 38.0B 256K ~22.80 ~68.40
V T I A J
Inferred
mistral-large-2512
mistral-large-latest
Mistral 38.0B 256K ~22.80 ~68.40
V T I A J
Inferred
pixtral-large-2411
mistral-large-pixtral-2411
Mistral 38.0B 256K ~22.80 ~68.40
V T I A J
Inferred
mistral-medium-2508
mistral-medium
Mistral 42.0B 128K ~25.20 ~75.60
V T I A J
Inferred
Mistral Medium 3
mistral-medium-3
Mistral 42.0B 128K ~25.20 ~75.60
V T I A J
Inferred
Mistral Medium 3.1
mistral-medium-3.1
Mistral 42.0B 128K ~25.20 ~75.60
V T I A J
Inferred
mistral-medium-2508
mistral-medium-latest
Mistral 42.0B 128K ~25.20 ~75.60
V T I A J
Inferred
Mistral Nemo
mistral-nemo
Mistral 2.2B 128K ~2.00 ~6.00
V T I A J
Inferred
Mistral Saba
mistral-saba
Mistral 8.0B 32K ~4.80 ~14.40
V T I A J
Inferred
Mistral Small
mistral-small
Mistral 2.0B 32K ~2.00 ~6.00
V T I A J
Inferred
Mistral Small 2409
mistral-small-2409
Mistral 22.0B - ~13.20 ~39.60
-
Estimated
Mistral Small 3.1
mistral-small-3.1
Mistral 2.0B 32K ~2.00 ~6.00
V T I A J
Inferred
Mistral Small 3.2
mistral-small-3.2
Mistral 2.0B 32K ~2.00 ~6.00
V T I A J
Inferred
Mistral Small Creative
mistral-small-creative
Mistral 3.7B 32K ~2.22 ~6.66
V T I A J
Inferred
mistral-small-2506
mistral-small-latest
Mistral 2.0B 32K ~2.00 ~6.00
V T I A J
Inferred
Mistral Tiny
mistral-tiny
Mistral 3.1B 32K ~2.00 ~6.00
V T I A J
Inferred
mistral:7b Ollama 7.0B - ~4.20 ~12.60
-
Estimated
Mistral 7B Instruct V0.3
mistralai/Mistral-7B-Instruct-v0.3
Together 7.0B - ~4.20 ~12.60
-
Estimated
Mistral-Small-3
mistralai/Mistral-Small-3
Together 3.7B 33K ~2.22 ~6.66
V T I A J
Inferred
Mixtral-8x22B-Instruct-v0.1
mistralai/Mixtral-8x22B-Instruct-v0.1
Together 141.0B 66K ~42.30 ~126.90
V T I A J
Estimated
Mixtral-8x7B-Instruct-v0.1
mistralai/Mixtral-8x7B-Instruct-v0.1
Together 47.0B 33K ~14.10 ~42.30
V T I A J
Estimated
Mistral: Codestral 2508
mistralai/codestral-2508
Openrouter 2.5B 256K ~2.00 ~6.00
-
Estimated
Mistral: Devstral 2 2512
mistralai/devstral-2512
Openrouter 123.0B 262K ~73.80 ~221.40
-
Estimated
Mistral: Devstral 2 2512 (free)
mistralai/devstral-2512:free
Openrouter 123.0B 262K ~73.80 ~221.40
-
Estimated
Mistral: Devstral Small 1.1
mistralai/devstral-small
Openrouter 24.0B 128K ~12.00 ~36.00
-
Estimated
Mistral: Devstral Small 2505
mistralai/devstral-small-2505
Openrouter 24.0B 128K ~12.00 ~36.00
-
Estimated
Mistral: Ministral 3 14B 2512
mistralai/ministral-14b-2512
Openrouter 14.0B 262K ~8.40 ~25.20
-
Estimated
Mistral: Ministral 3B
mistralai/ministral-3b
Openrouter 3.0B 131K ~2.00 ~6.00
-
Estimated
Mistral: Ministral 3 3B 2512
mistralai/ministral-3b-2512
Openrouter 3.0B 131K ~2.00 ~6.00
-
Estimated
Mistral: Ministral 8B
mistralai/ministral-8b
Openrouter 8.0B 131K ~4.80 ~14.40
-
Estimated
Mistral: Ministral 3 8B 2512
mistralai/ministral-8b-2512
Openrouter 8.0B 262K ~4.80 ~14.40
-
Estimated
Mistral: Mistral 7B Instruct
mistralai/mistral-7b-instruct
Openrouter 7.0B 33K ~4.20 ~12.60
-
Estimated
Mistral: Mistral 7B Instruct v0.1
mistralai/mistral-7b-instruct-v0.1
Openrouter 7.3B 3K ~3.65 ~10.95
-
Estimated
Mistral 7b Instruct V0.2
mistralai/mistral-7b-instruct-v0.2
Replicate 3.1B 33K ~2.00 ~6.00
V T I A J
Inferred
Mistral: Mistral 7B Instruct v0.3
mistralai/mistral-7b-instruct-v0.3
Openrouter 7.0B 33K ~4.20 ~12.60
-
Estimated
Mistral: Mistral 7B Instruct (free)
mistralai/mistral-7b-instruct:free
Openrouter 7.3B 33K ~3.65 ~10.95
-
Estimated
Mistral Large 2407
mistralai/mistral-large-2407
Openrouter 2.4B 131K ~2.00 ~6.00
-
Estimated
Mistral Large 2411
mistralai/mistral-large-2411
Openrouter 123.0B 131K ~49.20 ~147.60
-
Research
Mistral: Mistral Large 3 2512
mistralai/mistral-large-2512
Openrouter 41.0B 262K ~24.60 ~73.80
-
Estimated
Mistral: Mistral Nemo
mistralai/mistral-nemo
Openrouter 12.0B 131K ~7.20 ~21.60
-
Estimated
Mistral: Saba
mistralai/mistral-saba
Openrouter 24.0B 33K ~12.00 ~36.00
-
Estimated
Mistral: Mistral Small 3
mistralai/mistral-small-24b-instruct-2501
Openrouter 24.0B 33K ~12.00 ~36.00
-
Estimated
Mistral: Mistral Small 3.1 24B
mistralai/mistral-small-3.1-24b-instruct
Openrouter 24.0B 131K ~14.40 ~43.20
-
Estimated
Mistral: Mistral Small 3.1 24B (free)
mistralai/mistral-small-3.1-24b-instruct:free
Openrouter 24.0B 128K ~14.40 ~43.20
-
Estimated
Mistral: Mistral Small 3.2 24B
mistralai/mistral-small-3.2-24b-instruct
Openrouter 24.0B 131K ~14.40 ~43.20
-
Estimated
Mistral Tiny
mistralai/mistral-tiny
Openrouter 8.0B 33K ~4.00 ~12.00
-
Estimated
Mistral: Mixtral 8x22B Instruct
mistralai/mixtral-8x22b-instruct
Openrouter 141.0B 66K ~42.30 ~126.90
-
Estimated
Mistral: Mixtral 8x7B Instruct
mistralai/mixtral-8x7b-instruct
Openrouter 47.0B 33K ~14.10 ~42.30
-
Estimated
Mixtral 8x7b Instruct V0.1
mistralai/mixtral-8x7b-instruct-v0.1
Replicate 20.0B 33K ~7.20 ~21.60
V T I A J
Inferred
Mistral: Pixtral 12B
mistralai/pixtral-12b
Openrouter 12.0B 33K ~8.64 ~25.92
-
Estimated
Mistral: Pixtral Large 2411
mistralai/pixtral-large-2411
Openrouter 124.0B 131K ~74.40 ~223.20
-
Estimated
Mistral: Voxtral Small 24B 2507
mistralai/voxtral-small-24b-2507
Openrouter 24.0B 32K ~14.40 ~43.20
-
Estimated
Mixtral 8x22b Instruct
mixtral-8x22b-instruct
Mistral 70.0B 64K ~25.20 ~75.60
V T I A J
Inferred
Mixtral 8x7b 32768
mixtral-8x7b-32768
Groq 47.0B 33K ~14.10 ~42.30
V T I A J
Estimated
Mixtral 8x7b Instruct
mixtral-8x7b-instruct
Mistral 6.3B 32K ~2.27 ~6.80
V T I A J
Inferred
mixtral:8x22b Ollama 141.0B - ~42.30 ~126.90
-
Estimated
mixtral:8x7b Ollama 47.0B - ~14.10 ~42.30
-
Estimated
Moonshot V1 128k
moonshot-v1-128k
Moonshot 77.0B 131K ~46.20 ~138.60
V T I A J
Inferred
Moonshot V1 128k Vision Preview
moonshot-v1-128k-vision-preview
Moonshot 77.0B 131K ~46.20 ~138.60
V T I A J
Inferred
Moonshot V1 32k
moonshot-v1-32k
Moonshot 43.0B 33K ~25.80 ~77.40
V T I A J
Inferred
Moonshot V1 32k Vision Preview
moonshot-v1-32k-vision-preview
Moonshot 43.0B 33K ~25.80 ~77.40
V T I A J
Inferred
Moonshot V1 8k
moonshot-v1-8k
Moonshot 25.0B 8K ~15.00 ~45.00
V T I A J
Inferred
Moonshot V1 8k Vision Preview
moonshot-v1-8k-vision-preview
Moonshot 25.0B 8K ~15.00 ~45.00
V T I A J
Inferred
Kimi-K2.5
moonshotai/Kimi-K2.5
Together 57.0B 256K ~34.20 ~102.60
V T I A J
Inferred
MoonshotAI: Kimi Dev 72B
moonshotai/kimi-dev-72b
Openrouter 72.0B 131K ~36.00 ~108.00
-
Estimated
MoonshotAI: Kimi K2 0711
moonshotai/kimi-k2
Openrouter 32.0B 131K ~9.60 ~28.80
-
Estimated
MoonshotAI: Kimi K2 0905
moonshotai/kimi-k2-0905
Openrouter 32.0B 262K ~9.60 ~28.80
-
Estimated
MoonshotAI: Kimi K2 0905 (exacto)
moonshotai/kimi-k2-0905:exacto
Openrouter 32.0B 262K ~9.60 ~28.80
-
Estimated
Moonshotai/kimi K2 Instruct
moonshotai/kimi-k2-instruct
Groq 55.0B 131K ~33.00 ~99.00
V T I A J
Inferred
Moonshotai/kimi K2 Instruct 0905
moonshotai/kimi-k2-instruct-0905
Groq 60.0B 262K ~36.00 ~108.00
V T I A J
Inferred
MoonshotAI: Kimi K2 Thinking
moonshotai/kimi-k2-thinking
Redpill 46.0B 262K ~27.60 ~82.80
V T I A J
Inferred
MoonshotAI: Kimi K2.5
moonshotai/kimi-k2.5
Redpill 60.0B 262K ~36.00 ~108.00
V T I A J
Inferred
MoonshotAI: Kimi K2 0711 (free)
moonshotai/kimi-k2:free
Openrouter 32.0B 33K ~9.60 ~28.80
-
Estimated
MoonshotAI: Kimi Linear 48B A3B Instruct
moonshotai/kimi-linear-48b-a3b-instruct
Openrouter 48.0B 1.0M ~24.00 ~72.00
-
Estimated
NeverSleep: Lumimaid v0.2 8B
neversleep/llama-3.1-lumimaid-8b
Openrouter 8.0B 33K ~4.00 ~12.00
-
Estimated
Noromaid 20B
neversleep/noromaid-20b
Openrouter 20.0B 4K ~10.00 ~30.00
-
Estimated
Nous: DeepHermes 3 Mistral 24B Preview
nousresearch/deephermes-3-mistral-24b-preview
Openrouter 24.0B 33K ~12.00 ~36.00
-
Estimated
NousResearch: Hermes 2 Pro - Llama-3 8B
nousresearch/hermes-2-pro-llama-3-8b
Openrouter 8.0B 8K ~4.00 ~12.00
-
Estimated
Nous: Hermes 3 405B Instruct
nousresearch/hermes-3-llama-3.1-405b
Redpill 22.0B 131K ~13.20 ~39.60
V T I A J
Inferred
Nous: Hermes 3 405B Instruct (free)
nousresearch/hermes-3-llama-3.1-405b:free
Openrouter 405.0B 131K ~202.50 ~607.50
-
Estimated
Nous: Hermes 3 70B Instruct
nousresearch/hermes-3-llama-3.1-70b
Openrouter 7.0B 66K ~3.50 ~10.50
-
Estimated
Nous: Hermes 4 405B
nousresearch/hermes-4-405b
Openrouter 405.0B 131K ~202.50 ~607.50
-
Estimated
Nous: Hermes 4 70B
nousresearch/hermes-4-70b
Openrouter 70.0B 131K ~35.00 ~105.00
-
Estimated
NVIDIA: Llama 3.1 Nemotron 70B Instruct
nvidia/llama-3.1-nemotron-70b-instruct
Openrouter 70.0B 131K ~42.00 ~126.00
-
Estimated
NVIDIA: Llama 3.1 Nemotron Ultra 253B v1
nvidia/llama-3.1-nemotron-ultra-253b-v1
Openrouter 253.0B 131K ~126.50 ~379.50
-
Estimated
NVIDIA: Llama 3.3 Nemotron Super 49B V1.5
nvidia/llama-3.3-nemotron-super-49b-v1.5
Openrouter 49.0B 131K ~24.50 ~73.50
-
Estimated
NVIDIA: Nemotron 3 Nano 30B A3B
nvidia/nemotron-3-nano-30b-a3b
Openrouter 30.0B 262K ~9.00 ~27.00
-
Estimated
NVIDIA: Nemotron 3 Nano 30B A3B (free)
nvidia/nemotron-3-nano-30b-a3b:free
Openrouter 30.0B 256K ~9.00 ~27.00
-
Estimated
NVIDIA: Nemotron 3 Super (free)
nvidia/nemotron-3-super-120b-a12b:free
Openrouter 120.0B 262K ~36.00 ~108.00
-
Estimated
NVIDIA: Nemotron Nano 12B 2 VL
nvidia/nemotron-nano-12b-v2-vl
Openrouter 12.0B 131K ~7.20 ~21.60
-
Estimated
NVIDIA: Nemotron Nano 12B 2 VL (free)
nvidia/nemotron-nano-12b-v2-vl:free
Openrouter 12.0B 128K ~7.20 ~21.60
-
Estimated
NVIDIA: Nemotron Nano 9B V2
nvidia/nemotron-nano-9b-v2
Openrouter 9.0B 131K ~4.50 ~13.50
-
Estimated
NVIDIA: Nemotron Nano 9B V2 (free)
nvidia/nemotron-nano-9b-v2:free
Openrouter 9.0B 128K ~4.50 ~13.50
-
Estimated
o1 Openai 200.0B 200K ~120.00 ~360.00
V T I A J
Estimated
o1-2024-12-17 Openai 8.0B 200K ~4.80 ~14.40
V T I A J
Estimated
o1-mini Openai 150.0B 128K ~90.00 ~270.00
V T I A J
Inferred
O1 Mini 2024 09 12
o1-mini-2024-09-12
Openai 100.0B - ~30.00 ~90.00
-
Estimated
o1-preview Openai 660.0B 128K ~396.00 ~1,188.00
V T I A J
Inferred
O1 Preview 2024 09 12
o1-preview-2024-09-12
Openai 300.0B - ~90.00 ~270.00
-
Estimated
o1-pro Openai 50.0B 200K ~30.00 ~90.00
V T I A J
Estimated
o1-pro-2025-03-19 Openai 50.0B 200K ~30.00 ~90.00
V T I A J
Estimated
o3 Openai 560.0B 200K ~336.00 ~1,008.00
V T I A J
Inferred
o3-2025-04-16 Openai 560.0B 200K ~336.00 ~1,008.00
V T I A J
Inferred
o3-mini Openai 71.0B 200K ~42.60 ~127.80
V T I A J
Inferred
o3-mini-2025-01-31 Openai 71.0B 200K ~42.60 ~127.80
V T I A J
Inferred
o4-mini Openai 71.0B 200K ~42.60 ~127.80
V T I A J
Inferred
o4-mini-2025-04-16 Openai 71.0B 200K ~42.60 ~127.80
V T I A J
Inferred
o4-mini-deep-research Openai 71.0B 200K ~42.60 ~127.80
V T I A J
Inferred
o4-mini-deep-research-2025-06-26 Openai 71.0B 200K ~42.60 ~127.80
V T I A J
Inferred
Mistral: Mistral 7B Instruct
open-mistral-7b
Mistral 7.0B - ~4.20 ~12.60
-
Estimated
open-mistral-nemo Mistral 12.0B 128K ~7.20 ~21.60
V T I A J
Estimated
open-mistral-nemo
open-mistral-nemo-2407
Mistral 2.2B 128K ~2.00 ~6.00
V T I A J
Inferred
Open Mixtral 8x22b
open-mixtral-8x22b
Mistral 141.0B 64K ~42.30 ~126.90
V T I A J
Estimated
Open Mixtral 8x7b
open-mixtral-8x7b
Mistral 47.0B 32K ~14.10 ~42.30
V T I A J
Estimated
OpenAI: GPT-3.5 Turbo
openai/gpt-3.5-turbo
Redpill 20.0B 16K ~12.00 ~36.00
V T I A J
Estimated
OpenAI: GPT-4
openai/gpt-4
Redpill 200.0B 8K ~120.00 ~360.00
V T I A J
Estimated
OpenAI: GPT-4 Turbo
openai/gpt-4-turbo
Openrouter 200.0B 128K ~144.00 ~432.00
-
Estimated
OpenAI: GPT-4 Turbo Preview
openai/gpt-4-turbo-preview
Openrouter 200.0B 128K ~144.00 ~432.00
-
Estimated
OpenAI: GPT-4.1
openai/gpt-4.1
Redpill 110.0B 1.0M ~66.00 ~198.00
V T I A J
Inferred
OpenAI: GPT-4.1 Mini
openai/gpt-4.1-mini
Redpill 8.0B 1.0M ~5.76 ~17.28
V T I A J
Estimated
OpenAI: GPT-4.1 Nano
openai/gpt-4.1-nano
Redpill 4.0B 1.0M ~2.88 ~8.64
V T I A J
Estimated
OpenAI: GPT-4o
openai/gpt-4o
Redpill 200.0B 128K ~96.00 ~288.00
V T I A J
Research
OpenAI: GPT-4o (2024-05-13)
openai/gpt-4o-2024-05-13
Openrouter 200.0B 128K ~144.00 ~432.00
-
Estimated
OpenAI: GPT-4o (2024-08-06)
openai/gpt-4o-2024-08-06
Openrouter 200.0B 128K ~144.00 ~432.00
-
Estimated
OpenAI: GPT-4o (2024-11-20)
openai/gpt-4o-2024-11-20
Openrouter 200.0B 128K ~144.00 ~432.00
-
Estimated
OpenAI: GPT-4o-mini
openai/gpt-4o-mini
Redpill 8.0B 128K ~5.76 ~17.28
V T I A J
Estimated
OpenAI: GPT-4o-mini (2024-07-18)
openai/gpt-4o-mini-2024-07-18
Openrouter 8.0B 128K ~5.76 ~17.28
-
Estimated
OpenAI: GPT-5
openai/gpt-5
Redpill 140.0B 400K ~84.00 ~252.00
V T I A J
Inferred
OpenAI: GPT-5 Image
openai/gpt-5-image
Openrouter 0.0B 400K ~2.00 ~6.00
-
Estimated
OpenAI: GPT-5 Image Mini
openai/gpt-5-image-mini
Openrouter 0.0B 400K ~2.00 ~6.00
-
Estimated
OpenAI: GPT-5 Mini
openai/gpt-5-mini
Redpill 46.0B 400K ~27.60 ~82.80
V T I A J
Inferred
OpenAI: GPT-5 Nano
openai/gpt-5-nano
Redpill 10.0B 400K ~7.20 ~21.60
V T I A J
Estimated
OpenAI: GPT-5.1
openai/gpt-5.1
Redpill 140.0B 400K ~84.00 ~252.00
V T I A J
Inferred
OpenAI: GPT-5.2
openai/gpt-5.2
Redpill 200.0B 400K ~120.00 ~360.00
V T I A J
Inferred
OpenAI: GPT OSS 120B
openai/gpt-oss-120b
Redpill 6.4B 131K ~3.84 ~11.52
V T I A J
Inferred
OpenAI: gpt-oss-120b (exacto)
openai/gpt-oss-120b:exacto
Openrouter 120.0B 131K ~36.00 ~108.00
-
Estimated
OpenAI: gpt-oss-120b (free)
openai/gpt-oss-120b:free
Openrouter 120.0B 131K ~36.00 ~108.00
-
Estimated
OpenAI: GPT OSS 20B
openai/gpt-oss-20b
Redpill 2.2B 131K ~2.00 ~6.00
V T I A J
Inferred
OpenAI: gpt-oss-20b (free)
openai/gpt-oss-20b:free
Openrouter 20.0B 131K ~6.00 ~18.00
-
Estimated
Openai/gpt Oss Safeguard 20b
openai/gpt-oss-safeguard-20b
Groq 4.0B 131K ~2.40 ~7.20
V T I A J
Inferred
OpenAI: o1
openai/o1
Openrouter 200.0B 200K ~120.00 ~360.00
-
Estimated
OpenAI: o3
openai/o3
Redpill 110.0B 200K ~66.00 ~198.00
V T I A J
Inferred
OpenAI: o3 Mini
openai/o3-mini
Openrouter 50.0B 200K ~30.00 ~90.00
-
Estimated
OpenAI: o4 Mini
openai/o4-mini
Redpill 71.0B 200K ~42.60 ~127.80
V T I A J
Inferred
Whisper
openai/whisper
Replicate 17.0B - ~10.20 ~30.60
V T I A J
Inferred
OpenGVLab: InternVL3 78B
opengvlab/internvl3-78b
Openrouter 78.0B 33K ~46.80 ~140.40
-
Estimated
Auto Router
openrouter/auto
Openrouter 176.0B 2.0M ~52.80 ~158.40
-
Estimated
DeepSeek V3.1
phala/deepseek-chat-v3.1
Redpill 48.0B 164K ~17.28 ~51.84
V T I A J
Inferred
DeepSeek: DeepSeek V3.2
phala/deepseek-v3.2
Redpill 5.3B 164K ~2.00 ~6.00
V T I A J
Inferred
Google: Gemma 3 27B
phala/gemma-3-27b-it
Redpill 4.8B 54K ~2.88 ~8.64
V T I A J
Inferred
Z.AI: GLM 4.7
phala/glm-4.7
Redpill 57.0B 131K ~34.20 ~102.60
V T I A J
Inferred
Z.AI: GLM 4.7 Flash
phala/glm-4.7-flash
Redpill 6.1B 203K ~3.66 ~10.98
V T I A J
Inferred
Z.AI: GLM 5
phala/glm-5
Redpill 64.0B 203K ~38.40 ~115.20
V T I A J
Inferred
OpenAI: GPT OSS 120B
phala/gpt-oss-120b
Redpill 6.4B 131K ~3.84 ~11.52
V T I A J
Inferred
OpenAI: GPT OSS 20B
phala/gpt-oss-20b
Redpill 2.2B 131K ~2.00 ~6.00
V T I A J
Inferred
MoonshotAI: Kimi K2.5
phala/kimi-k2.5
Redpill 60.0B 262K ~36.00 ~108.00
V T I A J
Inferred
Qwen2.5 7B Instruct
phala/qwen-2.5-7b-instruct
Redpill 2.0B 33K ~2.00 ~6.00
V T I A J
Inferred
Qwen: Qwen3 VL 30B A3B Instruct
phala/qwen2.5-vl-72b-instruct
Redpill 12.0B 128K ~7.20 ~21.60
V T I A J
Inferred
Qwen3 30B A3B Instruct 2507
phala/qwen3-30b-a3b-instruct-2507
Redpill 6.4B 262K ~3.84 ~11.52
V T I A J
Inferred
Qwen: Qwen3 VL 30B A3B Instruct
phala/qwen3-vl-30b-a3b-instruct
Redpill 12.0B 128K ~7.20 ~21.60
V T I A J
Inferred
Qwen: Qwen3.5-27B
phala/qwen3.5-27b
Redpill 51.0B 262K ~30.60 ~91.80
V T I A J
Inferred
Phala: Venice Uncensored 24B
phala/uncensored-24b
Redpill 17.0B 33K ~10.20 ~30.60
V T I A J
Inferred
phi3:medium Ollama 14.0B - ~8.40 ~25.20
-
Estimated
phi3:mini Ollama 3.8B - ~2.28 ~6.84
-
Estimated
Mistral: Pixtral 12B
pixtral-12b-2409
Mistral 12.0B - ~8.64 ~25.92
-
Estimated
Pixtral 12b Latest
pixtral-12b-latest
Mistral 2.2B 128K ~2.00 ~6.00
V T I A J
Inferred
pixtral-large-2411 Mistral 77.0B 128K ~46.20 ~138.60
V T I A J
Inferred
pixtral-large-2411
pixtral-large-latest
Mistral 77.0B 128K ~46.20 ~138.60
V T I A J
Inferred
Prime Intellect: INTELLECT-3
prime-intellect/intellect-3
Openrouter 106.0B 131K ~31.80 ~95.40
-
Estimated
Qwen 2.5 32b
qwen-2.5-32b
Groq 4.7B 33K ~2.82 ~8.46
V T I A J
Inferred
Qwen 2.5 Coder 32b
qwen-2.5-coder-32b
Groq 4.7B 33K ~2.82 ~8.46
V T I A J
Inferred
Qwen 3 235b A22b Instruct 2507
qwen-3-235b-a22b-instruct-2507
Cerebras 25.0B 33K ~15.00 ~45.00
V T I A J
Inferred
Qwen Qwq 32b
qwen-qwq-32b
Groq 7.6B 131K ~4.56 ~13.68
V T I A J
Inferred
Qwen2.5 72B Instruct
qwen/qwen-2.5-72b-instruct
Openrouter 72.0B 33K ~43.20 ~129.60
-
Estimated
Qwen2.5 7B Instruct
qwen/qwen-2.5-7b-instruct
Redpill 7.0B 33K ~4.20 ~12.60
V T I A J
Estimated
Qwen2.5 Coder 32B Instruct
qwen/qwen-2.5-coder-32b-instruct
Openrouter 32.0B 33K ~19.20 ~57.60
-
Estimated
Qwen: Qwen2.5-VL 7B Instruct
qwen/qwen-2.5-vl-7b-instruct
Openrouter 7.0B 33K ~4.20 ~12.60
-
Estimated
Qwen: Qwen2.5-VL 7B Instruct (free)
qwen/qwen-2.5-vl-7b-instruct:free
Openrouter 7.0B 33K ~4.20 ~12.60
-
Estimated
Qwen: Qwen2.5 Coder 7B Instruct
qwen/qwen2.5-coder-7b-instruct
Openrouter 7.0B 33K ~3.50 ~10.50
-
Estimated
Qwen: Qwen2.5 VL 32B Instruct
qwen/qwen2.5-vl-32b-instruct
Openrouter 32.0B 16K ~19.20 ~57.60
-
Estimated
Qwen: Qwen2.5 VL 72B Instruct
qwen/qwen2.5-vl-72b-instruct
Openrouter 72.0B 33K ~43.20 ~129.60
-
Estimated
Qwen: Qwen3 14B
qwen/qwen3-14b
Openrouter 14.0B 41K ~7.00 ~21.00
-
Estimated
Qwen: Qwen3 235B A22B
qwen/qwen3-235b-a22b
Openrouter 235.0B 41K ~70.50 ~211.50
-
Estimated
Qwen: Qwen3 235B A22B Instruct 2507
qwen/qwen3-235b-a22b-2507
Openrouter 235.0B 262K ~117.50 ~352.50
-
Estimated
Qwen: Qwen3 235B A22B Thinking 2507
qwen/qwen3-235b-a22b-thinking-2507
Openrouter 235.0B 262K ~70.50 ~211.50
-
Estimated
Qwen: Qwen3 235B A22B (free)
qwen/qwen3-235b-a22b:free
Openrouter 235.0B 131K ~70.50 ~211.50
-
Estimated
Qwen: Qwen3 30B A3B
qwen/qwen3-30b-a3b
Openrouter 30.0B 41K ~9.00 ~27.00
-
Estimated
Qwen3 30B A3B Instruct 2507
qwen/qwen3-30b-a3b-instruct-2507
Redpill 6.4B 262K ~3.84 ~11.52
V T I A J
Inferred
Qwen: Qwen3 30B A3B Thinking 2507
qwen/qwen3-30b-a3b-thinking-2507
Openrouter 30.0B 33K ~15.00 ~45.00
-
Estimated
Qwen/qwen3 32b
qwen/qwen3-32b
Groq 7.6B 131K ~4.56 ~13.68
V T I A J
Inferred
Qwen: Qwen3 4B (free)
qwen/qwen3-4b:free
Openrouter 4.0B 41K ~2.00 ~6.00
-
Estimated
Qwen: Qwen3 8B
qwen/qwen3-8b
Openrouter 8.0B 128K ~4.00 ~12.00
-
Estimated
Qwen: Qwen3 Coder 480B A35B
qwen/qwen3-coder
Openrouter 480.0B 262K ~144.00 ~432.00
-
Estimated
Qwen: Qwen3 Coder 30B A3B Instruct
qwen/qwen3-coder-30b-a3b-instruct
Openrouter 30.0B 262K ~9.00 ~27.00
-
Estimated
Qwen: Qwen3 Coder 480B A35B
qwen/qwen3-coder-480b-a35b-instruct
Redpill 46.0B 262K ~27.60 ~82.80
V T I A J
Inferred
Qwen: Qwen3 Coder Plus
qwen/qwen3-coder-plus
Openrouter 480.0B 128K ~240.00 ~720.00
-
Estimated
Qwen: Qwen3 Coder 480B A35B (exacto)
qwen/qwen3-coder:exacto
Openrouter 480.0B 262K ~144.00 ~432.00
-
Estimated
Qwen: Qwen3 Coder 480B A35B (free)
qwen/qwen3-coder:free
Openrouter 480.0B 262K ~144.00 ~432.00
-
Estimated
Qwen: Qwen3 Next 80B A3B Instruct
qwen/qwen3-next-80b-a3b-instruct
Openrouter 80.0B 262K ~40.00 ~120.00
-
Estimated
Qwen: Qwen3 Next 80B A3B Instruct (free)
qwen/qwen3-next-80b-a3b-instruct:free
Openrouter 80.0B 262K ~40.00 ~120.00
-
Estimated
Qwen: Qwen3 Next 80B A3B Thinking
qwen/qwen3-next-80b-a3b-thinking
Openrouter 80.0B 131K ~40.00 ~120.00
-
Estimated
Qwen: Qwen3 VL 235B A22B Instruct
qwen/qwen3-vl-235b-a22b-instruct
Openrouter 235.0B 262K ~141.00 ~423.00
-
Estimated
Qwen: Qwen3 VL 235B A22B Thinking
qwen/qwen3-vl-235b-a22b-thinking
Openrouter 235.0B 262K ~141.00 ~423.00
-
Estimated
Qwen: Qwen3 VL 30B A3B Instruct
qwen/qwen3-vl-30b-a3b-instruct
Redpill 12.0B 128K ~7.20 ~21.60
V T I A J
Inferred
Qwen: Qwen3 VL 30B A3B Thinking
qwen/qwen3-vl-30b-a3b-thinking
Openrouter 30.0B 131K ~18.00 ~54.00
-
Estimated
Qwen: Qwen3 VL 32B Instruct
qwen/qwen3-vl-32b-instruct
Openrouter 32.0B 262K ~19.20 ~57.60
-
Estimated
Qwen: Qwen3 VL 8B Instruct
qwen/qwen3-vl-8b-instruct
Openrouter 8.0B 131K ~4.80 ~14.40
-
Estimated
Qwen: Qwen3 VL 8B Thinking
qwen/qwen3-vl-8b-thinking
Openrouter 8.0B 256K ~4.80 ~14.40
-
Estimated
Qwen: Qwen3.5-122B-A10B
qwen/qwen3.5-122b-a10b
Openrouter 122.0B 262K ~73.20 ~219.60
-
Estimated
Qwen: Qwen3.5-27B
qwen/qwen3.5-27b
Redpill 27.0B 262K ~19.44 ~58.32
V T I A J
Estimated
Qwen: Qwen3.5-35B-A3B
qwen/qwen3.5-35b-a3b
Openrouter 35.0B 262K ~21.00 ~63.00
-
Estimated
Qwen: Qwen3.5 397B A17B
qwen/qwen3.5-397b-a17b
Redpill 397.0B 262K ~238.20 ~714.60
V T I A J
Estimated
Qwen: Qwen3.5-9B
qwen/qwen3.5-9b
Openrouter 9.0B 262K ~5.40 ~16.20
-
Estimated
Qwen: QwQ 32B
qwen/qwq-32b
Openrouter 32.0B 33K ~16.00 ~48.00
-
Estimated
qwen2.5:14b Ollama 14.0B - ~8.40 ~25.20
-
Estimated
qwen2.5:32b Ollama 32.0B - ~19.20 ~57.60
-
Estimated
qwen2.5:72b Ollama 72.0B - ~43.20 ~129.60
-
Estimated
qwen2.5:7b Ollama 7.0B - ~4.20 ~12.60
-
Estimated
SorcererLM 8x22B
raifle/sorcererlm-8x22b
Openrouter 176.0B 16K ~52.80 ~158.40
-
Estimated
Rerank English V2.0
rerank-english-v2.0
Cohere 50.0B - ~25.00 ~75.00
-
Inferred
Rerank English V3.0
rerank-english-v3.0
Cohere 70.0B - ~35.00 ~105.00
-
Inferred
Rerank Multilingual V2.0
rerank-multilingual-v2.0
Cohere 50.0B - ~25.00 ~75.00
-
Inferred
rerank-multilingual-v3.0 Cohere 63.0B 4K ~31.50 ~94.50
-
Inferred
Sao10k: Llama 3 Euryale 70B v2.1
sao10k/l3-euryale-70b
Openrouter 70.0B 8K ~35.00 ~105.00
-
Estimated
Sao10K: Llama 3 8B Lunaris
sao10k/l3-lunaris-8b
Openrouter 8.0B 8K ~4.00 ~12.00
-
Estimated
Sao10K: Llama 3.1 70B Hanami x1
sao10k/l3.1-70b-hanami-x1
Openrouter 70.0B 16K ~35.00 ~105.00
-
Estimated
Sao10K: Llama 3.1 Euryale 70B v2.2
sao10k/l3.1-euryale-70b
Openrouter 70.0B 33K ~35.00 ~105.00
-
Estimated
Sao10K: Llama 3.3 Euryale 70B
sao10k/l3.3-euryale-70b
Openrouter 70.0B 131K ~35.00 ~105.00
-
Estimated
Sonar
sonar
Perplexity 22.0B 128K ~13.20 ~39.60
V T I A J
Inferred
Sonar Medium Chat
sonar-medium-chat
Perplexity 8.0B 16K ~4.80 ~14.40
V T I A J
Inferred
Sonar Medium Online
sonar-medium-online
Perplexity 8.0B 12K ~4.80 ~14.40
V T I A J
Inferred
Sonar Pro
sonar-pro
Perplexity 240.0B 200K ~144.00 ~432.00
V T I A J
Inferred
Sonar Reasoning
sonar-reasoning
Perplexity 70.0B 128K ~42.00 ~126.00
V T I A J
Inferred
Sonar Reasoning Pro
sonar-reasoning-pro
Perplexity 99.0B 128K ~59.40 ~178.20
V T I A J
Inferred
Sonar Small Chat
sonar-small-chat
Perplexity 2.0B 16K ~2.00 ~6.00
V T I A J
Inferred
Sonar Small Online
sonar-small-online
Perplexity 2.0B 12K ~2.00 ~6.00
V T I A J
Inferred
StepFun: Step3
stepfun-ai/step3
Openrouter 321.0B 66K ~192.60 ~577.80
-
Estimated
StepFun: Step 3.5 Flash
stepfun/step-3.5-flash
Openrouter 11.0B 256K ~3.30 ~9.90
-
Estimated
StepFun: Step 3.5 Flash (free)
stepfun/step-3.5-flash:free
Openrouter 11.0B 256K ~3.30 ~9.90
-
Estimated
Tencent: Hunyuan A13B Instruct
tencent/hunyuan-a13b-instruct
Openrouter 13.0B 131K ~3.90 ~11.70
-
Estimated
Text Embedding 004
text-embedding-004
Google 1.8B 2K ~2.00 ~6.00
V T I A J
Inferred
text-embedding-3-large Openai 4.7B 8K ~2.82 ~8.46
V T I A J
Inferred
text-embedding-3-small Openai 2.0B 8K ~2.00 ~6.00
V T I A J
Inferred
text-embedding-ada-002 Openai 3.7B 8K ~2.22 ~6.66
V T I A J
Inferred
TheDrummer: Anubis 70B V1.1
thedrummer/anubis-70b-v1.1
Openrouter 70.0B 131K ~35.00 ~105.00
-
Estimated
TheDrummer: Cydonia 24B V4.1
thedrummer/cydonia-24b-v4.1
Openrouter 24.0B 131K ~12.00 ~36.00
-
Estimated
TheDrummer: Rocinante 12B
thedrummer/rocinante-12b
Openrouter 12.0B 33K ~6.00 ~18.00
-
Estimated
TheDrummer: Skyfall 36B V2
thedrummer/skyfall-36b-v2
Openrouter 36.0B 33K ~18.00 ~54.00
-
Estimated
TheDrummer: UnslopNemo 12B
thedrummer/unslopnemo-12b
Openrouter 12.0B 33K ~6.00 ~18.00
-
Estimated
THUDM: GLM 4.1V 9B Thinking
thudm/glm-4.1v-9b-thinking
Openrouter 9.0B 66K ~5.40 ~16.20
-
Estimated
TNG: DeepSeek R1T2 Chimera
tngtech/deepseek-r1t2-chimera
Openrouter 671.0B 164K ~335.50 ~1,006.50
-
Estimated
TNG: DeepSeek R1T2 Chimera (free)
tngtech/deepseek-r1t2-chimera:free
Openrouter 671.0B 164K ~335.50 ~1,006.50
-
Estimated
ReMM SLERP 13B
undi95/remm-slerp-l2-13b
Openrouter 2.0B 6K ~2.00 ~6.00
-
Estimated
Upstage: Solar Pro 3
upstage/solar-pro-3
Openrouter 102.0B 128K ~30.60 ~91.80
-
Estimated
voxtral-mini-transcribe-2507
voxtral-mini-2507
Mistral 3.7B 32K ~2.22 ~6.66
V T I A J
Inferred
voxtral-mini-2602 Mistral 3.7B 32K ~2.22 ~6.66
V T I A J
Inferred
voxtral-mini-2602
voxtral-mini-latest
Mistral 3.7B 32K ~2.22 ~6.66
V T I A J
Inferred
voxtral-mini-transcribe-realtime-2602
voxtral-mini-realtime-2602
Mistral 3.7B 32K ~2.22 ~6.66
V T I A J
Inferred
voxtral-mini-transcribe-realtime-2602
voxtral-mini-realtime-latest
Mistral 3.7B 32K ~2.22 ~6.66
V T I A J
Inferred
voxtral-mini-transcribe-2507 Mistral 3.7B 32K ~2.22 ~6.66
V T I A J
Inferred
voxtral-mini-transcribe-realtime-2602 Mistral 3.7B 32K ~2.22 ~6.66
V T I A J
Inferred
Voxtral Small
voxtral-small
Mistral 3.7B 32K ~2.22 ~6.66
V T I A J
Inferred
voxtral-small-2507 Mistral 3.7B 32K ~2.22 ~6.66
V T I A J
Inferred
voxtral-small-2507
voxtral-small-latest
Mistral 3.7B 32K ~2.22 ~6.66
V T I A J
Inferred
Whisper Large V3
whisper-large-v3
Groq 3.6B 448 ~2.16 ~6.48
V T I A J
Inferred
Whisper Large V3 Turbo
whisper-large-v3-turbo
Groq 1.8B 448 ~2.00 ~6.00
V T I A J
Inferred
xAI: Grok 3 Beta
x-ai/grok-3-beta
Openrouter 400.0B 131K ~120.00 ~360.00
-
Estimated
xAI: Grok 3 Mini Beta
x-ai/grok-3-mini-beta
Openrouter 100.0B 131K ~30.00 ~90.00
-
Estimated
xAI: Grok 4
x-ai/grok-4
Redpill 240.0B 256K ~86.40 ~259.20
V T I A J
Inferred
xAI: Grok 4.1 Fast
x-ai/grok-4.1-fast
Redpill 400.0B 2.0M ~120.00 ~360.00
V T I A J
Estimated
xAI: Grok 4.20 Beta
x-ai/grok-4.20-beta
Openrouter 4.2B 2.0M ~2.52 ~7.56
-
Estimated
xAI: Grok Code Fast 1
x-ai/grok-code-fast-1
Redpill 200.0B 256K ~60.00 ~180.00
V T I A J
Estimated
Xiaomi: MiMo-V2-Flash
xiaomi/mimo-v2-flash
Openrouter 309.0B 262K ~154.50 ~463.50
-
Estimated
Xiaomi: MiMo-V2-Flash (free)
xiaomi/mimo-v2-flash:free
Openrouter 309.0B 262K ~154.50 ~463.50
-
Estimated
Z.AI: GLM 4 32B
z-ai/glm-4-32b
Openrouter 32.0B 128K ~16.00 ~48.00
-
Estimated
Z.AI: GLM 4.5V
z-ai/glm-4.5v
Openrouter 106.0B 66K ~38.16 ~114.48
-
Estimated
Z.AI: GLM 4.7
z-ai/glm-4.7
Redpill 57.0B 131K ~34.20 ~102.60
V T I A J
Inferred
Z.AI: GLM 4.7 Flash
z-ai/glm-4.7-flash
Redpill 30.0B 203K ~15.00 ~45.00
V T I A J
Estimated
Z.AI: GLM 5
z-ai/glm-5
Redpill 0.0B 203K ~2.00 ~6.00
V T I A J
Estimated

Frequently Asked Questions

How much energy does AI use?

  • Small models: 0.1-0.3 Wh per request
  • Large models: 1-5 Wh per request
  • GPT-4o-mini: optimized for efficiency
  • GPT-4/Claude Opus: highest consumption
  • FlowDot shows estimates per model

Why does FlowDot track AI energy usage?

  • We believe in transparency
  • Most platforms hide energy costs
  • Helps users make informed choices
  • Encourages efficient model selection
  • Shows true workflow cost

Can I reduce my AI carbon footprint?

  • Choose smaller models for simple tasks
  • Use GPT-4o-mini instead of GPT-4 when possible
  • Run local AI with Ollama on renewable power
  • Batch requests to reduce overhead
  • Track consumption with FlowDot estimates
  • Write concise prompts for fewer tokens
  • Cache repeated queries
  • Choose efficient providers

What is the most energy-efficient AI model?

  • GPT-4o-mini: OpenAI's efficient option
  • Claude 3 Haiku: Anthropic's fast model
  • Gemini 1.5 Flash: Google's efficient choice
  • Llama 3.1 8B: excellent local option
  • Mistral 7B: lightweight and capable
  • Phi-3 Mini: Microsoft's small model

How does AI energy compare to other activities?

  • GPT-4 query: 1-5 Wh (60W bulb for 1-5 min)
  • GPT-3 training: ~1,287 MWh total
  • Equivalent to ~120 US homes per year
  • Efficiency improves 10-100x per generation
  • Modern data centers: PUE of 1.1-1.2
  • Cooling adds 10-20% energy overhead

Quick Reference Guide

Low Energy Models (Best for Simple Tasks)

  • GPT-4o-mini: fast and efficient
  • Claude 3 Haiku: optimized for speed
  • Gemini 1.5 Flash: designed for efficiency
  • Llama 3.1 8B: great for local use
  • Mistral 7B: excellent efficiency ratio
  • Phi-3 Mini: Microsoft's efficient option

When to Use Larger Models

  • Complex reasoning tasks
  • Multi-step analysis
  • Creative writing requiring nuance
  • Code generation for large projects
  • Tasks requiring broad knowledge
  • High-stakes decisions needing accuracy

Call for Transparency

We encourage all AI providers to publicly disclose energy consumption data. Transparency is essential for building a sustainable AI future.

Share Energy Data

Are you an AI provider with official energy data? We'd love to hear from you.

Published by FlowDot Research Team
Questions? Visit our FAQs page