LLMScape

Compare AI Models by Price, Context & Capabilities

Deep Infra

Image, Reasoning, Text, Video

Deep Infra — AI provider offering 29 models.

Status: up (57ms)

Models & Capabilities

Available Models
Model Name | Context Window | Input Cost (per 1M tokens) | Output Cost (per 1M tokens) | Capabilities
deepinfra/Gryphe/MythoMax-L2-13b | 4,096 | $0.08 | $0.09 | tools, text, function calling
deepinfra/NousResearch/Hermes-3-Llama-3.1-405B | 131,072 | $1.00 | $1.00 | tools, text, function calling
deepinfra/NousResearch/Hermes-3-Llama-3.1-70B | 131,072 | $0.30 | $0.30 | text
deepinfra/Qwen/QwQ-32B | 131,072 | $0.15 | $0.40 | tools, text, function calling
deepinfra/Qwen/Qwen2.5-72B-Instruct | 32,768 | $0.12 | $0.39 | tools, text, function calling
deepinfra/Qwen/Qwen2.5-7B-Instruct | 32,768 | $0.04 | $0.10 | text
deepinfra/Qwen/Qwen2.5-VL-32B-Instruct | 128,000 | $0.20 | $0.60 | tools, text, image, function calling
deepinfra/Qwen/Qwen3-14B | 40,960 | $0.06 | $0.24 | tools, text, function calling
deepinfra/Qwen/Qwen3-235B-A22B | 40,960 | $0.18 | $0.54 | tools, text, function calling
deepinfra/Qwen/Qwen3-235B-A22B-Instruct-2507 | 262,144 | $0.09 | $0.60 | tools, text, function calling
deepinfra/Qwen/Qwen3-235B-A22B-Thinking-2507 | 262,144 | $0.30 | $2.90 | tools, text, function calling
deepinfra/Qwen/Qwen3-30B-A3B | 40,960 | $0.08 | $0.29 | tools, text, function calling
deepinfra/Qwen/Qwen3-32B | 40,960 | $0.10 | $0.28 | tools, text, function calling
deepinfra/Qwen/Qwen3-Coder-480B-A35B-Instruct | 262,144 | $0.40 | $1.60 | tools, text, function calling
deepinfra/Qwen/Qwen3-Coder-480B-A35B-Instruct-Turbo | 262,144 | $0.29 | $1.20 | tools, text, function calling
deepinfra/Qwen/Qwen3-Next-80B-A3B-Instruct | 262,144 | $0.14 | $1.40 | tools, text, function calling
deepinfra/Qwen/Qwen3-Next-80B-A3B-Thinking | 262,144 | $0.14 | $1.40 | tools, text, function calling
deepinfra/Sao10K/L3-8B-Lunaris-v1-Turbo | 8,192 | $0.04 | $0.05 | text
deepinfra/Sao10K/L3.1-70B-Euryale-v2.2 | 131,072 | $0.65 | $0.75 | text
deepinfra/Sao10K/L3.3-70B-Euryale-v2.3 | 131,072 | $0.65 | $0.75 | text
deepinfra/allenai/olmOCR-7B-0725-FP8 | 16,384 | $0.27 | $1.50 | text
deepinfra/anthropic/claude-3-7-sonnet-latest | 200,000 | $3.30 | $16.50 | tools, text, function calling
deepinfra/anthropic/claude-4-opus | 200,000 | $16.50 | $82.50 | tools, text, function calling
deepinfra/anthropic/claude-4-sonnet | 200,000 | $3.30 | $16.50 | tools, text, function calling
deepinfra/deepseek-ai/DeepSeek-R1 | 163,840 | $0.70 | $2.40 | tools, text, function calling
deepinfra/deepseek-ai/DeepSeek-R1-0528 | 163,840 | $0.50 | $2.15 | tools, text, function calling
deepinfra/deepseek-ai/DeepSeek-R1-0528-Turbo | 32,768 | $1.00 | $3.00 | tools, text, function calling
deepinfra/deepseek-ai/DeepSeek-R1-Distill-Llama-70B | 131,072 | $0.20 | $0.60 | text
deepinfra/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B | 131,072 | $0.27 | $0.27 | tools, text, function calling
deepinfra/deepseek-ai/DeepSeek-R1-Turbo | 40,960 | $1.00 | $3.00 | tools, text, function calling
deepinfra/deepseek-ai/DeepSeek-V3 | 163,840 | $0.38 | $0.89 | tools, text, function calling
deepinfra/deepseek-ai/DeepSeek-V3-0324 | 163,840 | $0.25 | $0.88 | tools, text, function calling
deepinfra/deepseek-ai/DeepSeek-V3.1 | 163,840 | $0.27 | $1.00 | tools, text, function calling
deepinfra/deepseek-ai/DeepSeek-V3.1-Terminus | 163,840 | $0.27 | $1.00 | tools, text, function calling
deepinfra/google/gemini-2.0-flash-001 | 1,000,000 | $0.10 | $0.40 | tools, text, function calling
deepinfra/google/gemini-2.5-flash | 1,000,000 | $0.30 | $2.50 | tools, text, function calling
deepinfra/google/gemini-2.5-pro | 1,000,000 | $1.25 | $10.00 | tools, text, function calling
deepinfra/google/gemma-3-12b-it | 131,072 | $0.05 | $0.10 | tools, text, function calling
deepinfra/google/gemma-3-27b-it | 131,072 | $0.09 | $0.16 | tools, text, function calling
deepinfra/google/gemma-3-4b-it | 131,072 | $0.04 | $0.08 | tools, text, function calling
deepinfra/meta-llama/Llama-3.2-11B-Vision-Instruct | 131,072 | $0.049 | $0.049 | text
deepinfra/meta-llama/Llama-3.2-3B-Instruct | 131,072 | $0.02 | $0.02 | tools, text, function calling
deepinfra/meta-llama/Llama-3.3-70B-Instruct | 131,072 | $0.23 | $0.40 | tools, text, function calling
deepinfra/meta-llama/Llama-3.3-70B-Instruct-Turbo | 131,072 | $0.13 | $0.39 | tools, text, function calling
deepinfra/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 | 1,048,576 | $0.15 | $0.60 | tools, text, function calling
deepinfra/meta-llama/Llama-4-Scout-17B-16E-Instruct | 327,680 | $0.08 | $0.30 | tools, text, function calling
deepinfra/meta-llama/Llama-Guard-3-8B | 131,072 | $0.055 | $0.055 | text
deepinfra/meta-llama/Llama-Guard-4-12B | 163,840 | $0.18 | $0.18 | text
deepinfra/meta-llama/Meta-Llama-3-8B-Instruct | 8,192 | $0.03 | $0.06 | tools, text, function calling
deepinfra/meta-llama/Meta-Llama-3.1-70B-Instruct | 131,072 | $0.40 | $0.40 | tools, text, function calling
deepinfra/meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo | 131,072 | $0.10 | $0.28 | tools, text, function calling
deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct | 131,072 | $0.03 | $0.05 | tools, text, function calling
deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo | 131,072 | $0.02 | $0.03 | tools, text, function calling
deepinfra/microsoft/WizardLM-2-8x22B | 65,536 | $0.48 | $0.48 | text
deepinfra/microsoft/phi-4 | 16,384 | $0.07 | $0.14 | tools, text, function calling
deepinfra/mistralai/Mistral-Nemo-Instruct-2407 | 131,072 | $0.02 | $0.04 | tools, text, function calling
deepinfra/mistralai/Mistral-Small-24B-Instruct-2501 | 32,768 | $0.05 | $0.08 | tools, text, function calling
deepinfra/mistralai/Mistral-Small-3.2-24B-Instruct-2506 | 128,000 | $0.075 | $0.20 | tools, text, function calling
deepinfra/mistralai/Mixtral-8x7B-Instruct-v0.1 | 32,768 | $0.40 | $0.40 | tools, text, function calling
deepinfra/moonshotai/Kimi-K2-Instruct | 131,072 | $0.50 | $2.00 | tools, text, function calling
deepinfra/moonshotai/Kimi-K2-Instruct-0905 | 262,144 | $0.50 | $2.00 | tools, text, function calling
deepinfra/nvidia/Llama-3.1-Nemotron-70B-Instruct | 131,072 | $0.60 | $0.60 | tools, text, function calling
deepinfra/nvidia/Llama-3.3-Nemotron-Super-49B-v1.5 | 131,072 | $0.10 | $0.40 | tools, text, function calling
deepinfra/nvidia/NVIDIA-Nemotron-Nano-9B-v2 | 131,072 | $0.04 | $0.16 | tools, text, function calling
deepinfra/openai/gpt-oss-120b | 131,072 | $0.05 | $0.45 | tools, text, function calling
deepinfra/openai/gpt-oss-20b | 131,072 | $0.04 | $0.15 | tools, text, function calling
deepinfra/zai-org/GLM-4.5 | 131,072 | $0.40 | $1.60 | tools, text, function calling
Qwen3 Coder 480B A35B Instruct Turbo [qwen] | 262,144 | $0.30 | $1.20 | tools, text, function calling
Qwen3 Coder 480B A35B Instruct [qwen] | 262,144 | $0.40 | $1.60 | tools, text, function calling
GLM-4.7-Flash [glm-flash] | 202,752 | $0.06 | $0.40 | reasoning, tools, text, function calling
GLM-4.5 [glm] | 131,072 | $0.60 | $2.20 | tools, text, function calling
GLM-4.7 [glm] | 202,752 | $0.43 | $1.75 | reasoning, tools, text, function calling
GLM-5.1 [glm] | 202,752 | $1.40 | $4.40 | reasoning, tools, text, function calling
GLM-5 [glm] | 202,752 | $0.80 | $2.56 | reasoning, tools, text, function calling
GLM-4.6V [glm] | 204,800 | $0.30 | $0.90 | reasoning, tools, text, image, function calling
GLM-4.6 [glm] | 204,800 | $0.43 | $1.74 | reasoning, tools, text, function calling
Llama 4 Scout 17B [llama] | 10,000,000 | $0.08 | $0.30 | tools, text, image, function calling
Llama 3.1 8B [llama] | 131,072 | $0.02 | $0.05 | tools, text, function calling
Llama 3.1 70B [llama] | 131,072 | $0.40 | $0.40 | tools, text, function calling
Llama 3.1 8B Turbo [llama] | 131,072 | $0.02 | $0.03 | tools, text, function calling
Llama 3.3 70B Turbo [llama] | 131,072 | $0.10 | $0.32 | tools, text, function calling
Llama 4 Maverick 17B FP8 [llama] | 1,000,000 | $0.15 | $0.60 | tools, text, image, function calling
Llama 3.1 70B Turbo [llama] | 131,072 | $0.40 | $0.40 | tools, text, function calling
DeepSeek-R1-0528 | 163,840 | $0.50 | $2.15 | reasoning, text
DeepSeek-V3.2 | 163,840 | $0.26 | $0.38 | reasoning, tools, text, function calling
GPT OSS 20B [gpt-oss] | 131,072 | $0.03 | $0.14 | reasoning, tools, text, function calling
GPT OSS 120B [gpt-oss] | 131,072 | $0.05 | $0.24 | reasoning, tools, text, function calling
Kimi K2 Thinking [kimi-thinking] | 131,072 | $0.47 | $2.00 | reasoning, tools, text, function calling
Kimi K2 [kimi] | 131,072 | $0.50 | $2.00 | tools, text, function calling
Kimi K2 0905 [kimi] | 262,144 | $0.40 | $2.00 | tools, text, function calling
Kimi K2.5 [kimi] | 262,144 | $0.50 | $2.80 | reasoning, tools, text, image, video, function calling
MiniMax M2 [minimax] | 262,144 | $0.254 | $1.02 | reasoning, tools, text, function calling
MiniMax M2.5 [minimax] | 204,800 | $0.27 | $0.95 | reasoning, tools, text, function calling
MiniMax M2.1 | 196,608 | $0.28 | $1.20 | reasoning, tools, text, function calling
Claude Sonnet 3.7 (Latest) [claude-sonnet] | 200,000 | $3.30 | $16.50 | reasoning, tools, text, image, function calling
Claude Opus 4 [claude-opus] | 200,000 | $16.50 | $82.50 | reasoning, tools, text, image, function calling

Subscription Plans

Monthly plans, quota windows, weighted premium credits, and API access entitlement for this provider.

No subscription plans available yet for this provider.

Uptime History

System Status (chart: 30 minutes ago to now)
Cost Estimator

Estimated Monthly Cost: $0.125
Input tokens: 1,000,000 @ $0.08 / 1M (slider range: 0 to 10M)
Output tokens: 500,000 @ $0.09 / 1M (slider range: 0 to 5M)
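
The estimator figures above appear to follow simple per-million-token arithmetic: each token volume is divided by 1,000,000 and multiplied by its rate, and the two results are summed. The sketch below is a minimal illustration of that calculation, not LLMScape's actual implementation. The function name `estimate_monthly_cost` and its parameters are hypothetical, and it assumes the $0.08 rate applies to input tokens and the $0.09 rate to output tokens (these match the first model's listed input and output prices).

```python
from decimal import Decimal

# Prices in the table are quoted per 1M tokens.
TOKENS_PER_PRICING_UNIT = Decimal(1_000_000)


def estimate_monthly_cost(input_tokens, output_tokens,
                          input_price_per_million, output_price_per_million):
    """Return the estimated monthly cost in USD as a Decimal.

    Hypothetical helper: token counts are monthly volumes, and prices are
    USD per 1M tokens, passed as strings to avoid float rounding.
    """
    input_cost = (Decimal(input_tokens) * Decimal(input_price_per_million)
                  / TOKENS_PER_PRICING_UNIT)
    output_cost = (Decimal(output_tokens) * Decimal(output_price_per_million)
                   / TOKENS_PER_PRICING_UNIT)
    return input_cost + output_cost


# Values taken from the estimator: 1,000,000 tokens at $0.08 / 1M
# and 500,000 tokens at $0.09 / 1M (assumed input and output, respectively).
cost = estimate_monthly_cost(1_000_000, 500_000, "0.08", "0.09")
print(f"Estimated monthly cost: ${cost}")
```

With these inputs the two terms are $0.08 and $0.045, which sum to the $0.125 shown by the estimator. `Decimal` is used here only so that small per-token prices do not pick up binary floating-point error.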