LLMScape

Compare AI Models by Price, Context & Capabilities

Deep Infra logo

Deep Infra

AudioImageReasoningTextVideo

Deep Infra — AI provider offering 25 models.

up
269ms

Models & Capabilities

Available Models
Model NameContext WindowInput Cost (1M)Output Cost (1M)Capabilities
deepinfra/Gryphe/MythoMax-L2-13b
4,096$0.08$0.09
toolstextfunction calling
deepinfra/NousResearch/Hermes-3-Llama-3.1-405B
131,072$1.00$1.00
toolstextfunction calling
deepinfra/NousResearch/Hermes-3-Llama-3.1-70B
131,072$0.30$0.30
text
deepinfra/Qwen/QwQ-32B
131,072$0.15$0.40
toolstextfunction calling
deepinfra/Qwen/Qwen2.5-72B-Instruct
32,768$0.12$0.39
toolstextfunction calling
deepinfra/Qwen/Qwen2.5-7B-Instruct
32,768$0.04$0.10
text
deepinfra/Qwen/Qwen2.5-VL-32B-Instruct
128,000$0.20$0.60
toolstextimagefunction calling
deepinfra/Qwen/Qwen3-14B
40,960$0.06$0.24
toolstextfunction calling
deepinfra/Qwen/Qwen3-235B-A22B
40,960$0.18$0.54
toolstextfunction calling
deepinfra/Qwen/Qwen3-235B-A22B-Instruct-2507
262,144$0.09$0.60
toolstextfunction calling
deepinfra/Qwen/Qwen3-235B-A22B-Thinking-2507
262,144$0.30$2.90
toolstextfunction calling
deepinfra/Qwen/Qwen3-30B-A3B
40,960$0.08$0.29
toolstextfunction calling
deepinfra/Qwen/Qwen3-32B
40,960$0.10$0.28
toolstextfunction calling
deepinfra/Qwen/Qwen3-Coder-480B-A35B-Instruct
262,144$0.40$1.60
toolstextfunction calling
deepinfra/Qwen/Qwen3-Coder-480B-A35B-Instruct-Turbo
262,144$0.29$1.20
toolstextfunction calling
deepinfra/Qwen/Qwen3-Next-80B-A3B-Instruct
262,144$0.14$1.40
toolstextfunction calling
deepinfra/Qwen/Qwen3-Next-80B-A3B-Thinking
262,144$0.14$1.40
toolstextfunction calling
deepinfra/Sao10K/L3-8B-Lunaris-v1-Turbo
8,192$0.04$0.05
text
deepinfra/Sao10K/L3.1-70B-Euryale-v2.2
131,072$0.65$0.75
text
deepinfra/Sao10K/L3.3-70B-Euryale-v2.3
131,072$0.65$0.75
text
deepinfra/allenai/olmOCR-7B-0725-FP8
16,384$0.27$1.50
text
deepinfra/anthropic/claude-3-7-sonnet-latest
200,000$3.30$16.50
toolstextfunction calling
deepinfra/anthropic/claude-4-opus
200,000$16.50$82.50
toolstextfunction calling
deepinfra/anthropic/claude-4-sonnet
200,000$3.30$16.50
toolstextfunction calling
deepinfra/deepseek-ai/DeepSeek-R1
163,840$0.70$2.40
toolstextfunction calling
deepinfra/deepseek-ai/DeepSeek-R1-0528
163,840$0.50$2.15
toolstextfunction calling
deepinfra/deepseek-ai/DeepSeek-R1-0528-Turbo
32,768$1.00$3.00
toolstextfunction calling
deepinfra/deepseek-ai/DeepSeek-R1-Distill-Llama-70B
131,072$0.20$0.60
text
deepinfra/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
131,072$0.27$0.27
toolstextfunction calling
deepinfra/deepseek-ai/DeepSeek-R1-Turbo
40,960$1.00$3.00
toolstextfunction calling
deepinfra/deepseek-ai/DeepSeek-V3
163,840$0.38$0.89
toolstextfunction calling
deepinfra/deepseek-ai/DeepSeek-V3-0324
163,840$0.25$0.88
toolstextfunction calling
deepinfra/deepseek-ai/DeepSeek-V3.1
163,840$0.27$1.00
toolstextfunction calling
deepinfra/deepseek-ai/DeepSeek-V3.1-Terminus
163,840$0.27$1.00
toolstextfunction calling
deepinfra/google/gemini-2.0-flash-001
1,000,000$0.10$0.40
toolstextfunction calling
deepinfra/google/gemini-2.5-flash
1,000,000$0.30$2.50
toolstextfunction calling
deepinfra/google/gemini-2.5-pro
1,000,000$1.25$10.00
toolstextfunction calling
deepinfra/google/gemma-3-12b-it
131,072$0.05$0.10
toolstextfunction calling
deepinfra/google/gemma-3-27b-it
131,072$0.09$0.16
toolstextfunction calling
deepinfra/google/gemma-3-4b-it
131,072$0.04$0.08
toolstextfunction calling
deepinfra/meta-llama/Llama-3.2-11B-Vision-Instruct
131,072$0.049$0.049
text
deepinfra/meta-llama/Llama-3.2-3B-Instruct
131,072$0.02$0.02
toolstextfunction calling
deepinfra/meta-llama/Llama-3.3-70B-Instruct
131,072$0.23$0.40
toolstextfunction calling
deepinfra/meta-llama/Llama-3.3-70B-Instruct-Turbo
131,072$0.13$0.39
toolstextfunction calling
deepinfra/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8
1,048,576$0.15$0.60
toolstextfunction calling
deepinfra/meta-llama/Llama-4-Scout-17B-16E-Instruct
327,680$0.08$0.30
toolstextfunction calling
deepinfra/meta-llama/Llama-Guard-3-8B
131,072$0.055$0.055
text
deepinfra/meta-llama/Llama-Guard-4-12B
163,840$0.18$0.18
text
deepinfra/meta-llama/Meta-Llama-3-8B-Instruct
8,192$0.03$0.06
toolstextfunction calling
deepinfra/meta-llama/Meta-Llama-3.1-70B-Instruct
131,072$0.40$0.40
toolstextfunction calling
deepinfra/meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo
131,072$0.10$0.28
toolstextfunction calling
deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct
131,072$0.03$0.05
toolstextfunction calling
deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo
131,072$0.02$0.03
toolstextfunction calling
deepinfra/microsoft/WizardLM-2-8x22B
65,536$0.48$0.48
text
deepinfra/microsoft/phi-4
16,384$0.07$0.14
toolstextfunction calling
deepinfra/mistralai/Mistral-Nemo-Instruct-2407
131,072$0.02$0.04
toolstextfunction calling
deepinfra/mistralai/Mistral-Small-24B-Instruct-2501
32,768$0.05$0.08
toolstextfunction calling
deepinfra/mistralai/Mistral-Small-3.2-24B-Instruct-2506
128,000$0.075$0.20
toolstextfunction calling
deepinfra/mistralai/Mixtral-8x7B-Instruct-v0.1
32,768$0.40$0.40
toolstextfunction calling
deepinfra/moonshotai/Kimi-K2-Instruct
131,072$0.50$2.00
toolstextfunction calling
deepinfra/moonshotai/Kimi-K2-Instruct-0905
262,144$0.50$2.00
toolstextfunction calling
deepinfra/nvidia/Llama-3.1-Nemotron-70B-Instruct
131,072$0.60$0.60
toolstextfunction calling
deepinfra/nvidia/Llama-3.3-Nemotron-Super-49B-v1.5
131,072$0.10$0.40
toolstextfunction calling
deepinfra/nvidia/NVIDIA-Nemotron-Nano-9B-v2
131,072$0.04$0.16
toolstextfunction calling
deepinfra/openai/gpt-oss-120b
131,072$0.05$0.45
toolstextfunction calling
deepinfra/openai/gpt-oss-20b
131,072$0.04$0.15
toolstextfunction calling
deepinfra/zai-org/GLM-4.5
131,072$0.40$1.60
toolstextfunction calling
MiMo-V2.5mimo
262,144$0.40$2.00
reasoningtoolstextimageaudiovideofunction calling
MiMo-V2.5-Promimo
1,048,576$1.00$3.00
reasoningtoolstextfunction calling
MiniMax M2.5minimax
204,800$0.27$0.95
reasoningtoolstextfunction calling
GLM-4.7glm
202,752$0.43$1.75
reasoningtoolstextfunction calling
GLM-5glm
202,752$0.80$2.56
reasoningtoolstextfunction calling
GLM-5.1glm
202,752$1.40$4.40
reasoningtoolstextfunction calling
GLM-4.7-Flashglm-flash
202,752$0.06$0.40
reasoningtoolstextfunction calling
GLM-4.6glm
204,800$0.43$1.74
reasoningtoolstextfunction calling
Kimi K2.6kimi
262,144$0.75$3.50
reasoningtoolstextimagevideofunction calling
Kimi K2.5kimi
262,144$0.50$2.80
reasoningtoolstextimagevideofunction calling
Llama 3.3 70B Turbollama
131,072$0.10$0.32
toolstextfunction calling
Llama 4 Maverick 17B FP8llama
1,000,000$0.15$0.60
toolstextimagefunction calling
Llama 4 Scout 17Bllama
10,000,000$0.08$0.30
toolstextimagefunction calling
DeepSeek V4 Flashdeepseek-flash
1,048,576$0.10$0.20
reasoningtoolstextfunction calling
DeepSeek V4 Prodeepseek-thinking
1,048,576$1.30$2.60
reasoningtoolstextfunction calling
DeepSeek-V3.2
163,840$0.26$0.38
reasoningtoolstextfunction calling
DeepSeek-R1-0528
163,840$0.50$2.15
reasoningtext
Qwen3 Coder 480B A35B Instruct Turboqwen
262,144$0.30$1.20
toolstextfunction calling
Qwen3.6 35B A3Bqwen
262,144$0.20$1.00
reasoningtoolstextimagevideofunction calling
Qwen 3.5 35B A3Bqwen
262,144$0.20$0.95
reasoningtoolstextimagevideofunction calling
Qwen 3.5 397B A17Bqwen
262,144$0.54$3.40
reasoningtoolstextimagevideofunction calling
Gemma 4 26B A4B ITgemma
262,144$0.07$0.34
reasoningtoolstextimagefunction calling
Gemma 4 31B ITgemma
262,144$0.13$0.38
reasoningtoolstextimagefunction calling
GPT OSS 120Bgpt-oss
131,072$0.05$0.24
reasoningtoolstextfunction calling
GPT OSS 20Bgpt-oss
131,072$0.03$0.14
reasoningtoolstextfunction calling

Subscription Plans

Monthly plans, quota windows, weighted premium credits, and API access entitlement for this provider.

No subscription plans available yet for this provider.

Uptime History

System Status
100% Uptime
269ms Avg. Latency
30 minutes agoNow
Cost Estimator
Estimated Monthly Cost$0.125
1,000,000@ $0.08 / 1M
010M
500,000@ $0.09 / 1M
05M
Deep Infra Models & Pricing - LLMScape | LLMScape