Deep Infra
AudioImageReasoningTextVideo
Deep Infra — AI provider offering 25 models.
up
269ms
Models & Capabilities
Available Models
| Model Name | Context Window | Input Cost (1M) | Output Cost (1M) | Capabilities | |
|---|---|---|---|---|---|
deepinfra/Gryphe/MythoMax-L2-13b | 4,096 | $0.08 | $0.09 | toolstextfunction calling | |
deepinfra/NousResearch/Hermes-3-Llama-3.1-405B | 131,072 | $1.00 | $1.00 | toolstextfunction calling | |
deepinfra/NousResearch/Hermes-3-Llama-3.1-70B | 131,072 | $0.30 | $0.30 | text | |
deepinfra/Qwen/QwQ-32B | 131,072 | $0.15 | $0.40 | toolstextfunction calling | |
deepinfra/Qwen/Qwen2.5-72B-Instruct | 32,768 | $0.12 | $0.39 | toolstextfunction calling | |
deepinfra/Qwen/Qwen2.5-7B-Instruct | 32,768 | $0.04 | $0.10 | text | |
deepinfra/Qwen/Qwen2.5-VL-32B-Instruct | 128,000 | $0.20 | $0.60 | toolstextimagefunction calling | |
deepinfra/Qwen/Qwen3-14B | 40,960 | $0.06 | $0.24 | toolstextfunction calling | |
deepinfra/Qwen/Qwen3-235B-A22B | 40,960 | $0.18 | $0.54 | toolstextfunction calling | |
deepinfra/Qwen/Qwen3-235B-A22B-Instruct-2507 | 262,144 | $0.09 | $0.60 | toolstextfunction calling | |
deepinfra/Qwen/Qwen3-235B-A22B-Thinking-2507 | 262,144 | $0.30 | $2.90 | toolstextfunction calling | |
deepinfra/Qwen/Qwen3-30B-A3B | 40,960 | $0.08 | $0.29 | toolstextfunction calling | |
deepinfra/Qwen/Qwen3-32B | 40,960 | $0.10 | $0.28 | toolstextfunction calling | |
deepinfra/Qwen/Qwen3-Coder-480B-A35B-Instruct | 262,144 | $0.40 | $1.60 | toolstextfunction calling | |
deepinfra/Qwen/Qwen3-Coder-480B-A35B-Instruct-Turbo | 262,144 | $0.29 | $1.20 | toolstextfunction calling | |
deepinfra/Qwen/Qwen3-Next-80B-A3B-Instruct | 262,144 | $0.14 | $1.40 | toolstextfunction calling | |
deepinfra/Qwen/Qwen3-Next-80B-A3B-Thinking | 262,144 | $0.14 | $1.40 | toolstextfunction calling | |
deepinfra/Sao10K/L3-8B-Lunaris-v1-Turbo | 8,192 | $0.04 | $0.05 | text | |
deepinfra/Sao10K/L3.1-70B-Euryale-v2.2 | 131,072 | $0.65 | $0.75 | text | |
deepinfra/Sao10K/L3.3-70B-Euryale-v2.3 | 131,072 | $0.65 | $0.75 | text | |
deepinfra/allenai/olmOCR-7B-0725-FP8 | 16,384 | $0.27 | $1.50 | text | |
deepinfra/anthropic/claude-3-7-sonnet-latest | 200,000 | $3.30 | $16.50 | toolstextfunction calling | |
deepinfra/anthropic/claude-4-opus | 200,000 | $16.50 | $82.50 | toolstextfunction calling | |
deepinfra/anthropic/claude-4-sonnet | 200,000 | $3.30 | $16.50 | toolstextfunction calling | |
deepinfra/deepseek-ai/DeepSeek-R1 | 163,840 | $0.70 | $2.40 | toolstextfunction calling | |
deepinfra/deepseek-ai/DeepSeek-R1-0528 | 163,840 | $0.50 | $2.15 | toolstextfunction calling | |
deepinfra/deepseek-ai/DeepSeek-R1-0528-Turbo | 32,768 | $1.00 | $3.00 | toolstextfunction calling | |
deepinfra/deepseek-ai/DeepSeek-R1-Distill-Llama-70B | 131,072 | $0.20 | $0.60 | text | |
deepinfra/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B | 131,072 | $0.27 | $0.27 | toolstextfunction calling | |
deepinfra/deepseek-ai/DeepSeek-R1-Turbo | 40,960 | $1.00 | $3.00 | toolstextfunction calling | |
deepinfra/deepseek-ai/DeepSeek-V3 | 163,840 | $0.38 | $0.89 | toolstextfunction calling | |
deepinfra/deepseek-ai/DeepSeek-V3-0324 | 163,840 | $0.25 | $0.88 | toolstextfunction calling | |
deepinfra/deepseek-ai/DeepSeek-V3.1 | 163,840 | $0.27 | $1.00 | toolstextfunction calling | |
deepinfra/deepseek-ai/DeepSeek-V3.1-Terminus | 163,840 | $0.27 | $1.00 | toolstextfunction calling | |
deepinfra/google/gemini-2.0-flash-001 | 1,000,000 | $0.10 | $0.40 | toolstextfunction calling | |
deepinfra/google/gemini-2.5-flash | 1,000,000 | $0.30 | $2.50 | toolstextfunction calling | |
deepinfra/google/gemini-2.5-pro | 1,000,000 | $1.25 | $10.00 | toolstextfunction calling | |
deepinfra/google/gemma-3-12b-it | 131,072 | $0.05 | $0.10 | toolstextfunction calling | |
deepinfra/google/gemma-3-27b-it | 131,072 | $0.09 | $0.16 | toolstextfunction calling | |
deepinfra/google/gemma-3-4b-it | 131,072 | $0.04 | $0.08 | toolstextfunction calling | |
deepinfra/meta-llama/Llama-3.2-11B-Vision-Instruct | 131,072 | $0.049 | $0.049 | text | |
deepinfra/meta-llama/Llama-3.2-3B-Instruct | 131,072 | $0.02 | $0.02 | toolstextfunction calling | |
deepinfra/meta-llama/Llama-3.3-70B-Instruct | 131,072 | $0.23 | $0.40 | toolstextfunction calling | |
deepinfra/meta-llama/Llama-3.3-70B-Instruct-Turbo | 131,072 | $0.13 | $0.39 | toolstextfunction calling | |
deepinfra/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 | 1,048,576 | $0.15 | $0.60 | toolstextfunction calling | |
deepinfra/meta-llama/Llama-4-Scout-17B-16E-Instruct | 327,680 | $0.08 | $0.30 | toolstextfunction calling | |
deepinfra/meta-llama/Llama-Guard-3-8B | 131,072 | $0.055 | $0.055 | text | |
deepinfra/meta-llama/Llama-Guard-4-12B | 163,840 | $0.18 | $0.18 | text | |
deepinfra/meta-llama/Meta-Llama-3-8B-Instruct | 8,192 | $0.03 | $0.06 | toolstextfunction calling | |
deepinfra/meta-llama/Meta-Llama-3.1-70B-Instruct | 131,072 | $0.40 | $0.40 | toolstextfunction calling | |
deepinfra/meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo | 131,072 | $0.10 | $0.28 | toolstextfunction calling | |
deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct | 131,072 | $0.03 | $0.05 | toolstextfunction calling | |
deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo | 131,072 | $0.02 | $0.03 | toolstextfunction calling | |
deepinfra/microsoft/WizardLM-2-8x22B | 65,536 | $0.48 | $0.48 | text | |
deepinfra/microsoft/phi-4 | 16,384 | $0.07 | $0.14 | toolstextfunction calling | |
deepinfra/mistralai/Mistral-Nemo-Instruct-2407 | 131,072 | $0.02 | $0.04 | toolstextfunction calling | |
deepinfra/mistralai/Mistral-Small-24B-Instruct-2501 | 32,768 | $0.05 | $0.08 | toolstextfunction calling | |
deepinfra/mistralai/Mistral-Small-3.2-24B-Instruct-2506 | 128,000 | $0.075 | $0.20 | toolstextfunction calling | |
deepinfra/mistralai/Mixtral-8x7B-Instruct-v0.1 | 32,768 | $0.40 | $0.40 | toolstextfunction calling | |
deepinfra/moonshotai/Kimi-K2-Instruct | 131,072 | $0.50 | $2.00 | toolstextfunction calling | |
deepinfra/moonshotai/Kimi-K2-Instruct-0905 | 262,144 | $0.50 | $2.00 | toolstextfunction calling | |
deepinfra/nvidia/Llama-3.1-Nemotron-70B-Instruct | 131,072 | $0.60 | $0.60 | toolstextfunction calling | |
deepinfra/nvidia/Llama-3.3-Nemotron-Super-49B-v1.5 | 131,072 | $0.10 | $0.40 | toolstextfunction calling | |
deepinfra/nvidia/NVIDIA-Nemotron-Nano-9B-v2 | 131,072 | $0.04 | $0.16 | toolstextfunction calling | |
deepinfra/openai/gpt-oss-120b | 131,072 | $0.05 | $0.45 | toolstextfunction calling | |
deepinfra/openai/gpt-oss-20b | 131,072 | $0.04 | $0.15 | toolstextfunction calling | |
deepinfra/zai-org/GLM-4.5 | 131,072 | $0.40 | $1.60 | toolstextfunction calling | |
MiMo-V2.5mimo | 262,144 | $0.40 | $2.00 | reasoningtoolstextimageaudiovideofunction calling | |
MiMo-V2.5-Promimo | 1,048,576 | $1.00 | $3.00 | reasoningtoolstextfunction calling | |
MiniMax M2.5minimax | 204,800 | $0.27 | $0.95 | reasoningtoolstextfunction calling | |
GLM-4.7glm | 202,752 | $0.43 | $1.75 | reasoningtoolstextfunction calling | |
GLM-5glm | 202,752 | $0.80 | $2.56 | reasoningtoolstextfunction calling | |
GLM-5.1glm | 202,752 | $1.40 | $4.40 | reasoningtoolstextfunction calling | |
GLM-4.7-Flashglm-flash | 202,752 | $0.06 | $0.40 | reasoningtoolstextfunction calling | |
GLM-4.6glm | 204,800 | $0.43 | $1.74 | reasoningtoolstextfunction calling | |
Kimi K2.6kimi | 262,144 | $0.75 | $3.50 | reasoningtoolstextimagevideofunction calling | |
Kimi K2.5kimi | 262,144 | $0.50 | $2.80 | reasoningtoolstextimagevideofunction calling | |
Llama 3.3 70B Turbollama | 131,072 | $0.10 | $0.32 | toolstextfunction calling | |
Llama 4 Maverick 17B FP8llama | 1,000,000 | $0.15 | $0.60 | toolstextimagefunction calling | |
Llama 4 Scout 17Bllama | 10,000,000 | $0.08 | $0.30 | toolstextimagefunction calling | |
DeepSeek V4 Flashdeepseek-flash | 1,048,576 | $0.10 | $0.20 | reasoningtoolstextfunction calling | |
DeepSeek V4 Prodeepseek-thinking | 1,048,576 | $1.30 | $2.60 | reasoningtoolstextfunction calling | |
DeepSeek-V3.2 | 163,840 | $0.26 | $0.38 | reasoningtoolstextfunction calling | |
DeepSeek-R1-0528 | 163,840 | $0.50 | $2.15 | reasoningtext | |
Qwen3 Coder 480B A35B Instruct Turboqwen | 262,144 | $0.30 | $1.20 | toolstextfunction calling | |
Qwen3.6 35B A3Bqwen | 262,144 | $0.20 | $1.00 | reasoningtoolstextimagevideofunction calling | |
Qwen 3.5 35B A3Bqwen | 262,144 | $0.20 | $0.95 | reasoningtoolstextimagevideofunction calling | |
Qwen 3.5 397B A17Bqwen | 262,144 | $0.54 | $3.40 | reasoningtoolstextimagevideofunction calling | |
Gemma 4 26B A4B ITgemma | 262,144 | $0.07 | $0.34 | reasoningtoolstextimagefunction calling | |
Gemma 4 31B ITgemma | 262,144 | $0.13 | $0.38 | reasoningtoolstextimagefunction calling | |
GPT OSS 120Bgpt-oss | 131,072 | $0.05 | $0.24 | reasoningtoolstextfunction calling | |
GPT OSS 20Bgpt-oss | 131,072 | $0.03 | $0.14 | reasoningtoolstextfunction calling |
Subscription Plans
Monthly plans, quota windows, weighted premium credits, and API access entitlement for this provider.
No subscription plans available yet for this provider.
Uptime History
System Status
100% Uptime
269ms Avg. Latency
30 minutes agoNow
Cost Estimator
Estimated Monthly Cost$0.125
1,000,000@ $0.08 / 1M
010M
500,000@ $0.09 / 1M
05M