Models & Capabilities
Available Models
| Model Name | Context Window | Input Cost (per 1M tokens) | Output Cost (per 1M tokens) | Capabilities |
|---|---|---|---|---|
| deepinfra/Gryphe/MythoMax-L2-13b | 4,096 | $0.08 | $0.09 | tools, text, function calling |
| deepinfra/NousResearch/Hermes-3-Llama-3.1-405B | 131,072 | $1.00 | $1.00 | tools, text, function calling |
| deepinfra/NousResearch/Hermes-3-Llama-3.1-70B | 131,072 | $0.30 | $0.30 | text |
| deepinfra/Qwen/QwQ-32B | 131,072 | $0.15 | $0.40 | tools, text, function calling |
| deepinfra/Qwen/Qwen2.5-72B-Instruct | 32,768 | $0.12 | $0.39 | tools, text, function calling |
| deepinfra/Qwen/Qwen2.5-7B-Instruct | 32,768 | $0.04 | $0.10 | text |
| deepinfra/Qwen/Qwen2.5-VL-32B-Instruct | 128,000 | $0.20 | $0.60 | tools, text, image, function calling |
| deepinfra/Qwen/Qwen3-14B | 40,960 | $0.06 | $0.24 | tools, text, function calling |
| deepinfra/Qwen/Qwen3-235B-A22B | 40,960 | $0.18 | $0.54 | tools, text, function calling |
| deepinfra/Qwen/Qwen3-235B-A22B-Instruct-2507 | 262,144 | $0.09 | $0.60 | tools, text, function calling |
| deepinfra/Qwen/Qwen3-235B-A22B-Thinking-2507 | 262,144 | $0.30 | $2.90 | tools, text, function calling |
| deepinfra/Qwen/Qwen3-30B-A3B | 40,960 | $0.08 | $0.29 | tools, text, function calling |
| deepinfra/Qwen/Qwen3-32B | 40,960 | $0.10 | $0.28 | tools, text, function calling |
| deepinfra/Qwen/Qwen3-Coder-480B-A35B-Instruct | 262,144 | $0.40 | $1.60 | tools, text, function calling |
| deepinfra/Qwen/Qwen3-Coder-480B-A35B-Instruct-Turbo | 262,144 | $0.29 | $1.20 | tools, text, function calling |
| deepinfra/Qwen/Qwen3-Next-80B-A3B-Instruct | 262,144 | $0.14 | $1.40 | tools, text, function calling |
| deepinfra/Qwen/Qwen3-Next-80B-A3B-Thinking | 262,144 | $0.14 | $1.40 | tools, text, function calling |
| deepinfra/Sao10K/L3-8B-Lunaris-v1-Turbo | 8,192 | $0.04 | $0.05 | text |
| deepinfra/Sao10K/L3.1-70B-Euryale-v2.2 | 131,072 | $0.65 | $0.75 | text |
| deepinfra/Sao10K/L3.3-70B-Euryale-v2.3 | 131,072 | $0.65 | $0.75 | text |
| deepinfra/allenai/olmOCR-7B-0725-FP8 | 16,384 | $0.27 | $1.50 | text |
| deepinfra/anthropic/claude-3-7-sonnet-latest | 200,000 | $3.30 | $16.50 | tools, text, function calling |
| deepinfra/anthropic/claude-4-opus | 200,000 | $16.50 | $82.50 | tools, text, function calling |
| deepinfra/anthropic/claude-4-sonnet | 200,000 | $3.30 | $16.50 | tools, text, function calling |
| deepinfra/deepseek-ai/DeepSeek-R1 | 163,840 | $0.70 | $2.40 | tools, text, function calling |
| deepinfra/deepseek-ai/DeepSeek-R1-0528 | 163,840 | $0.50 | $2.15 | tools, text, function calling |
| deepinfra/deepseek-ai/DeepSeek-R1-0528-Turbo | 32,768 | $1.00 | $3.00 | tools, text, function calling |
| deepinfra/deepseek-ai/DeepSeek-R1-Distill-Llama-70B | 131,072 | $0.20 | $0.60 | text |
| deepinfra/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B | 131,072 | $0.27 | $0.27 | tools, text, function calling |
| deepinfra/deepseek-ai/DeepSeek-R1-Turbo | 40,960 | $1.00 | $3.00 | tools, text, function calling |
| deepinfra/deepseek-ai/DeepSeek-V3 | 163,840 | $0.38 | $0.89 | tools, text, function calling |
| deepinfra/deepseek-ai/DeepSeek-V3-0324 | 163,840 | $0.25 | $0.88 | tools, text, function calling |
| deepinfra/deepseek-ai/DeepSeek-V3.1 | 163,840 | $0.27 | $1.00 | tools, text, function calling |
| deepinfra/deepseek-ai/DeepSeek-V3.1-Terminus | 163,840 | $0.27 | $1.00 | tools, text, function calling |
| deepinfra/google/gemini-2.0-flash-001 | 1,000,000 | $0.10 | $0.40 | tools, text, function calling |
| deepinfra/google/gemini-2.5-flash | 1,000,000 | $0.30 | $2.50 | tools, text, function calling |
| deepinfra/google/gemini-2.5-pro | 1,000,000 | $1.25 | $10.00 | tools, text, function calling |
| deepinfra/google/gemma-3-12b-it | 131,072 | $0.05 | $0.10 | tools, text, function calling |
| deepinfra/google/gemma-3-27b-it | 131,072 | $0.09 | $0.16 | tools, text, function calling |
| deepinfra/google/gemma-3-4b-it | 131,072 | $0.04 | $0.08 | tools, text, function calling |
| deepinfra/meta-llama/Llama-3.2-11B-Vision-Instruct | 131,072 | $0.049 | $0.049 | text |
| deepinfra/meta-llama/Llama-3.2-3B-Instruct | 131,072 | $0.02 | $0.02 | tools, text, function calling |
| deepinfra/meta-llama/Llama-3.3-70B-Instruct | 131,072 | $0.23 | $0.40 | tools, text, function calling |
| deepinfra/meta-llama/Llama-3.3-70B-Instruct-Turbo | 131,072 | $0.13 | $0.39 | tools, text, function calling |
| deepinfra/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 | 1,048,576 | $0.15 | $0.60 | tools, text, function calling |
| deepinfra/meta-llama/Llama-4-Scout-17B-16E-Instruct | 327,680 | $0.08 | $0.30 | tools, text, function calling |
| deepinfra/meta-llama/Llama-Guard-3-8B | 131,072 | $0.055 | $0.055 | text |
| deepinfra/meta-llama/Llama-Guard-4-12B | 163,840 | $0.18 | $0.18 | text |
| deepinfra/meta-llama/Meta-Llama-3-8B-Instruct | 8,192 | $0.03 | $0.06 | tools, text, function calling |
| deepinfra/meta-llama/Meta-Llama-3.1-70B-Instruct | 131,072 | $0.40 | $0.40 | tools, text, function calling |
| deepinfra/meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo | 131,072 | $0.10 | $0.28 | tools, text, function calling |
| deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct | 131,072 | $0.03 | $0.05 | tools, text, function calling |
| deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo | 131,072 | $0.02 | $0.03 | tools, text, function calling |
| deepinfra/microsoft/WizardLM-2-8x22B | 65,536 | $0.48 | $0.48 | text |
| deepinfra/microsoft/phi-4 | 16,384 | $0.07 | $0.14 | tools, text, function calling |
| deepinfra/mistralai/Mistral-Nemo-Instruct-2407 | 131,072 | $0.02 | $0.04 | tools, text, function calling |
| deepinfra/mistralai/Mistral-Small-24B-Instruct-2501 | 32,768 | $0.05 | $0.08 | tools, text, function calling |
| deepinfra/mistralai/Mistral-Small-3.2-24B-Instruct-2506 | 128,000 | $0.075 | $0.20 | tools, text, function calling |
| deepinfra/mistralai/Mixtral-8x7B-Instruct-v0.1 | 32,768 | $0.40 | $0.40 | tools, text, function calling |
| deepinfra/moonshotai/Kimi-K2-Instruct | 131,072 | $0.50 | $2.00 | tools, text, function calling |
| deepinfra/moonshotai/Kimi-K2-Instruct-0905 | 262,144 | $0.50 | $2.00 | tools, text, function calling |
| deepinfra/nvidia/Llama-3.1-Nemotron-70B-Instruct | 131,072 | $0.60 | $0.60 | tools, text, function calling |
| deepinfra/nvidia/Llama-3.3-Nemotron-Super-49B-v1.5 | 131,072 | $0.10 | $0.40 | tools, text, function calling |
| deepinfra/nvidia/NVIDIA-Nemotron-Nano-9B-v2 | 131,072 | $0.04 | $0.16 | tools, text, function calling |
| deepinfra/openai/gpt-oss-120b | 131,072 | $0.05 | $0.45 | tools, text, function calling |
| deepinfra/openai/gpt-oss-20b | 131,072 | $0.04 | $0.15 | tools, text, function calling |
| deepinfra/zai-org/GLM-4.5 | 131,072 | $0.40 | $1.60 | tools, text, function calling |
| Qwen3 Coder 480B A35B Instruct Turbo (qwen) | 262,144 | $0.30 | $1.20 | tools, text, function calling |
| Qwen3 Coder 480B A35B Instruct (qwen) | 262,144 | $0.40 | $1.60 | tools, text, function calling |
| GLM-4.7-Flash (glm-flash) | 202,752 | $0.06 | $0.40 | reasoning, tools, text, function calling |
| GLM-4.5 (glm) | 131,072 | $0.60 | $2.20 | tools, text, function calling |
| GLM-4.7 (glm) | 202,752 | $0.43 | $1.75 | reasoning, tools, text, function calling |
| GLM-5.1 (glm) | 202,752 | $1.40 | $4.40 | reasoning, tools, text, function calling |
| GLM-5 (glm) | 202,752 | $0.80 | $2.56 | reasoning, tools, text, function calling |
| GLM-4.6V (glm) | 204,800 | $0.30 | $0.90 | reasoning, tools, text, image, function calling |
| GLM-4.6 (glm) | 204,800 | $0.43 | $1.74 | reasoning, tools, text, function calling |
| Llama 4 Scout 17B (llama) | 10,000,000 | $0.08 | $0.30 | tools, text, image, function calling |
| Llama 3.1 8B (llama) | 131,072 | $0.02 | $0.05 | tools, text, function calling |
| Llama 3.1 70B (llama) | 131,072 | $0.40 | $0.40 | tools, text, function calling |
| Llama 3.1 8B Turbo (llama) | 131,072 | $0.02 | $0.03 | tools, text, function calling |
| Llama 3.3 70B Turbo (llama) | 131,072 | $0.10 | $0.32 | tools, text, function calling |
| Llama 4 Maverick 17B FP8 (llama) | 1,000,000 | $0.15 | $0.60 | tools, text, image, function calling |
| Llama 3.1 70B Turbo (llama) | 131,072 | $0.40 | $0.40 | tools, text, function calling |
| DeepSeek-R1-0528 | 163,840 | $0.50 | $2.15 | reasoning, text |
| DeepSeek-V3.2 | 163,840 | $0.26 | $0.38 | reasoning, tools, text, function calling |
| GPT OSS 20B (gpt-oss) | 131,072 | $0.03 | $0.14 | reasoning, tools, text, function calling |
| GPT OSS 120B (gpt-oss) | 131,072 | $0.05 | $0.24 | reasoning, tools, text, function calling |
| Kimi K2 Thinking (kimi-thinking) | 131,072 | $0.47 | $2.00 | reasoning, tools, text, function calling |
| Kimi K2 (kimi) | 131,072 | $0.50 | $2.00 | tools, text, function calling |
| Kimi K2 0905 (kimi) | 262,144 | $0.40 | $2.00 | tools, text, function calling |
| Kimi K2.5 (kimi) | 262,144 | $0.50 | $2.80 | reasoning, tools, text, image, video, function calling |
| MiniMax M2 (minimax) | 262,144 | $0.254 | $1.02 | reasoning, tools, text, function calling |
| MiniMax M2.5 (minimax) | 204,800 | $0.27 | $0.95 | reasoning, tools, text, function calling |
| MiniMax M2.1 | 196,608 | $0.28 | $1.20 | reasoning, tools, text, function calling |
| Claude Sonnet 3.7 (Latest) (claude-sonnet) | 200,000 | $3.30 | $16.50 | reasoning, tools, text, image, function calling |
| Claude Opus 4 (claude-opus) | 200,000 | $16.50 | $82.50 | reasoning, tools, text, image, function calling |
Subscription Plans
Monthly plans, quota windows, weighted premium credits, and API access entitlement for this provider.
No subscription plans available yet for this provider.
Cost Estimator
Estimated Monthly Cost: $0.125
Input tokens: 1,000,000 @ $0.08 / 1M (slider range: 0 to 10M)
Output tokens: 500,000 @ $0.09 / 1M (slider range: 0 to 5M)
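
The estimator figures above are consistent with a simple linear formula: each token volume is divided by one million, multiplied by its per-1M rate, and the two results are summed. The sketch below reproduces that arithmetic. It assumes the first volume is input tokens and the second is output tokens (the rates match the $0.08 and $0.09 listed for MythoMax-L2-13b), and the function name `estimate_monthly_cost` is hypothetical, not part of any provider API.

```python
from decimal import Decimal

TOKENS_PER_PRICING_UNIT = Decimal(1_000_000)  # prices are quoted per 1M tokens


def estimate_monthly_cost(input_tokens, output_tokens,
                          input_price_per_1m, output_price_per_1m):
    """Return the estimated cost in dollars as a Decimal.

    Hypothetical helper: assumes cost scales linearly with token volume,
    with no minimum charge, tiering, or rounding applied.
    """
    input_cost = (Decimal(input_tokens) / TOKENS_PER_PRICING_UNIT
                  * Decimal(input_price_per_1m))
    output_cost = (Decimal(output_tokens) / TOKENS_PER_PRICING_UNIT
                   * Decimal(output_price_per_1m))
    return input_cost + output_cost


# Figures from the estimator: 1,000,000 tokens at $0.08 / 1M
# and 500,000 tokens at $0.09 / 1M.
cost = estimate_monthly_cost(1_000_000, 500_000, "0.08", "0.09")
print(f"${cost}")  # prints "$0.125"
```

Prices are passed as strings and handled with `Decimal` so that sub-cent rates such as $0.049 do not pick up binary floating-point error.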