LLMScape

Compare AI Models by Price, Context & Capabilities

Nebius Token Factory logo

Nebius Token Factory

ImageReasoningText

Nebius Token Factory — AI provider offering 46 models.

up
364ms

Models & Capabilities

Available Models
Model NameContext WindowInput Cost (1M)Output Cost (1M)Capabilities
GLM-4.7 (FP8)
128,000$0.40$2.00
toolstextfunction calling
GLM-4.5-Air
128,000$0.20$1.20
toolstextfunction calling
GLM-4.5
128,000$0.60$2.20
toolstextfunction calling
Llama-3.1-Nemotron-Ultra-253B-v1
128,000$0.60$1.80
toolstextfunction calling
Nemotron-Nano-V2-12b
32,000$0.07$0.20
toolstextfunction calling
Nemotron-3-Nano-30B-A3B
32,000$0.06$0.24
toolstextfunction calling
Hermes-4-405B
128,000$1.00$3.00
reasoningtoolstextfunction calling
Hermes-4-70B
128,000$0.13$0.40
reasoningtoolstextfunction calling
BGE-ICLtext-embedding
32,768$0.01$0.00
text
bge-multilingual-gemma2text-embedding
8,192$0.01$0.00
text
INTELLECT-3
128,000$0.20$1.10
toolstextfunction calling
MiniMax-M2.1
128,000$0.30$1.20
reasoningtoolstextfunction calling
DeepSeek-V3-0324 (Fast)
128,000$0.75$2.25
toolstextfunction calling
DeepSeek R1 0528 Fastdeepseek
131,072$2.00$6.00
reasoningtoolstextfunction calling
DeepSeek-R1-0528
128,000$0.80$2.40
reasoningtoolstextfunction calling
DeepSeek-V3-0324
128,000$0.50$1.50
toolstextfunction calling
DeepSeek-V3.2
128,000$0.30$0.45
reasoningtoolstextfunction calling
e5-mistral-7b-instructtext-embedding
32,768$0.01$0.00
text
Kimi-K2-Instruct
200,000$0.50$2.40
toolstextimagefunction calling
Kimi-K2.5kimi
262,144$0.50$2.50
reasoningtoolstextimagefunction calling
Kimi-K2-Thinking
128,000$0.60$2.50
reasoningtoolstextfunction calling
Gemma-2-2b-it
8,192$0.02$0.06
text
Gemma-3-27b-it (Fast)
128,000$0.20$0.60
toolstextimagefunction calling
Gemma-2-9b-it (Fast)
8,192$0.03$0.09
text
Gemma-3-27b-it
128,000$0.10$0.30
toolstextimagefunction calling
Qwen3 235B A22B Instruct 2507qwen
262,144$0.20$0.60
reasoningtoolstextfunction calling
Qwen3-Next-80B-A3B-Thinking
128,000$0.15$1.20
reasoningtoolstextfunction calling
Qwen2.5-Coder-7B (Fast)
128,000$0.03$0.09
toolstextfunction calling
Qwen3 Coder 480B A35B Instructqwen
262,144$0.40$1.80
toolstextfunction calling
Qwen3-Embedding-8Btext-embedding
32,768$0.01$0.00
text
Qwen3-32B
128,000$0.10$0.30
toolstextfunction calling
Qwen3-30B-A3B-Instruct-2507
128,000$0.10$0.30
toolstextfunction calling
Qwen2.5-VL-72B-Instruct
128,000$0.25$0.75
toolstextimagefunction calling
Qwen3-Coder-30B-A3B-Instruct
128,000$0.10$0.30
toolstextfunction calling
Qwen3-30B-A3B-Thinking-2507
128,000$0.10$0.30
reasoningtoolstextfunction calling
Qwen3-32B (Fast)
128,000$0.20$0.60
toolstextfunction calling
Qwen3 235B A22B Thinking 2507qwen
262,144$0.20$0.80
reasoningtoolstextfunction calling
Llama-Guard-3-8B
8,192$0.02$0.06
text
Meta-Llama-3.1-8B-Instruct
128,000$0.02$0.06
toolstextfunction calling
Llama-3.3-70B-Instruct
128,000$0.13$0.40
toolstextfunction calling
Llama-3.3-70B-Instruct (Fast)
128,000$0.25$0.75
toolstextfunction calling
Meta-Llama-3.1-8B-Instruct (Fast)
128,000$0.03$0.09
toolstextfunction calling
gpt-oss-120b
128,000$0.15$0.60
reasoningtoolstextfunction calling
gpt-oss-20b
128,000$0.05$0.20
toolstextfunction calling
FLUX.1-dev
77$0.00$0.00
text
FLUX.1-schnell
77$0.00$0.00
text

Uptime History

System Status
100% Uptime
434ms Avg. Latency
30 minutes agoNow
Cost Estimator
Estimated Monthly Cost$1.40
1,000,000@ $0.40 / 1M
010M
500,000@ $2.00 / 1M
05M