💰 Paid Models (86)
| Model | Input/1M | Output/1M | Context | Capabilities |
|---|---|---|---|---|
| Llama-3.2-3B-Instruct meta-llama-llama-32-3b-instruct | $0.020 | $0.020 | 131K | chat |
| Meta-Llama-3.1-8B-Instruct-Turbo meta-llama-meta-llama-31-8b-instruct-turbo | $0.020 | $0.030 | 131K | chat |
| Mistral-Nemo-Instruct-2407 mistralai-mistral-nemo-instruct-2407 | $0.020 | $0.040 | 131K | chat |
| Meta-Llama-3-8B-Instruct meta-llama-meta-llama-3-8b-instruct | $0.030 | $0.060 | 8K | chat |
| Meta-Llama-3.1-8B-Instruct meta-llama-meta-llama-31-8b-instruct | $0.030 | $0.050 | 131K | chat |
| gpt-oss-20b openai-gpt-oss-20b | $0.030 | $0.140 | 131K | chatreasoning |
| DeepSeek-OCR deepseek-ai-deepseek-ocr | $0.030 | $0.100 | 8K | chat |
| gemini-1.5-flash-8b google-gemini-15-flash-8b | $0.037 | $0.150 | 1.0M | chatreasoning |
| gpt-oss-120b openai-gpt-oss-120b | $0.039 | $0.190 | 131K | chatreasoning |
| L3-8B-Lunaris-v1-Turbo sao10k-l3-8b-lunaris-v1-turbo | $0.040 | $0.050 | 8K | chat |
| gemma-3-4b-it google-gemma-3-4b-it | $0.040 | $0.080 | 131K | chat |
| NVIDIA-Nemotron-Nano-9B-v2 nvidia-nvidia-nemotron-nano-9b-v2 | $0.040 | $0.160 | 131K | chat |
| gemma-3-12b-it google-gemma-3-12b-it | $0.040 | $0.130 | 131K | chat |
| Llama-3.2-11B-Vision-Instruct meta-llama-llama-32-11b-vision-instruct | $0.049 | $0.049 | 131K | chatvision |
| Nemotron-3-Nano-30B-A3B nvidia-nemotron-3-nano-30b-a3b | $0.050 | $0.200 | 262K | chat |
| Mistral-Small-24B-Instruct-2501 mistralai-mistral-small-24b-instruct-2501 | $0.050 | $0.080 | 33K | chat |
| GLM-4.7-Flash zai-org-glm-47-flash | $0.060 | $0.400 | 203K | chat |
| phi-4 microsoft-phi-4 | $0.070 | $0.140 | 16K | chat |
| Qwen3-235B-A22B-Instruct-2507 qwen-qwen3-235b-a22b-instruct-2507 | $0.071 | $0.463 | 262K | chatreasoning |
| gemini-1.5-flash google-gemini-15-flash | $0.075 | $0.300 | 1.0M | chatreasoning |
| Mistral-Small-3.2-24B-Instruct-2506 mistralai-mistral-small-32-24b-instruct-2506 | $0.075 | $0.200 | 128K | chat |
| Qwen3-30B-A3B qwen-qwen3-30b-a3b | $0.080 | $0.290 | 41K | chatreasoning |
| Llama-4-Scout-17B-16E-Instruct meta-llama-llama-4-scout-17b-16e-instruct | $0.080 | $0.300 | 328K | chat |
| MythoMax-L2-13b gryphe-mythomax-l2-13b | $0.080 | $0.080 | 4K | chat |
| Qwen3-32B qwen-qwen3-32b | $0.080 | $0.280 | 41K | chatreasoning |
| Qwen3-14B qwen-qwen3-14b | $0.080 | $0.240 | 41K | chatreasoning |
| Qwen3-Next-80B-A3B-Instruct qwen-qwen3-next-80b-a3b-instruct | $0.090 | $1.10 | 262K | chatreasoning |
| olmOCR-2-7B-1025 allenai-olmocr-2-7b-1025 | $0.090 | $0.190 | 16K | chat |
| gemma-3-27b-it google-gemma-3-27b-it | $0.090 | $0.160 | 131K | chat |
| Llama-3.3-70B-Instruct-Turbo meta-llama-llama-33-70b-instruct-turbo | $0.100 | $0.320 | 131K | chat |
| Llama-3.3-Nemotron-Super-49B-v1.5 nvidia-llama-33-nemotron-super-49b-v15 | $0.100 | $0.400 | 131K | chat |
| Qwen2.5-72B-Instruct qwen-qwen25-72b-instruct | $0.120 | $0.390 | 33K | chat |
| PaddleOCR-VL-0.9B paddlepaddle-paddleocr-vl-09b | $0.140 | $0.800 | 16K | chatvision |
| Llama-4-Maverick-17B-128E-Instruct-FP8 meta-llama-llama-4-maverick-17b-128e-instruct-fp8 | $0.150 | $0.600 | 1.0M | chat |
| gpt-oss-120b-Turbo openai-gpt-oss-120b-turbo | $0.150 | $0.600 | 131K | chatreasoning |
| Qwen3-VL-30B-A3B-Instruct qwen-qwen3-vl-30b-a3b-instruct | $0.150 | $0.600 | 262K | chatvisionreasoning |
| Llama-Guard-4-12B meta-llama-llama-guard-4-12b | $0.180 | $0.180 | 164K | chat |
| Qwen2.5-VL-32B-Instruct qwen-qwen25-vl-32b-instruct | $0.200 | $0.600 | 128K | chatvision |
| NVIDIA-Nemotron-Nano-12B-v2-VL nvidia-nvidia-nemotron-nano-12b-v2-vl | $0.200 | $0.600 | 131K | chatvision |
| DeepSeek-V3-0324 deepseek-ai-deepseek-v3-0324 | $0.200 | $0.880 | 164K | chat |
| Qwen3-VL-235B-A22B-Instruct qwen-qwen3-vl-235b-a22b-instruct | $0.200 | $1.20 | 262K | chatvisionreasoning |
| Olmo-3.1-32B-Instruct allenai-olmo-31-32b-instruct | $0.200 | $0.600 | 66K | chat |
| DeepSeek-V3.1-Terminus deepseek-ai-deepseek-v31-terminus | $0.210 | $0.790 | 164K | chatreasoning |
| DeepSeek-V3.1 deepseek-ai-deepseek-v31 | $0.210 | $0.790 | 164K | chatreasoning |
| Qwen3-235B-A22B-Thinking-2507 qwen-qwen3-235b-a22b-thinking-2507 | $0.230 | $2.39 | 262K | chatreasoning |
| DeepSeek-V3.2 deepseek-ai-deepseek-v32 | $0.260 | $0.380 | 164K | chat |
| Qwen3-Coder-480B-A35B-Instruct-Turbo qwen-qwen3-coder-480b-a35b-instruct-turbo | $0.280 | $1.20 | 262K | chatcode |
| MiniMax-M2.1 minimaxai-minimax-m21 | $0.280 | $1.20 | 197K | chat |
| Hermes-3-Llama-3.1-70B nousresearch-hermes-3-llama-31-70b | $0.300 | $0.300 | 131K | chat |
| gemini-2.5-flash google-gemini-25-flash | $0.300 | $2.50 | 1.0M | chatreasoning |
| GLM-4.6V zai-org-glm-46v | $0.300 | $0.900 | 131K | chat |
| DeepSeek-V3 deepseek-ai-deepseek-v3 | $0.320 | $0.890 | 164K | chat |
| Kimi-K2-Instruct-0905 moonshotai-kimi-k2-instruct-0905 | $0.400 | $2.00 | 131K | chat |
| Meta-Llama-3.1-70B-Instruct meta-llama-meta-llama-31-70b-instruct | $0.400 | $0.400 | 131K | chat |
| GLM-4.7 zai-org-glm-47 | $0.400 | $1.90 | 203K | chat |
| Qwen3-Coder-480B-A35B-Instruct qwen-qwen3-coder-480b-a35b-instruct | $0.400 | $1.60 | 262K | chatcode |
| Meta-Llama-3.1-70B-Instruct-Turbo meta-llama-meta-llama-31-70b-instruct-turbo | $0.400 | $0.400 | 131K | chat |
| GLM-4.6 zai-org-glm-46 | $0.430 | $1.75 | 203K | chat |
| Kimi-K2.5 moonshotai-kimi-k25 | $0.450 | $2.80 | 262K | chat |
| Kimi-K2-Thinking moonshotai-kimi-k2-thinking | $0.470 | $2.00 | 131K | chat |
| WizardLM-2-8x22B microsoft-wizardlm-2-8x22b | $0.480 | $0.480 | 66K | chat |
| DeepSeek-R1-0528 deepseek-ai-deepseek-r1-0528 | $0.500 | $2.15 | 164K | chat |
| Mixtral-8x7B-Instruct-v0.1 mistralai-mixtral-8x7b-instruct-v01 | $0.540 | $0.540 | 33K | chat |
| DeepSeek-R1-Distill-Llama-70B deepseek-ai-deepseek-r1-distill-llama-70b | $0.600 | $1.20 | 131K | chat |
| L3.3-70B-Euryale-v2.3 sao10k-l33-70b-euryale-v23 | $0.850 | $0.850 | 131K | chat |
| L3.1-70B-Euryale-v2.2 sao10k-l31-70b-euryale-v22 | $0.850 | $0.850 | 131K | chat |
| Hermes-3-Llama-3.1-405B nousresearch-hermes-3-llama-31-405b | $1.00 | $1.00 | 131K | chat |
| DeepSeek-R1-0528-Turbo deepseek-ai-deepseek-r1-0528-turbo | $1.00 | $3.00 | 33K | chat |
| Llama-3.1-Nemotron-70B-Instruct nvidia-llama-31-nemotron-70b-instruct | $1.20 | $1.20 | 131K | chat |
| gemini-2.5-pro google-gemini-25-pro | $1.25 | $10.00 | 1.0M | chatreasoning |
| claude-4-sonnet anthropic-claude-4-sonnet | $3.30 | $16.50 | 200K | chatreasoning |
| claude-3-7-sonnet-latest anthropic-claude-3-7-sonnet-latest | $3.30 | $16.50 | 200K | chatreasoning |
| claude-4-opus anthropic-claude-4-opus | $16.50 | $82.50 | 200K | chatreasoning |
| e5-base-v2 e5-base-v2 | $512.00 | $0.0050 | - | chat |
| all-MiniLM-L12-v2 all-minilm-l12-v2 | $512.00 | $0.0050 | - | chat |
| multi-qa-mpnet-base-dot-v1 multi-qa-mpnet-base-dot-v1 | $512.00 | $0.0050 | - | chat |
| bge-large-en-v1.5 bge-large-en-v1-5 | $512.00 | $0.010 | - | chat |
| bge-base-en-v1.5 bge-base-en-v1-5 | $512.00 | $0.0050 | - | chat |
| all-MiniLM-L6-v2 all-minilm-l6-v2 | $512.00 | $0.0050 | - | chat |
| gte-base gte-base | $512.00 | $0.0050 | - | chat |
| text2vec-base-chinese text2vec-base-chinese | $512.00 | $0.0050 | - | chat |
| all-mpnet-base-v2 all-mpnet-base-v2 | $512.00 | $0.0050 | - | chat |
| gte-large gte-large | $512.00 | $0.010 | - | chat |
| paraphrase-MiniLM-L6-v2 paraphrase-minilm-l6-v2 | $512.00 | $0.0050 | - | chat |
| multilingual-e5-large multilingual-e5-large | $512.00 | $0.010 | - | chat |
| e5-large-v2 e5-large-v2 | $512.00 | $0.010 | - | chat |