Groq
Inference Optimization • US
Ultra-fast inference
💰 Paid Models (17)
| Model | ID | Input ($/1M tokens) | Output ($/1M tokens) | Context | Capabilities |
|---|---|---|---|---|---|
| Llama 3.1 8B | `llama-3-1-8b` | $0.050 | $0.080 | 128K | chat, function_calling |
| Llama 3 8B | `llama-3-8b` | $0.050 | $0.080 | 8K | chat |
| Llama 3.1 8B Instant | `llama-3-1-8b-instant` | $0.050 | $0.080 | 128K | chat, function_calling |
| GPT-OSS 20B | `gpt-oss-20b` | $0.075 | $0.300 | 128K | chat |
| Llama 4 Scout | `llama-4-scout` | $0.110 | $0.340 | 128K | chat, function_calling |
| GPT-OSS 120B | `gpt-oss-120b` | $0.150 | $0.600 | 128K | chat |
| Llama 4 Maverick | `llama-4-maverick` | $0.200 | $0.600 | 128K | chat, function_calling |
| Gemma 2 9B | `gemma-2-9b` | $0.200 | $0.200 | 8K | chat |
| Llama Guard 4 12B | `llama-guard-4-12b` | $0.200 | $0.200 | 128K | chat, moderation |
| Mixtral 8x7B | `mixtral-8x7b` | $0.240 | $0.240 | 33K | chat |
| Qwen3 32B | `qwen3-32b` | $0.290 | $0.590 | 131K | chat, function_calling |
| Llama 3 70B | `llama-3-70b` | $0.590 | $0.790 | 8K | chat |
| Llama 3.3 70B | `llama-3-3-70b` | $0.590 | $0.790 | 128K | chat, function_calling |
| Llama 3.1 70B | `llama-3-1-70b` | $0.590 | $0.790 | 128K | chat, function_calling |
| Llama 3.3 70B Versatile | `llama-3-3-70b-versatile` | $0.590 | $0.790 | 128K | chat, function_calling |
| DeepSeek R1 Distill Llama 70B | `deepseek-r1-distill-llama-70b` | $0.750 | $0.990 | 128K | chat, reasoning |
| Kimi K2 | `kimi-k2` | $1.000 | $3.000 | 256K | chat, reasoning |
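
Because prices are quoted per 1M tokens, the cost of a single request is (input tokens × input price + output tokens × output price) / 1,000,000. The sketch below applies that arithmetic to three rows of the table. The `PRICES` dictionary and the `estimate_cost` helper are hypothetical names introduced for illustration, not part of any Groq SDK. The model IDs are the ones shown on this page and may differ from the identifiers the API actually accepts.

```python
# Prices in USD per 1M tokens (input, output), copied from three rows of the table.
# PRICES and estimate_cost are illustrative names, not part of any official SDK.
PRICES = {
    "llama-3-1-8b-instant": (0.050, 0.080),
    "llama-3-3-70b-versatile": (0.590, 0.790),
    "kimi-k2": (1.00, 3.00),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for one request."""
    input_price, output_price = PRICES[model]
    # Prices are per 1M tokens, so divide the weighted token count by 1,000,000.
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    # Compare the same workload (10K input tokens, 2K output tokens) across models.
    for model in PRICES:
        cost = estimate_cost(model, input_tokens=10_000, output_tokens=2_000)
        print(f"{model}: ${cost:.6f}")
```

Output tokens cost more than input tokens for most models here, so generation-heavy workloads shift the comparison toward models with a lower output price.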