Llama 3.1 Instruct Turbo (8B)
Release Date: 7/23/2024
Benchmarked by
Llama 3.1 Instruct Turbo, 8B parameters with FP8 quantization.
Avg. Accuracy
48.4%
Latency
2.2s
Cost (In/Out)
0.18 / 0.18
Context Window
131k
Max Output Tokens
4k
Input Modality
Benchmarks
Accuracy
Rankings
Academic Benchmarks
Proprietary Benchmarks (contact us to get access)