Llama 3.1 Instruct Turbo (70B)
Release Date: 7/23/2024
Llama 3.1 Instruct Turbo, 70B parameters with FP8 quantization.
Avg. Accuracy: 67.3%
Latency: 4.3s
Cost (In/Out): 0.88 / 0.88
Context Window: 131k
Max Output Tokens: 4k
Input Modality
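The cost and limit figures above can be turned into a rough per-request estimate. The following is a minimal sketch, not an official calculator. It assumes the listed 0.88 / 0.88 prices are US dollars per one million tokens (the page does not state the unit), that "131k" and "4k" mean 131,000 and 4,000 tokens, and that input and output tokens share the context window. The function name `estimate_cost` and the constants are hypothetical.

```python
# Assumption: prices are USD per 1,000,000 tokens; the page does not state the unit.
INPUT_PRICE_PER_M = 0.88
OUTPUT_PRICE_PER_M = 0.88

# Assumption: "131k" and "4k" are read as round thousands of tokens.
CONTEXT_WINDOW = 131_000
MAX_OUTPUT_TOKENS = 4_000


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return an estimated request cost, in the assumed unit (USD)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if output_tokens > MAX_OUTPUT_TOKENS:
        raise ValueError("output exceeds the listed max output tokens")
    # Assumption: input and output tokens both count against the context window.
    if input_tokens + output_tokens > CONTEXT_WINDOW:
        raise ValueError("request exceeds the listed context window")
    return (
        input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    ) / 1_000_000


if __name__ == "__main__":
    # Example: a 100,000-token prompt with a 4,000-token completion.
    print(f"{estimate_cost(100_000, 4_000):.5f}")
```

Because the input and output prices are equal here, the estimate depends only on the total token count. Keeping the two rates separate makes the sketch reusable for models priced asymmetrically.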