Llama 3.3 Instruct Turbo (70B)
Release Date: 12/6/2024
Benchmarked by
Llama 3.3 Instruct Turbo, 70B parameters with FP16 quantization.
Avg. Accuracy
53.6%
Latency
4.7s
Cost (In/Out)
0.88 / 0.88
Context Window
128k
Max Output Tokens
4k
Input Modality
Benchmarks
Accuracy
Rankings
Academic Benchmarks
Proprietary Benchmarks (contact us to get access)