Llama 3.3 Nemotron Super (Thinking)
Release Date: 3/18/2025
Benchmarked by
Nvidia's 49B Nemotron model, fine-tuned with Llama 3.3 weights.
Avg. Accuracy
69.5%
Latency
75.1s
Cost (In/Out)
N/A
Context Window
131k
Max Output Tokens
33k
Input Modality
Benchmarks
Accuracy
Rankings
Academic Benchmarks
Proprietary Benchmarks (contact us to get access)