New Finance Agent Benchmark Released

Nvidia's 49B Nemotron model, fine-tuned with Llama 3.3 weights.

Released Date: 3/18/2025

Avg. Accuracy:

69.7%

Latency:

62.07s

Performance by Benchmark

Benchmarks

Accuracy

Rankings

CaseLaw

81.9%

( 24 / 69 )

ContractLaw

58.9%

( 66 / 72 )

TaxEval

67.1%

( 38 / 56 )

Math500

91.4%

( 17 / 52 )

AIME

53.5%

( 17 / 46 )

MGSM

86.4%

( 38 / 49 )

GPQA

60.6%

( 24 / 48 )

MMLU Pro

69.1%

( 34 / 46 )

LiveCodeBench

58.4%

( 19 / 47 )

Academic Benchmarks
Proprietary Benchmarks (contact us to get access)

Cost Analysis

Cost information not available for this model

Join our mailing list to receive benchmark updates on

Stay up to date as new benchmarks and models are released.