
Release Date: 9/19/2025

Avg. Accuracy: 67.8%

Latency: 121.40s

Performance by Benchmark

Benchmark        Accuracy   Ranking
FinanceAgent     54.7%      1 / 36
CorpFin          73.3%      3 / 47
CaseLaw          71.4%      13 / 30
TaxEval          68.2%      42 / 64
MortgageTax      55.1%      31 / 37
AIME             91.2%      3 / 54
MGSM             90.9%      27 / 57
LegalBench       81.7%      13 / 79
MedQA            92.1%      16 / 60
GPQA             85.1%      3 / 56
MMLU Pro         79.7%      21 / 54
MMMU             72.8%      13 / 34
LiveCodeBench    79.0%      8 / 56
IOI              11.5%      5 / 16
Terminal-Bench   26.3%      10 / 15
SWE-bench        52.4%      5 / 17

Academic Benchmarks
Proprietary Benchmarks (contact us to get access)
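
The page does not say how the 67.8% average accuracy is derived. As a rough check, and assuming it is an unweighted mean over the 16 benchmarks listed above (an assumption, not something the page confirms), the figure can be reproduced with a short Python sketch. The dictionary name `accuracies` is purely illustrative.

```python
# Per-benchmark accuracies (percent), copied from the table above.
accuracies = {
    "FinanceAgent": 54.7, "CorpFin": 73.3, "CaseLaw": 71.4, "TaxEval": 68.2,
    "MortgageTax": 55.1, "AIME": 91.2, "MGSM": 90.9, "LegalBench": 81.7,
    "MedQA": 92.1, "GPQA": 85.1, "MMLU Pro": 79.7, "MMMU": 72.8,
    "LiveCodeBench": 79.0, "IOI": 11.5, "Terminal-Bench": 26.3, "SWE-bench": 52.4,
}

# Assumption: the headline number is a simple unweighted mean.
mean_accuracy = sum(accuracies.values()) / len(accuracies)

print(f"{len(accuracies)} benchmarks, mean accuracy {mean_accuracy:.1f}%")
# prints "16 benchmarks, mean accuracy 67.8%"
```

The listed values sum to 1085.4, and 1085.4 / 16 = 67.8375, which rounds to the reported 67.8%. This is consistent with an unweighted mean but does not rule out other weightings.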

Cost Analysis

Input Cost: $0.20 / M Tokens

Output Cost: $0.50 / M Tokens

Input Cost (per char): N/A

Output Cost (per char): N/A
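
To make the per-token prices concrete, here is a minimal Python sketch that estimates the cost of a single request from the listed rates. The function name `request_cost_usd` and the token counts are hypothetical, chosen only for illustration. Real billing may differ (for example, cached or reasoning tokens may be priced separately).

```python
# Listed prices in USD per one million tokens.
INPUT_USD_PER_M = 0.20
OUTPUT_USD_PER_M = 0.50


def request_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request from raw token counts (hypothetical helper)."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


# Example: a 10,000-token prompt that produces a 2,000-token answer.
cost = request_cost_usd(10_000, 2_000)
print(f"${cost:.4f}")  # prints "$0.0030"
```

At these rates, one million input tokens plus one million output tokens would come to $0.70.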
