New Finance Agent Benchmark Released
fireworks/gpt-oss-20b GPT OSS 20B

Released Date: 8/5/2025

Avg. Accuracy:

74.0%

Latency:

61.70s

Performance by Benchmark

Benchmarks

Accuracy

Rankings

CorpFin

66.6%

( 12 / 44 )

CaseLaw

78.8%

( 37 / 73 )

ContractLaw

66.0%

( 44 / 76 )

TaxEval

68.7%

( 39 / 60 )

AIME

86.0%

( 6 / 50 )

MGSM

89.0%

( 36 / 53 )

LegalBench

71.0%

( 51 / 75 )

MedQA

82.9%

( 34 / 56 )

GPQA

56.8%

( 32 / 52 )

MMLU Pro

67.7%

( 39 / 50 )

LiveCodeBench

80.4%

( 6 / 51 )

Academic Benchmarks
Proprietary Benchmarks (contact us to get access)

Cost Analysis

Input Cost

$0.05 / M Tokens

Output Cost

$0.20 / M Tokens

Input Cost (per char)

$0.02 / M chars

Output Cost (per char)

N/A

Join our mailing list to receive benchmark updates on

Stay up to date as new benchmarks and models are released.