Interactive Benchmark Dashboard
View the interactive benchmark dashboard to explore SpecForge performance results:
The dashboard displays the following key metrics:
- Acceptance Length: Average number of tokens accepted per speculation step
- Throughput: Output tokens generated per second (tokens/s)
- Speedup: Performance improvement ratio over baseline
Benchmark Datasets
View results across multiple benchmarks:
- MTBench
- HumanEval
- GSM8K
- Math500
If the dashboard doesn't load, please ensure JavaScript is enabled in your browser.