SlimSpec: Low-Rank Draft LM-Head for Accelerated Speculative Decoding Paper • 2605.10453 • Published 7 days ago • 8
LK Losses: Direct Acceptance Rate Optimization for Speculative Decoding Paper • 2602.23881 • Published Feb 27 • 18