Paper Notes: Speculative Decoding
April 2, 2026·
·
1 min read
Jiangneng Li
Papers:
- Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads (Cai et al., 2024)
- EAGLE: Speculative Sampling Requires Rethinking Feature Uncertainty (Li et al., 2024)
Reading notes coming soon.
