# Unlocking Efficiency in Large Language Model Inference: A Comprehensive Survey of Speculative Decoding

> Research article (Findings of the Association for Computational Linguistics: ACL 2024) · cited 31× · AI/ML

**Wikidata**: [openalex:W4402683901](https://www.wikidata.org/wiki/openalex:W4402683901)  
**Source**: https://4ort.xyz/entity/unlocking-efficiency-in-large-language-model-inference-a-comprehensive-survey-of-speculative-decoding