# ShuffleInfer: Disaggregate LLM Inference for Mixed Downstream Workloads

> Research article (ACM Transactions on Architecture and Code Optimization, 2025) · cited 13× · AI/ML

**Wikidata**: [openalex:W4409963643](https://www.wikidata.org/wiki/openalex:W4409963643)  
**Source**: https://4ort.xyz/entity/shuffleinfer-disaggregate-llm-inference-for-mixed-downstream-workloads