TAPAS: Thermal- and Power-Aware Scheduling for LLM Inference in Cloud Platforms
Research article (Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 2, 2025) · cited 14× · AI/ML
Press Enter · cited answer in seconds
0 sources
TAPAS: Thermal- and Power-Aware Scheduling for LLM Inference in Cloud Platforms
Summary
TAPAS: Thermal- and Power-Aware Scheduling for LLM Inference in Cloud Platforms is a scholarly article[1].
Key Facts
- TAPAS: Thermal- and Power-Aware Scheduling for LLM Inference in Cloud Platforms's instance of is recorded as scholarly article[2].