# Large Language Models (LLMs) Inference Offloading and Resource Allocation in Cloud-Edge Computing: An Active Inference Approach

> Research article (IEEE Transactions on Mobile Computing, 2024) · cited 69× · AI/ML

**Wikidata**: [openalex:W4400447774](https://www.wikidata.org/wiki/openalex:W4400447774)  
**Source**: https://4ort.xyz/entity/large-language-models-llms-inference-offloading-and-resource-allocation-in-cloud-edge-computing-an-active-inference-appr