# Understanding and Improving Failure Tolerant Training for Deep Learning Recommendation with Partial Recovery

> Research article (Proceedings of Machine Learning and Systems, 2021) · cited 10× · AI/ML

**Wikidata**: [openalex:W3137114593](https://www.wikidata.org/wiki/openalex:W3137114593)  
**Source**: https://4ort.xyz/entity/understanding-and-improving-failure-tolerant-training-for-deep-learning-recommendation-with-partial-recovery
