# Just-In-Time Checkpointing: Low Cost Error Recovery from Deep Learning Training Failures

> Research article (Proceedings of the Nineteenth European Conference on Computer Systems, 2024) · cited 16× · AI/ML

**Wikidata**: [openalex:W4394923484](https://www.wikidata.org/wiki/openalex:W4394923484)  
**Source**: https://4ort.xyz/entity/just-in-time-checkpointing-low-cost-error-recovery-from-deep-learning-training-failures
