# Cross-Modal Transformer-Based Streaming Dense Video Captioning with Neural ODE Temporal Localization

> Research article (Sensors, 2025) · cited 17× · AI/ML

**Wikidata**: [openalex:W4406800849](https://www.wikidata.org/wiki/openalex:W4406800849)  
**Source**: https://4ort.xyz/entity/cross-modal-transformer-based-streaming-dense-video-captioning-with-neural-ode-temporal-localization
