# End-to-end Generative Pretraining for Multimodal Video Captioning

> Research article (2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022) · cited 153× · AI/ML

**Wikidata**: [openalex:W4312463400](https://www.wikidata.org/wiki/openalex:W4312463400)  
**Source**: https://4ort.xyz/entity/end-to-end-generative-pretraining-for-multimodal-video-captioning
