# End-to-End Referring Video Object Segmentation with Multimodal Transformers

> Research article (IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022) · cited 150× · AI/ML

**Wikidata**: [openalex:W3215899623](https://www.wikidata.org/wiki/openalex:W3215899623)  
**Source**: https://4ort.xyz/entity/end-to-end-referring-video-object-segmentation-with-multimodal-transformers