# CLIP-Guided Vision-Language Pre-training for Question Answering in 3D Scenes

> Research article (2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW) · cited 41× · AI/ML

**Wikidata**: [openalex:W4385804899](https://www.wikidata.org/wiki/openalex:W4385804899)  
**Source**: https://4ort.xyz/entity/clip-guided-vision-language-pre-training-for-question-answering-in-3d-scenes
