# ViSTA: Vision and Scene Text Aggregation for Cross-Modal Retrieval

> Research article (2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022) · cited 87× · AI/ML

**Wikidata**: [openalex:W4313178921](https://www.wikidata.org/wiki/openalex:W4313178921)  
**Source**: https://4ort.xyz/entity/vista-vision-and-scene-text-aggregation-for-cross-modal-retrieval
