# Fine-grained visual textual alignment for cross-modal retrieval using transformer encoders

> Research article (CINECA IRIS Institutional research information system (University of Pisa), 2021) · cited 140× · AI/ML

**Wikidata**: [openalex:W3213100861](https://www.wikidata.org/wiki/openalex:W3213100861)  
**Source**: https://4ort.xyz/entity/fine-grained-visual-textual-alignment-for-cross-modal-retrieval-using-transformer-encoders
