# Look, listen, and decode: Multimodal speech recognition with images

> Research article (2016 IEEE Spoken Language Technology Workshop (SLT), 2016) · cited 29× · AI/ML

**Wikidata**: [openalex:W2586850765](https://www.wikidata.org/wiki/openalex:W2586850765)  
**Source**: https://4ort.xyz/entity/look-listen-and-decode-multimodal-speech-recognition-with-images
