# Multimodal Transformer Networks with Latent Interaction for Audio-Visual Event Localization

> Research article (2021 IEEE International Conference on Multimedia and Expo (ICME), 2021) · cited 11× · AI/ML

**Wikidata**: [openalex:W3170936177](https://www.wikidata.org/wiki/openalex:W3170936177)  
**Source**: https://4ort.xyz/entity/multimodal-transformer-networks-with-latent-interaction-for-audio-visual-event-localization
