# Cross-Attentional Audio-Visual Fusion for Weakly-Supervised Action Localization

> Research article (International Conference on Learning Representations, 2021) · cited 27× · AI/ML

**Wikidata**: [openalex:W3130662682](https://www.wikidata.org/wiki/openalex:W3130662682)  
**Source**: https://4ort.xyz/entity/cross-attentional-audio-visual-fusion-for-weakly-supervised-action-localization
