# Multi-Modal Perception Attention Network with Self-Supervised Learning for Audio-Visual Speaker Tracking

> Research article (Proceedings of the AAAI Conference on Artificial Intelligence, 2022) · cited 28× · AI/ML

**Wikidata**: [openalex:W4200633562](https://www.wikidata.org/wiki/openalex:W4200633562)  
**Source**: https://4ort.xyz/entity/multi-modal-perception-attention-network-with-self-supervised-learning-for-audio-visual-speaker-tracking
