# STNet: Deep Audio–Visual Fusion Network for Robust Speaker Tracking

> Research article (IEEE Transactions on Multimedia, 2024) · cited 11× · AI/ML

**Wikidata**: [openalex:W4405754159](https://www.wikidata.org/wiki/openalex:W4405754159)  
**Source**: https://4ort.xyz/entity/stnet-deep-audiovisual-fusion-network-for-robust-speaker-tracking
