# Multi-Granularity Aggregation Transformer for Joint Video-Audio-Text Representation Learning

> Research article (IEEE Transactions on Circuits and Systems for Video Technology, 2022) · cited 11× · AI/ML

**Wikidata**: [openalex:W4312998375](https://www.wikidata.org/wiki/openalex:W4312998375)  
**Source**: https://4ort.xyz/entity/multi-granularity-aggregation-transformer-for-joint-video-audio-text-representation-learning
