# Multimodal Speech Emotion Recognition Using Cross Attention with Aligned Audio and Text

> Research article (Interspeech 2020, 2020) · cited 23× · AI/ML

**Wikidata**: [openalex:W3096164988](https://www.wikidata.org/wiki/openalex:W3096164988)  
**Source**: https://4ort.xyz/entity/multimodal-speech-emotion-recognition-using-cross-attention-with-aligned-audio-and-text
