# SERVER: Multi-modal Speech Emotion Recognition using Transformer-based and Vision-based Embeddings

> Research article (Proceedings of the 2023 8th International Conference on Intelligent Information Technology, 2023) · cited 10× · AI/ML

**Wikidata**: [openalex:W4384209462](https://www.wikidata.org/wiki/openalex:W4384209462)  
**Source**: https://4ort.xyz/entity/server-multi-modal-speech-emotion-recognition-using-transformer-based-and-vision-based-embeddings
