# EnCLAP: Combining Neural Audio Codec and Audio-Text Joint Embedding for Automated Audio Captioning

> Research article (ICASSP 2024: 2024 IEEE International Conference on Acoustics, Speech and Signal Processing) · cited 17× · AI/ML

**Wikidata**: [openalex:W4392902953](https://www.wikidata.org/wiki/openalex:W4392902953)  
**Source**: https://4ort.xyz/entity/enclap-combining-neural-audio-codec-and-audio-text-joint-embedding-for-automated-audio-captioning
