# Improving Audio Captioning Models with Fine-Grained Audio Features, Text Embedding Supervision, and LLM Mix-Up Augmentation

> Research article (ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2024) · cited 14× · AI/ML

**Wikidata**: [openalex:W4392903033](https://www.wikidata.org/wiki/openalex:W4392903033)  
**Source**: https://4ort.xyz/entity/improving-audio-captioning-models-with-fine-grained-audio-features-text-embedding-supervision-and-llm-mix-up-augmentatio
