# Layer-wise enhanced transformer with multi-modal fusion for image caption

> Research article (Multimedia Systems, 2022) · cited 12× · AI/ML

**Wikidata**: [openalex:W4313422350](https://www.wikidata.org/wiki/openalex:W4313422350)  
**Source**: https://4ort.xyz/entity/layer-wise-enhanced-transformer-with-multi-modal-fusion-for-image-caption
