# Surveillance Video-and-Language Understanding: From Small to Large Multimodal Models

> Research article (IEEE Transactions on Circuits and Systems for Video Technology, 2024) · cited 13× · AI/ML

**Wikidata**: [openalex:W4402557613](https://www.wikidata.org/wiki/openalex:W4402557613)  
**Source**: https://4ort.xyz/entity/surveillance-video-and-language-understanding-from-small-to-large-multimodal-models
