# Learning CLIP Guided Visual-Text Fusion Transformer for Video-based Pedestrian Attribute Recognition

> Research article (2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 2023) · cited 13× · AI/ML

**Wikidata**: [openalex:W4385815572](https://www.wikidata.org/wiki/openalex:W4385815572)  
**Source**: https://4ort.xyz/entity/learning-clip-guided-visual-text-fusion-transformer-for-video-based-pedestrian-attribute-recognition
