# Intern VL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks

> Research article (2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024) · cited 393× · AI/ML

**Wikidata**: [openalex:W4402713111](https://www.wikidata.org/wiki/openalex:W4402713111)  
**Source**: https://4ort.xyz/entity/intern-vl-scaling-up-vision-foundation-models-and-aligning-for-generic-visual-linguistic-tasks
