# SAM-CLIP: Merging Vision Foundation Models towards Semantic and Spatial Understanding

> Research article (2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 2024) · cited 82× · AI/ML

**Wikidata**: [openalex:W4402915908](https://www.wikidata.org/wiki/openalex:W4402915908)  
**Source**: https://4ort.xyz/entity/sam-clip-merging-vision-foundation-models-towards-semantic-and-spatial-understanding