Publications
Full Publication list in [Google Scholar](https://scholar.google.com/citations?user=BUJPCegAAAAJ&hl=en)
2024
- Efficient multimodal learning from data-centric perspectivearXiv preprint arXiv:2402.11530, 2024
- kNN-CLIP: Retrieval Enables Training-Free Segmentation on Continually Expanding Large VocabulariesarXiv preprint arXiv:2404.09447, 2024
- SynArtifact: Classifying and Alleviating Artifacts in Synthetic Images via Vision-Language ModelarXiv preprint arXiv:2402.18068, 2024
- SpatialBot: Precise Spatial Understanding with Vision Language ModelsarXiv preprint arXiv:2406.13642, 2024