Full publication list is available on Google Scholar.
2024
-
RAG-Driver: Generalisable Driving Explanations with Retrieval-Augmented In-Context Learning in Multi-Modal Large Language Model
Jianhao Yuan, Shuyang Sun, Daniel Omeiza, and 4 more authors
RSS, 2024
-
Not Just Pretty Pictures: Toward Interventional Data Augmentation Using Text-to-Image Generators
Jianhao Yuan, Francesco Pinto, Adam Davies, and 1 more author
ICML, 2024
-
Real-Fake: Effective Training Data Synthesis Through Distribution Matching
Jianhao Yuan, Jie Zhang, Shuyang Sun, and 2 more authors
ICLR, 2024
-
Efficient multimodal learning from data-centric perspective
Muyang He, Yexin Liu, Boya Wu, and 4 more authors
arXiv preprint arXiv:2402.11530, 2024
-
kNN-CLIP: Retrieval Enables Training-Free Segmentation on Continually Expanding Large Vocabularies
Zhongrui Gui, Shuyang Sun, Runjia Li, and 5 more authors
arXiv preprint arXiv:2404.09447, 2024
-
SynArtifact: Classifying and Alleviating Artifacts in Synthetic Images via Vision-Language Model
Bin Cao, Jianhao Yuan, Yexin Liu, and 4 more authors
arXiv preprint arXiv:2402.18068, 2024
-
SpatialBot: Precise Spatial Understanding with Vision Language Models
Wenxiao Cai, Yaroslav Ponomarenko, Jianhao Yuan, and 4 more authors
arXiv preprint arXiv:2406.13642, 2024
2023
-
Off the Radar: Uncertainty-Aware Radar Place Recognition with Introspective Querying and Map Maintenance
Jianhao Yuan, Paul Newman, and Matthew Gadd
IROS, 2023