Publications

Full publication list on [Google Scholar](https://scholar.google.com/citations?user=BUJPCegAAAAJ&hl=en)

2024

  1. RAG-Driver: Generalisable Driving Explanations with Retrieval-Augmented In-Context Learning in Multi-Modal Large Language Model
    Jianhao Yuan, Shuyang Sun, Daniel Omeiza, and 4 more authors
    RSS, 2024
  2. Not Just Pretty Pictures: Toward Interventional Data Augmentation Using Text-to-Image Generators
    Jianhao Yuan, Francesco Pinto, Adam Davies, and 1 more author
    ICML, 2024
  3. Real-Fake: Effective Training Data Synthesis Through Distribution Matching
    Jianhao Yuan, Jie Zhang, Shuyang Sun, and 2 more authors
    ICLR, 2024
  4. Efficient Multimodal Learning from Data-centric Perspective
    Muyang He, Yexin Liu, Boya Wu, and 4 more authors
    arXiv preprint arXiv:2402.11530, 2024
  5. kNN-CLIP: Retrieval Enables Training-Free Segmentation on Continually Expanding Large Vocabularies
    Zhongrui Gui, Shuyang Sun, Runjia Li, and 5 more authors
    arXiv preprint arXiv:2404.09447, 2024
  6. SynArtifact: Classifying and Alleviating Artifacts in Synthetic Images via Vision-Language Model
    Bin Cao, Jianhao Yuan, Yexin Liu, and 4 more authors
    arXiv preprint arXiv:2402.18068, 2024
  7. SpatialBot: Precise Spatial Understanding with Vision Language Models
    Wenxiao Cai, Yaroslav Ponomarenko, Jianhao Yuan, and 4 more authors
    arXiv preprint arXiv:2406.13642, 2024

2023

  1. Off the Radar: Uncertainty-Aware Radar Place Recognition with Introspective Querying and Map Maintenance
    Jianhao Yuan, Paul Newman, and Matthew Gadd
    IROS, 2023