Publications

Full publication list is available on Google Scholar.

  1. LikePhys: Evaluating Intuitive Physics Understanding in Video Diffusion Models via Likelihood Preference
    Jianhao Yuan, Fabio Pizzati, Francesco Pinto, and 5 more authors
    ICLR, 2026
  2. Inference-time Physics Alignment of Video Generative Models with Latent World Models
    Jianhao Yuan, Xiaofeng Zhang, Felix Friedrich, and 7 more authors
    CVPR, 2026
  3. SpatialBot: Precise Spatial Understanding with Vision Language Models
    Wenxiao Cai, Yaroslav Ponomarenko, Jianhao Yuan, and 4 more authors
    ICRA, 2025
  4. RAG-Driver: Generalisable Driving Explanations with Retrieval-Augmented In-Context Learning in Multi-Modal Large Language Model
    Jianhao Yuan, Shuyang Sun, Daniel Omeiza, and 4 more authors
    RSS, 2024
  5. Not Just Pretty Pictures: Toward Interventional Data Augmentation Using Text-to-Image Generators
    Jianhao Yuan, Francesco Pinto, Adam Davies, and 1 more author
    ICML, 2024
  6. Real-Fake: Effective Training Data Synthesis Through Distribution Matching
    Jianhao Yuan, Jie Zhang, Shuyang Sun, and 2 more authors
    ICLR, 2024
  7. Hidden in Plain Sight: Evaluating Abstract Shape Recognition in Vision-Language Models
    Arshia Hemmat, Adam Davies, Tom A. Lamb, and 4 more authors
    NeurIPS, 2024
  8. Efficient multimodal learning from data-centric perspective
    Muyang He, Yexin Liu, Boya Wu, and 4 more authors
    arXiv preprint, 2024
  9. kNN-CLIP: Retrieval Enables Training-Free Segmentation on Continually Expanding Large Vocabularies
    Zhongrui Gui, Shuyang Sun, Runjia Li, and 5 more authors
    TMLR, 2024
  10. SynArtifact: Classifying and Alleviating Artifacts in Synthetic Images via Vision-Language Model
    Bin Cao, Jianhao Yuan, Yexin Liu, and 4 more authors
    arXiv preprint, 2024
  11. Off the Radar: Uncertainty-Aware Radar Place Recognition with Introspective Querying and Map Maintenance
    Jianhao Yuan, Paul Newman, and Matthew Gadd
    IROS, 2023