Publications | Jianhao Yuan

Full publication list is available on Google Scholar.

ICLR

LikePhys: Evaluating Intuitive Physics Understanding in Video Diffusion Models via Likelihood Preference

Jianhao Yuan, Fabio Pizzati, Francesco Pinto, and 5 more authors

ICLR, 2026

arXiv Code Website
CVPR

Inference-time Physics Alignment of Video Generative Models with Latent World Models

Jianhao Yuan, Xiaofeng Zhang, Felix Friedrich, and 7 more authors

CVPR, 2026

arXiv
ICRA

SpatialBot: Precise Spatial Understanding with Vision Language Models

Wenxiao Cai, Yaroslav Ponomarenko, Jianhao Yuan, and 4 more authors

ICRA, 2025

PDF
ICML

Not Just Pretty Pictures: Toward Interventional Data Augmentation Using Text-to-Image Generators

Jianhao Yuan, Francesco Pinto, Adam Davies, and 1 more author

ICML, 2024

arXiv PDF Code Website
ICLR

Real-Fake: Effective Training Data Synthesis Through Distribution Matching

Jianhao Yuan, Jie Zhang, Shuyang Sun, and 2 more authors

ICLR, 2024

arXiv PDF Code Website
RSS

RAG-Driver: Generalisable Driving Explanations with Retrieval-Augmented In-Context Learning in Multi-Modal Large Language Model

Jianhao Yuan, Shuyang Sun, Daniel Omeiza, and 4 more authors

RSS, 2024

arXiv Code Website
NeurIPS

Hidden in Plain Sight: Evaluating Abstract Shape Recognition in Vision-Language Models

Arshia Hemmat, Adam Davies, Tom A. Lamb, and 4 more authors

NeurIPS, 2024

PDF Website
arXiv

Efficient multimodal learning from data-centric perspective

Muyang He, Yexin Liu, Boya Wu, and 4 more authors

arXiv preprint, 2024

PDF
TMLR

kNN-CLIP: Retrieval Enables Training-Free Segmentation on Continually Expanding Large Vocabularies

Zhongrui Gui, Shuyang Sun, Runjia Li, and 5 more authors

TMLR, 2024

PDF
arXiv

SynArtifact: Classifying and Alleviating Artifacts in Synthetic Images via Vision-Language Model

Bin Cao, Jianhao Yuan, Yexin Liu, and 4 more authors

arXiv preprint, 2024

PDF
IROS

Off the Radar: Uncertainty-Aware Radar Place Recognition with Introspective Querying and Map Maintenance

Jianhao Yuan, Paul Newman, and Matthew Gadd

IROS, 2023

arXiv PDF