Shifang Zhao

I'm a master student in the Institute of Information Science at the Beijing Jiaotong University(BJTU), supervised by Prof. Yunchao Wei. Previously, I received the B. Sc. degree in Automation Engineering from Beijing University of Technology(BJUT) in 2023.

Email  /  CV  /  Scholar  /  Wechat  /  Zhihu

profile photo

🎓 I am actively seeking PhD opportunities for Fall 2026! 🎓

Research

I'm interested in Multimodal Large Language Model, Image Generation and AI4Medical. My research is about inspect and understanding the visual information.

This was once a passive interest shaped by the trend. After watching Saining Xie's talk, I clarified a long-term research direction:

Create to inspire creation.

I currently focus on building agentic systems that help people express ideas through existing tools faster and more clearly. Instead of asking users to master fragmented software stacks, my goal is to let them state intent in natural language and have agents plan, orchestrate, and execute the right toolchain.

OmniAD: Detect and Understand Industrial Anomaly via Multimodal Reasoning
Shifang Zhao, Yiheng Lin, Lu Han, Yao Zhao, Yunchao Wei
arXiv, 2025

An approach for enhancing MLLMs in anomaly detection and understanding by unified perception and reasoning.

AlignGen: Boosting Personalized Image Generation with Cross-Modality Prior Alignment
Yiheng Lin*, Shifang Zhao*, Ting Liu, Xiaochao Qu, Luoqi Liu, Yao Zhao, Yunchao Wei
arXiv, 2025

Bridging the gap between the textual and visual priors for robust zero-shot personalized image generation.

Rethinking Data Imbalance in Class Incremental Surgical Instrument Segmentation
Shifang Zhao*, Long Bai*, Kun Yuan, Feng Li, Jieming Yu, Wenzhen Donga, Guankun Wang, Mobarakol Islam, Nicolas Padoy, Nassir Navab, Hongliang Ren
Medical Image Analysis, 2025

A plug-and-play framework for addressing data imbalance in class incremental learning.

WIDE: Make Railway Surveillance Anomaly Detection Right
Shifang Zhao, Chao Ma, Shuai Su, Xianhong Meng, Yao Zhao, Yunchao Wei
IEEE Trans. Intell. Transp. Syst. (In Revision), 2025

The temporal correlations can be used to detect open-set anomalies in open-ended nature of surveillance.

Experience

Intern

I was a Research Intern at CUHK with mentor Long Bai.

Visit

I was a Visiting Student at Great Bay University with mentor Xiaodong Cun.

About Me More

Basketball

I really enjoy playing basketball with my close friends, and I organized the EBL (Everyone's Basketball League). The EBL post on Xiaohongshu received 1,777 likes and saves: EBL 篮球联赛🏀.

Cinema

I am a big fan of films. My favorite director is Stanley Kubrick, and 2001: A Space Odyssey has given me a lot of inspiration.

Thanks to GPT-5.3-Codex