Shifang Zhao

I'm a master student in the Institute of Information Science at the Beijing Jiaotong University(BJTU), supervised by Prof. Yunchao Wei. Previously, I received the B. Sc. degree in Automation Engineering from Beijing University of Technology(BJUT) in 2023.

Email  /  CV  /  Scholar  /  Wechat  /  Zhihu

profile photo

🎓 I am actively seeking PhD opportunities for Fall 2026! 🎓

Research

I'm interested in Multimodal Large Language Model, Image Generation and AI4Medical. My research is about inspect and understanding the visual information.

OmniAD: Detect and Understand Industrial Anomaly via Multimodal Reasoning
Shifang Zhao, Yiheng Lin, Lu Han, Yao Zhao, Yunchao Wei
arXiv, 2025

An approach for enhancing MLLMs in anomaly detection and understanding by unified perception and reasoning.

AlignGen: Boosting Personalized Image Generation with Cross-Modality Prior Alignment
Yiheng Lin*, Shifang Zhao*, Ting Liu, Xiaochao Qu, Luoqi Liu, Yao Zhao, Yunchao Wei
arXiv, 2025

Bridging the gap between the textual and visual priors for robust zero-shot personalized image generation.

Rethinking Data Imbalance in Class Incremental Surgical Instrument Segmentation
Shifang Zhao*, Long Bai*, Kun Yuan, Feng Li, Jieming Yu, Wenzhen Donga, Guankun Wang, Mobarakol Islam, Nicolas Padoy, Nassir Navab, Hongliang Ren
Medical Image Analysis, 2025

A plug-and-play framework for addressing data imbalance in class incremental learning.

WIDE: Make Railway Surveillance Anomaly Detection Right
Shifang Zhao, Chao Ma, Shuai Su, Xianhong Meng, Yao Zhao, Yunchao Wei
IEEE Trans. Intell. Transp. Syst. (In Revision), 2025

The temporal correlations can be used to detect open-set anomalies in open-ended nature of surveillance.

Miscellanea

Intern

I was a Research Intern at CUHK with mentor Long Bai.

Feel free to steal this website's source code. Do not scrape the HTML from this page itself, as it includes analytics tags that you do not want on your own website — use the github code instead. Also, consider using Leonid Keselman's Jekyll fork of this page.