I am currently pursuing a Ph.D. degree in Computer Science at Wuhan University (WHU) under the supervision of Prof. Lefei Zhang, and expect to graduate in 2027. During this period, I am also fortunate to receive valuable guidance from Dr. Sen Zhang, Dr. Liang Ding, and Prof. Dacheng Tao. Before that, I received my bachelor’s degree in Mathematical Sciences at the University of Electronic Science and Technology of China (UESTC) in 2022, under the supervision of Prof. Xile Zhao. I am sincerely grateful to all my mentors for their invaluable guidance, support, and inspiration throughout my academic journey.

I have authored or co-authored over 10 papers published in top-tier international journals and conferences, including IEEE TPAMI, ICML, NeurIPS, CVPR, ICCV, and EMNLP. In addition, I am supported by the Fundamental Research Project for Young Professional from NSFC (国自然博士生基金, PR). I have also received several honors, including the China Remote Sensing Outstanding Achievement First-Class Prize (2025 中国遥感优秀成果一等奖, PR) and the VALSE 2025 Popular Poster Award (VALSE 2025人气海报奖, PR).

My current research interests mainly focus on Reinforcement Learning (RL) for LLMs, including Reinforcement Learning from Human Feedback (RLHF), reasoning RL, and agentic RL. Previously, my research primarily focused on tensor modeling and computing, high-dimensional image processing, and remote sensing.

📧 I am open to collaboration and welcome inquiries from anyone interested in my research topics. Feel free to reach out via szmyc1@163.com or miaoyuchun@whu.edu.cn.

🔥 News

2025.12: 🎉🎉 I was supported by the Fundamental Research Project for Young Professional from NSFC (国家自然科学基金博士生专项).
2025.11: 🎉🎉 HyperSIGMA was selected for the China Remote Sensing Outstanding Achievement First-Class Prize (2025 中国遥感优秀成果一等奖).
2025.11: 🎉🎉 HyperSIGMA has been selected as ESI Highly Cited Papers (TOP 1%).
2025.10: 🎉🎉 One paper has been accepted by IEEE GRSL.
2025.08: 🎉🎉 One paper has been accepted by EMNLP 2025.
2025.06: 🎉🎉 HyperSIGMA received the VALSE 2025 Popular Poster Award (VALSE 2025人气海报奖).
2025.04: 🎉🎉 One paper has been accepted by ICML 2025.
2025.03: 🎉🎉 One paper has been accepted by IEEE TPAMI.
2024.09: 🎉🎉 One paper has been accepted by NeurIPS 2024.
2024.01: 🎉🎉 One paper has been accepted by IEEE TCI.
2023.07: 🎉🎉 One paper has been accepted by ICCV 2023.
2023.02: 🎉🎉 One paper has been accepted by CVPR 2023.
2021.08: 🎉🎉 One paper has been accepted by IEEE TGRS.

📝 Publications

†: equal contribution, * : corresponding author

Conference Papers

The Energy Loss Phenomenon in RLHF: A New Perspective on Mitigating Reward Hacking
Yuchun Miao, Sen Zhang, Liang Ding, Yuqi Zhang, Lefei Zhang, Dacheng Tao
International Conference on Machine Learning (ICML), 2025
[Paper][Code]
InfoRM: Mitigating Reward Hacking in RLHF via Information-Theoretic Reward Modeling
Yuchun Miao, Sen Zhang, Liang Ding, Rong Bao, Lefei Zhang, Dacheng Tao
Conference on Neural Information Processing System (NeurIPS), 2024
[Paper][Code]
DDS2M: Self-Supervised Denoising Diffusion Spatio-Spectral Model for Hyperspectral Image Restoration
Yuchun Miao, Lefei Zhang, Liangpei Zhang, Dacheng Tao
IEEE/CVF International Conference on Computer Vision (ICCV), 2023
[Paper][Code]
AMIA: Automatic Masking and Joint Intention Analysis Makes LVLMs Robust Jailbreak Defenders
Yuqi Zhang, Yuchun Miao, Zuchao Li, Liang Ding
Findings of Empirical Methods in Natural Language Processing (EMNLP), 2025
[Paper][Code]
Uncertainty-Aware Unsupervised Image Deblurring with Deep Residual Prior
Xiaole Tang, Xile Zhao, Jun Liu, Jianli Wang, Yuchun Miao, Tieyong Zeng
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023
[Paper][Code]

Journal Articles

HyperSIGMA: Hyperspectral Intelligence Comprehension Foundation Model
Di Wang^†, Meiqi Hu^†, Yao Jin^†, Yuchun Miao^†, Jiaqi Yang^†, Yichu Xu^†, Xiaolei Qin^†, Jiaqi Ma^†, Lingyu Sun^†, Chenxing Li, Chuan Fu, Hongruixuan Chen, Chengxi Han, Naoto Yokoya, Jing Zhang, Minqiang Xu, Lin Liu, Lefei Zhang, Chen Wu, Bo Du, Dacheng Tao, Liangpei Zhang.
IEEE Transactions on Pattern Analysis and Machine Intelligence (IEEE TPAMI), 2025
[Paper][Code]
Snapshot Compressive Imaging Using Domain-Factorized Deep Video Prior
Yuchun Miao, Xile Zhao, Jianli Wang, Xiao Fu, Yao Wang
IEEE Transactions on Computational Imaging (IEEE TCI), 2024
[Paper][Code]
Hyperspectral Denoising Using Unsupervised Disentangled Spatiospectral Deep Priors
Yuchun Miao, Xile Zhao, Xiao Fu, Jianli Wang, Yubang Zheng
IEEE Transactions on Geoscience and Remote Sensing (IEEE TGRS), 2022
[Paper][Code]
PHDMamba: Progressive Hybrid Mamba for Hyperspectral Image Classification
Yichu Xu, Chengxi Han, Shi Chen, Yao Jin, Yuchun Miao, Haonan Guo, Di Wang
IEEE Geoscience and Remote Sensing Letters (IEEE GRSL), 2025
[Paper][Code]
Complex Video Completion Fusing Low-Rank Background and Deep Foreground Priors
Jianli Wang, Tingzhu Huang, Xile Zhao, Yuchun Miao
IEEE Signal Processing Letters (IEEE SPL), 2022
[Paper][Code]

Pre-prints

Information-Theoretic Reward Modeling for Stable RLHF: Detecting and Mitigating Reward Hacking
Yuchun Miao, Liang Ding, Sen Zhang, Rong Bao, Lefei Zhang, Dacheng Tao
Submitted to IEEE Transactions on Pattern Analysis and Machine Intelligence (IEEE TPAMI)
[Paper][Code]

🎖 Honors and Awards

Fundamental Research Project for Young Professional from NSFC, 2025 December
(主持国家自然科学基金青年学生基础研究项目（博士研究生), Link)
China Remote Sensing Outstanding Achievement First-Class Prize, 2025 December
(中国遥感优秀成果一等奖, Link)
National Scholarship, Ministry of Education of China, 2025 October
(国家奖学金, Top 2% in Wuhan University)
VALSE Popular Poster Award, 2025 June
(VALSE 人气海报奖, 11/398 , Link)
National Scholarship, Ministry of Education of China, 2024 October
(国家奖学金, Top 2% in Wuhan University)

📖 Educations

Wuhan University, September 2022 – Present
Ph.D. Student in the School of Computer Science, Wuhan, China.
Supervised by Lefei Zhang.
University of Electronic Science and Technology of China, September 2018 – July 2022
Undergraduate Student in the School of Mathematical Science, Chengdu, China.
Supervised by Xile Zhao.

💻 Internships

LongCat Team (Agent), Meituan, January 2026 – Present
Research Intern on Agentic RL (Beidou Program), advised by Qi Gu.
A Generative AI Research Startup, April 2023 – October 2025
Research Intern on LLM Alignment, advised by Liang Ding.

Yuchun Miao (苗雨春)