Hello World!
Hi, I am a M.Sc. student at
Wuhan University,
working at the
State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing (LIESMARS),
under the guidance of
Prof. Xianwei Zheng.
My research focuses on embodied intelligence and 3D visual perception,
with an emphasis on how spatial representations support generalizable decision-making and agent-centric policy learning.
My current work lies at the intersection of computer vision, reinforcement learning, and generative modeling, where I study how 2D and 3D representations can be unified to enable robust perception-action coupling. I am particularly interested in structure-aware visual representations that support cross-view understanding, generalization across environments, and interaction-driven learning in embodied settings.
Previously, I explored generative AI for image and video synthesis during my internship at
Baidu.
I am currently a remote research assistant at the
Centre for Frontier AI Research (CFAR), Agency for Science, Technology and Research (A*STAR),
supervised by
Dr. Xingrui Yu,
where I work on generalizable reinforcement learning for embodied agents,
with a focus on agent-centric formulations and transferable policies grounded in implicit spatial representations that support generalization across tasks and scenes.
More broadly, my goal is to develop spatially grounded learning frameworks that bridge perception, geometry, and control, advancing the next generation of embodied systems that can reason about and act within complex real-world environments.
News
- Mar 2026
- Jan 2026
- Nov 2025 Successfully defended my Master's thesis proposal.
- Aug 2025 The first paper accepted by the ISPRS Journal of Photogrammetry and Remote Sensing. [Paper]
- Aug 2025 Began a remote research internship at CFAR, A*STAR, supervised by Dr. Xingrui Yu and in collaboration with Zhenglin Wan.
- Jul 2025 Attended the 2025 Annual Academic Conference on Photogrammetry and Remote Sensing, CSGPC in Kunming, China.
- Dec 2024 Began a research internship at Baidu in Shenzhen, supervised by Dr. Yan Zhang, exploring frontier text-to-image and text-to-video generation.
- Jul 2024 Began collaboration on the SCoDe project under the guidance of Dr. Zimin Xia.
- Sep 2023 Enrolled in the Master's program at the State Key Lab. LIESMARS, Wuhan University, as a recommended exemption student, under the supervision of Prof. Xianwei Zheng.
- ...
Experiences
Wuhan University
School of Remote Sensing and Information Engineering
State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing (LIESMARS)
International Technology R&D Department, Baidu, Inc.
Centre for Frontier AI Research (CFAR),Institute of High Performance Computing (IHPC), Agency for Science, Technology and Research (A*STAR)
Acknowledgements:
I’m grateful to my collaborators and mentors for their guidance and support, especially
Prof. Xianwei Zheng , Prof. Hanjiang Xiong , Dr. Xingrui Yu (A*STAR) , Dr. Zimin Xia (EPFL) , Dr. Yan Zhang (Baidu) ,
and my colleagues/peers including
Zhenglin Wan (NUS) , Jiashen Huang (NTU) , Ziqong Lu (HKU) , Qiyuan Ma , Jintao Zhang , Chenyu Zhao , He Chen
and others I’ve had the pleasure to work with.
Selected Publications
VIEW ALLSleeperVLA: Towards Backdoor-Based Ownership Verification for Vision-Language-Action Models
Ming Sun , Rui Wang , Xingrui Yu* , Lihua Jing , Hangyu Du , Zhenglin Wan , Xu Pan , Ivor Tsang
Projects
SA-VLA
2026A research project on robust RL adaptation of flow-matching–based VLA models for robotic manipulation, focusing on generalization under distribution shifts in challenging benchmarks.
Co-visibility Guided Image Matching
2025A research project on robust image matching in robot vision, photogrammetry and remote sensing, using explicit co-visibility modeling to handle extreme scale and viewpoint variations.
GNDAS
2022The GNDASystem (Global Natural Disaster Assessment System) is a web-based geographic information system application designed for the analysis and assessment of natural disasters.
I2RSI
2022The I2RSI System (Intelligent Interpretation of Remote Sensing Images) is a web-based application for remote sensing image interpretation, powered by the Baidu PaddlePaddle deep learning framework.
Academic Service
| Member | ISPRS Student Consortium (ISPRS SC) | 2024 | |
| Volunteer | 2023 International Graduate Workshop on Geo-Informatics (IGWG'23) | 2023 | |
| Volunteer | 2022 International Graduate Workshop on Geo-Informatics (IGWG'22) | 2022 |
Personal Philosophy
I follow Stoic philosophy. Life is a joyful ascent: a true mountaineer delights in the climb itself, not just the summit.
Thou sufferest this justly: for thou choosest rather to become good to-morrow than to be good to-day.
I also resonate with the spirit of Slow Science.
We live in an age tyrannized by efficiency, outcomes, and speed, to the point that nothing lasts and nothing leaves a deep impression. In the midst of noisy bubbles and short-lived hype, I hope to take time to think carefully, to doubt, to refine, and to do research that is genuinely meaningful and worth remembering.