I am now pursuing the PhD degree at the School of Computing and Data Science, the University of Hong Kong. My research interests focus on embodied intelligence, with an emphasis on robotic manipulation, navigation, and 3D visual perception. Before that, I received the Master degree from Tsinghua University and the Bachelor degree from Huazhong University of Science and Technology. I have published more than 10 first-authored papers in top journals and conferences, and also served as a reviewer for top journals and conferences, e.g., TPAMI, IJCV, CVPR.
I am interested in developing outstanding embodied artificial intelligence systems and vision perception models.
Journal Papers:
Towards Unified 3D Object Detection via Algorithm and Data Unification Zhuoling Li, Xiaogang Xu, SerNam Lim, Hengshuang Zhao. (First author)
2025 | IEEE Transactions on Pattern Analysis and Machine Intelligence | paper
GroupLane: End-to-End 3D Lane Detection with Channel-wise Grouping Zhuoling Li, Chunrui Han, Zheng Ge, Jinrong Yang, En Yu, Haoqian Wang, Hengshuang Zhao, Xiangyu Zhang. (First author)
2024 | IEEE Robotics and Automation Letters | paper
Efficient Few-shot Classification via Contrastive Pre-training on Web Data Zhuoling Li, Haohan Wang, Tymoteusz Swistek, En Yu, Haoqian Wang. (First author)
2022 | IEEE Transactions on Artificial Intelligence | paper
Relationtrack: Relation-aware Multiple Object Tracking with Decoupled Representation
En Yu*, Zhuoling Li*, Shoudong Han, Hongwei Wang. (Co-First author)
2022 | IEEE Transactions on Multimedia | paper
Few-shot Steel Surface Defect Detection
Haohan Wang*, Zhuoling Li*, Haoqian Wang. (Co-First author)
2021 | IEEE Transactions on Instrumentation and Measurement | paper
Deep Learning based Densely Connected Network for Load Forecasting Zhuoling Li, Yuanzheng Li, Yun Liu, Ping Wang, Renzhi Liu, Hoay Beng Gooi. (First author)
2020 | IEEE Transactions on Power Systems | paper
CLU-CNNs: Object detection for medical images Zhuoling Li, Minghui Dong, Shiping Wen, Xiang Hu, Pan Zhou, Zhigang Zeng. (First author)
2019 | Neurocomputing | paper
| Highly Cited Paper
LARM: Large Auto-Regressive Model for Long-Horizon Embodied Intelligence Zhuoling Li, Xiaogang Xu, Zhenhua Xu, SerNam Lim, Hengshuang Zhao. (First author)
2025 | ICML 2025 | paper
DriveGPT4-V2: Harnessing Large Language Model Capabilities for Enhanced Closed-Loop Autonomous Driving
Zhenhua Xu, Yan Bai, Yujia Zhang, Zhuoling Li, Fei Xia, Kwan-Yee K. Wong, Jianqiang Wang, Hengshuang Zhao
2025 | CVPR 2025 | paper
Transplat: Generalizable 3d gaussian splatting from sparse multi-view images with transformers
Chuanrui Zhang, Yingshuang Zou, Zhuoling Li, Minmin Yi, Haoqian Wang.
2024 | AAAI 2025 | paper
Motrv3: Release-fetch supervision for end-to-end multi-object tracking
En Yu, Tiancai Wang, Zhuoling Li, Yuang Zhang, Xiangyu Zhang, Wenbing Tao
2023 | Arxiv | paper
VoxelFormer: Bird's-Eye-View Feature Generation based on Dual-view Attention for Multi-view 3D Object Detection Zhuoling Li, Chuanrui Zhang, Wei-Chiu Ma, Yipin Zhou, Linyan Huang, Haoqian Wang, SerNam Lim, Hengshuang Zhao (First author)
2023 | Arxiv | paper
Delving into the Pre-training Paradigm of Monocular 3D Object Detection Zhuoling Li, Chuanrui Zhang, En Yu, Haoqian Wang. (First author)
2022 | Arxiv | paper
Award
I won the first place in CVPR2023 Autonomous Driving Challenge, 2023.
I won the second award in National Undergraduate Electronics Design Contest, 2018.
I won the second award in China Intelligent Manufacturing Challenge, 2017.
Activity
I gave a talk about 3D visual perception on TalkBeat, 2024.
I hosted a workshop MFM-EAI about multi-modal foundation model and embodied AI in ICML2024, 2024.