Zhuoling
profile photo

Zhuoling Li

I am now pursuing the PhD degree at the School of Computing and Data Science, the University of Hong Kong. My research interests focus on embodied intelligence, with an emphasis on robotic manipulation, navigation, and 3D visual perception. Before that, I received the Master degree from Tsinghua University and the Bachelor degree from Huazhong University of Science and Technology. I have published more than 10 first-authored papers in top journals and conferences, and also served as a reviewer for top journals and conferences, e.g., TPAMI, IJCV, CVPR.

Email  /  Google Scholar

Experience

  • 09/2016~06/2020, I was pursuing the Bachelor degree in Huazhong University of Science and Technology. During this period, I was advised academically by Prof. Shiping Wen and Prof. Yuanzheng Li.
  • 03/2020~06/2020, I was a full-time research intern at the Knowledge Computing Group, Microsoft Research Asia.
  • 09/2020~06/2023, I was pursuing the Master degree in Tsinghua University. During this period, I was advised academically by Prof. Haoqian Wang.
  • 05/2021~11/2021, I was a full-time research intern at the Autonomous Driving Group, Huawei Noah's Ark Lab.
  • 12/2022~07/2023, I was a full-time research intern at the Base Model Group, MEGVII Technology.
  • 05/2024~06/2025, I was a full-time research intern at CVTE.
  • 09/2023~Now, I am pursuing the PhD degree in the University of Hong Kong. During this period, I was advised academically by Prof. Hengshuang Zhao.
  • Publication

    I am interested in developing outstanding embodied artificial intelligence systems and vision perception models.

    Journal Papers:

    1. Towards Unified 3D Object Detection via Algorithm and Data Unification
      Zhuoling Li, Xiaogang Xu, SerNam Lim, Hengshuang Zhao. (First author)
      2025 | IEEE Transactions on Pattern Analysis and Machine Intelligence | paper

    2. GroupLane: End-to-End 3D Lane Detection with Channel-wise Grouping
      Zhuoling Li, Chunrui Han, Zheng Ge, Jinrong Yang, En Yu, Haoqian Wang, Hengshuang Zhao, Xiangyu Zhang. (First author)
      2024 | IEEE Robotics and Automation Letters | paper

    3. Efficient Few-shot Classification via Contrastive Pre-training on Web Data
      Zhuoling Li, Haohan Wang, Tymoteusz Swistek, En Yu, Haoqian Wang. (First author)
      2022 | IEEE Transactions on Artificial Intelligence | paper

    4. Relationtrack: Relation-aware Multiple Object Tracking with Decoupled Representation
      En Yu*, Zhuoling Li*, Shoudong Han, Hongwei Wang. (Co-First author)
      2022 | IEEE Transactions on Multimedia | paper

    5. Few-shot Steel Surface Defect Detection
      Haohan Wang*, Zhuoling Li*, Haoqian Wang. (Co-First author)
      2021 | IEEE Transactions on Instrumentation and Measurement | paper

    6. Deep Learning based Densely Connected Network for Load Forecasting
      Zhuoling Li, Yuanzheng Li, Yun Liu, Ping Wang, Renzhi Liu, Hoay Beng Gooi. (First author)
      2020 | IEEE Transactions on Power Systems | paper

    7. CLU-CNNs: Object detection for medical images
      Zhuoling Li, Minghui Dong, Shiping Wen, Xiang Hu, Pan Zhou, Zhigang Zeng. (First author)
      2019 | Neurocomputing | paper | Highly Cited Paper

    Conference Papers:

    1. VIP: Vision Instructed Pre-training for Robotic Manipulation
      Zhuoling Li, Liangliang Ren, Jinrong Yang, Yong Zhao, Xiaoyang Wu, Zhenhua Xu, Xiang Bai, Hengshuang Zhao. (First author)
      2025 | ICML 2025 | paper

    2. LARM: Large Auto-Regressive Model for Long-Horizon Embodied Intelligence
      Zhuoling Li, Xiaogang Xu, Zhenhua Xu, SerNam Lim, Hengshuang Zhao. (First author)
      2025 | ICML 2025 | paper

    3. DriveGPT4-V2: Harnessing Large Language Model Capabilities for Enhanced Closed-Loop Autonomous Driving
      Zhenhua Xu, Yan Bai, Yujia Zhang, Zhuoling Li, Fei Xia, Kwan-Yee K. Wong, Jianqiang Wang, Hengshuang Zhao
      2025 | CVPR 2025 | paper

    4. Transplat: Generalizable 3d gaussian splatting from sparse multi-view images with transformers
      Chuanrui Zhang, Yingshuang Zou, Zhuoling Li, Minmin Yi, Haoqian Wang.
      2024 | AAAI 2025 | paper

    5. UniMODE: Unified Monocular 3D Object Detection
      Zhuoling Li, Xiaogang Xu, SerNam Lim, Hengshuang Zhao. (First author)
      2024 | CVPR 2024 | paper | Highlight

    6. The 1st-place Solution for CVPR 2023 OpenLane Topology in Autonomous Driving Challenge
      Dongming Wu, Fan Jia, Jiahao Chang, Zhuoling Li, Jianjian Sun, Chunrui Han, Yingfei Liu, Zheng Ge, Tiancai Wang.
      2023 | CVPR2023 Autonomous Driving Challenge | paper

    7. Generalizing Multiple Object Tracking to Unseen Domains by Introducing Natural Language Representation
      En Yu, Songtao Liu, Zhuoling Li, Jinrong Yang, Zeming Li, Shoudong Han, Wenbing Tao.
      2023 | AAAI 2023 | paper

    8. Diversity Matters: Fully Exploiting Depth Clues for Reliable Monocular 3D Object Detection
      Zhuoling Li*, Zhan Qu*, Yang Zhou, Jianzhuang Liu, Haoqian Wang, Lihui Jiang. (First author)
      2022 | CVPR 2022 | paper | Oral

    9. Towards Discriminative Representation: Multi-view Trajectory Contrastive Learning for Online Multi-object Tracking
      En Yu*, Zhuoling Li*, Shoudong Han. (Co-First author)
      2022 | CVPR 2022 | paper

    Preprint Papers:

    1. Train Once, Deploy Anywhere: Realize Data-Efficient Dynamic Object Manipulation
      Zhuoling Li, Xiaoyang Wu, Zhenhua Xu, Hengshuang Zhao
      2025 | Arxiv | paper

    2. Motrv3: Release-fetch supervision for end-to-end multi-object tracking
      En Yu, Tiancai Wang, Zhuoling Li, Yuang Zhang, Xiangyu Zhang, Wenbing Tao
      2023 | Arxiv | paper

    3. VoxelFormer: Bird's-Eye-View Feature Generation based on Dual-view Attention for Multi-view 3D Object Detection
      Zhuoling Li, Chuanrui Zhang, Wei-Chiu Ma, Yipin Zhou, Linyan Huang, Haoqian Wang, SerNam Lim, Hengshuang Zhao (First author)
      2023 | Arxiv | paper

    4. Delving into the Pre-training Paradigm of Monocular 3D Object Detection
      Zhuoling Li, Chuanrui Zhang, En Yu, Haoqian Wang. (First author)
      2022 | Arxiv | paper

    Award

  • I won the first place in CVPR2023 Autonomous Driving Challenge, 2023.

  • I won the second award in National Undergraduate Electronics Design Contest, 2018.

  • I won the second award in China Intelligent Manufacturing Challenge, 2017.
  • Activity

  • I gave a talk about 3D visual perception on TalkBeat, 2024.

  • I hosted a workshop MFM-EAI about multi-modal foundation model and embodied AI in ICML2024, 2024.
  • No. Visitor Since Feb 2022. Powered by w3.css