Zhuoling Li
I am now pursuing the PhD degree at the Computer Science Department, the University of Hong Kong, where I work on embodied intelligence and 3D visual perception, etc. Before that, I received the Bachelor degree from School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, and received the Master Degree from Shenzhen International Graduate School, Tsinghua University. I have published many papers in top conferences and journals, and also served as reviewers for several top conferences and IEEE Transactions journals, e.g., CVPR, ICCV, ECCV.
Email /
Google Scholar
News
9/2024, GroupLane is accepted for publication in IEEE Robotics and Automation Letters.
5/2024, I am hosting a workshop MFM-EAI about multi-modal foundation model and embodied AI in ICML2024. Welcome to join us.
4/2024, One of my papers is selected as a highlight of CVPR2024 (top 11.9%).
3/2024, One of my papers is accepted by CVPR2024.
6/2023, We achieve 1st place in the OpenLane Topology Challenge, which is held in conjunction with the CVPR 2023 Workshop on Endto-End Autonomous Driving and CVPR 2023 Workshop on
Vision-Centric Autonomous Driving Workshop.
Experience
09/2016~06/2020, I was pursuing the Bachelor degree in Huazhong University of Science and Technology. During this period, I was advised academically by Prof. Shiping Wen and Prof. Yuanzheng Li.
03/2020~06/2020, I was a full-time research intern at the Knowledge Computing Group, Microsoft Research Asia.
09/2020~06/2023, I was pursuing the Master degree in Tsinghua University. During this period, I was advised academically by Prof. Haoqian Wang.
05/2021~11/2021, I was a full-time research intern at the Autonomous Driving Group, Huawei Noah's Ark Lab.
12/2022~07/2023, I was a full-time research intern at the Base Model Group, MEGVII Technology.
09/2023~Now, I am pursuing the PhD degree in the University of Hong Kong. During this period, I was advised academically by Prof. Hengshuang Zhao.
05/2024~Now, I am a full-time research intern at CVTE.
Publication
I am interested in developing outstanding embodied artificial intelligence systems and vision perception models.
Journal Papers:
- GroupLane: End-to-End 3D Lane Detection with Channel-wise Grouping
Zhuoling Li, Chunrui Han, Zheng Ge, Jinrong Yang, En Yu, Haoqian Wang, Hengshuang Zhao, Xiangyu Zhang. (First author)
2024 | IEEE Robotics and Automation Letters | paper
- Efficient Few-shot Classification via Contrastive Pre-training on Web Data
Zhuoling Li, Haohan Wang, Tymoteusz Swistek, En Yu, Haoqian Wang. (First author)
2022 | IEEE Transactions on Artificial Intelligence | paper
- Relationtrack: Relation-aware Multiple Object Tracking with Decoupled Representation
En Yu*, Zhuoling Li*, Shoudong Han, Hongwei Wang. (Co-First author)
2022 | IEEE Transactions on Multimedia | paper
- Few-shot Steel Surface Defect Detection
Haohan Wang*, Zhuoling Li*, Haoqian Wang. (Co-First author)
2021 | IEEE Transactions on Instrumentation and Measurement | paper
- Deep Learning based Densely Connected Network for Load Forecasting
Zhuoling Li, Yuanzheng Li, Yun Liu, Ping Wang, Renzhi Liu, Hoay Beng Gooi. (First author)
2020 | IEEE Transactions on Power Systems | paper
- CLU-CNNs: Object detection for medical images
Zhuoling Li, Minghui Dong, Shiping Wen, Xiang Hu, Pan Zhou, Zhigang Zeng. (First author)
2019 | Neurocomputing | paper
| Highly Cited Paper
Conference Papers:
- UniMODE: Unified Monocular 3D Object Detection
Zhuoling Li, Xiaogang Xu, SerNam Lim, Hengshuang Zhao. (First author)
2024 | CVPR 2024 | paper | Highlight
- The 1st-place Solution for CVPR 2023 OpenLane Topology in Autonomous Driving Challenge
Dongming Wu, Fan Jia, Jiahao Chang, Zhuoling Li, Jianjian Sun, Chunrui Han, Yingfei Liu, Zheng Ge, Tiancai Wang.
2023 | CVPR2023 Autonomous Driving Challenge | paper
- Generalizing Multiple Object Tracking to Unseen Domains by Introducing Natural Language Representation
En Yu, Songtao Liu, Zhuoling Li, Jinrong Yang, Zeming Li, Shoudong Han, Wenbing Tao.
2023 | AAAI 2023 | paper
- Diversity Matters: Fully Exploiting Depth Clues for Reliable Monocular 3D Object Detection
Zhuoling Li*, Zhan Qu*, Yang Zhou, Jianzhuang Liu, Haoqian Wang, Lihui Jiang. (First author)
2022 | CVPR 2022 | paper | Oral
- Towards Discriminative Representation: Multi-view Trajectory Contrastive Learning for Online Multi-object Tracking
En Yu*, Zhuoling Li*, Shoudong Han. (Co-First author)
2022 | CVPR 2022 | paper
Preprint Papers:
- LARM: Large Auto-Regressive Model for Long-Horizon Embodied Intelligence
Zhuoling Listrong>, Xiaogang Xu, Zhenhua Xu, SerNam Lim, Hengshuang Zhao.
2024 | Arxiv | paper
- Motrv3: Release-fetch supervision for end-to-end multi-object tracking
En Yu, Tiancai Wang, Zhuoling Li, Yuang Zhang, Xiangyu Zhang, Wenbing Tao
2023 | Arxiv | paper
- VoxelFormer: Bird's-Eye-View Feature Generation based on Dual-view Attention for Multi-view 3D Object Detection
Zhuoling Li, Chuanrui Zhang, Wei-Chiu Ma, Yipin Zhou, Linyan Huang, Haoqian Wang, SerNam Lim, Hengshuang Zhao (First author)
2023 | Arxiv | paper
- Delving into the Pre-training Paradigm of Monocular 3D Object Detection
Zhuoling Li, Chuanrui Zhang, En Yu, Haoqian Wang. (First author)
2022 | Arxiv | paper