Dr. Wuyang LI
I am a postdoctoral fellow at AIM Group
of The
Chinese University of Hong Kong (CUHK),
supervised by Prof. Yixuan Yuan in 2024. Before
this, I
completed my PhD degree (2020-2023) at City University of Hong Kong (CityU) with
Early Graduation and my Bachelor's degree (2016-2020) at Tianjin University. I
am broadly
interested in computer vision and machine learning algorithms, and I am always willing to turn them
into products.
During my Ph.D., I concentrated on object detection within the context of autonomous driving,
thoroughly investigating the two-dimensional challenges posed by out-of-distribution and domain
shift. Currently, my primary focus revolves around large
vision/language/multi-modal models, including technical research and practical applications.
πΌ To gain a deeper understanding of technology, I founded ScholaGO Education Technology Company Limited (εΈζ
ιζθ²η§ζζιε
¬εΈ in
Chinese) with five shareholders to develop
an innovative educational product, aiming to convert static knowledge into an interactive,
multi-modal adventure
immersively. Our company is supported by HKSTP, HK Tech 300, and Alibaba Cloud. My ultimate goal is
to develop
valuable technology and products to improve the national happiness index.
I'm also on the job market and looking for a Research
Scientist/Engineer or Postdoc position. Feel free to contact me if you have any
openings!
Email  / 
CV  / 
Google Scholar
 / 
Github
|
|
π₯ News
-
[Milestone] From 08/2021 to 08/2024, my first-author works have been selected as
ORAL at 4 CV conferences: CVPR, ICCV, ECCV, and AAAI!
- [2024/08] Our work CLIFF is selected as ORAL in ECCV
- [2024/07] We are organizing the first workshop on multi-modal medical foundation models at NeurIPS; Submission Link
- [2024/07] 1 paper using diffusion to tackle open-vocabulary issue from a probabilistic view-point is accepted by ECCV
2024
-
[2024/06] 6 papers are accepted by MICCAI 2024 , including the works related
to:
Video Generation, Efficient Gaussian Splatting, 3D Point Cloud Segmentation,
Diffusion Model, and Brain Analysis.
- [2024/04] Our work on metasurfaces and stereo vision has been selected as the Cover Paper in ACS Photonics!
- [2023/12] I pass my PhD defense with Early
Graduation!!
- [2023/08] Our work SOMA is selected as ORAL in
ICCV.
- [2023/07] 2 papers are accepted by ICCV 2023.
- [2023/06]
Successfully secured seed funding for our startup team!
- [2023/05] 1 paper is accepted by MICCAI 2023.
- [2023/04] 1 paper is accepted by TNNLS 2023.
- [2023/03] 1 paper is accepted by TMI 2023.
- [2023/02] 1 paper is accepted by CVPR 2023.
- [2023/01] 1 paper is accepted by TPAMI 2023.
- [2022/10] 1 paper is accepted by TMM 2022.
- [2022/06] Our work SIGMA appears on CVPR Best Paper Finalist
[33/8161]!
- [2022/05] 1 paper is accepted by MICCAI 2022 ( Early
Accept,
ORAL).
- [2022/03] 2 papers are accepted by CVPR 2022 (one
ORAL
).
- [2021/10] Our work SCAN is accepted by AAAI 2022 (
ORAL
).
|
π Selected Publication
|
|
CLIFF: Continual Latent Diffusion for Open-Vocabulary Object Detection
European Conference on Computer Vision, ECCV, 2024
Wuyang Li, Xinyu Liu, Jiayi Ma, Yixuan Yuan
paper (soon)
/
codes (soon)
Key Words: Open-Vocabulary Object Detection; Diffusion Model
TL;DR: A probabilistic pipeline modeling the subspace transfer among the object,
image, and text subspaces with continual diffusion.
|
|
From Static to Dynamic Diagnostics: Boosting Medical Image Analysis via
Motion-Informed Generative Videos
Medical Image Computing and Computer Assisted Intervention, , MICCAI, 2024
Wuyang Li, Xinyu Liu, Qiushi Yang, Yixuan Yuan
paper (soon)
/
codes (soon)
Key Words: Medical Video Generation; Semi-Supervised Diagnosis
TL;DR: Enhancing the semi-supervised diganosis with generative medical
videos, enabling the learning across image and video modalities jointly.
|
|
When 3D Partial Points Meets SAM: Tooth Point Cloud Segmentation with Sparse
Labels
Medical Image Computing and Computer Assisted Intervention, , MICCAI, 2024
Yifan Liu, Wuyang Li, Cheng Wang, Hui Chen, Yixuan Yuan
paper (soon)
/
codes (soon)
Key Words: SAM; 3D Point Cloud Segmentation; Severely Limited Label
TL;DR: Leveraging the SAM to address the severe label scarcity in 3D
point cloud segmentation, enbaling good perfromance with only 0.1% label ratio.
|
|
Novel Scenes & Classes: Towards Adaptive Open-set Object Detection
IEEE International Conference on Computer Vision (ICCV), 2023,
ORAL
Wuyang Li, Xiaoqing Guo, Yixuan Yuan
paper
/
codes
Key Words: Open-Set/ Domain Adaptive Object Detection
TL;DR: Formulating a real-world friendly setting with
out-of-distribution
and domain shift, and addressing with high-order subgraphs.
|
|
MRM: Masked Relation Modeling for Medical Image Pre-Training with
Genetics
IEEE International Conference on Computer Vision (ICCV), 2023
Qiushi Yang, Wuyang Li, Paul Li, Yixuan Yuan
paper
/
codes
Key Words: Multimodal Pretraining; Masked Image Modeling
TL;DR: Observing the failure in MIM when the informative
foreground
is limited, and turning to masking relation to solve this issue.
|
|
Adjustment and Alignment for Unbiased Open Set Domain Adaptation
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023
Wuyang Li, Jie Liu, Bo Han, Yixuan Yuan
paper
/
codes/
video
Key Words: Open Set Domain Adaptation; Causal Theory
TL;DR: Driven by new observations, proposing a theoretically
grounded method to tackle biased learning and cross-domain transfer with
causality.
|
|
SIGMA: Semantic-complete Graph Matching for Domain Adaptive Object
Detection
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022,
ORAL, Best Paper Finalist
Wuyang Li, Xinyu Liu, Yixuan Yuan
paper /
codes /
η₯δΉ
Key Words: Domain Adaptive Object Detection; Graph Matching
TL;DR: Reformulating domain adaptation as a graph matching
problem among fine-grained cross-domain feature points.
|
|
Towards Robust Adaptive Object Detection under Noisy Annotations
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022
Xinyu Liu, Wuyang Li, Qiushi Yang, Baopu Li, Yixuan Yuan
paper /
codes
Key Words: Open Set Domain Adaptation; Noisy Label
TL;DR: Exploring a more practical setting with noisy
annotations
in DAOD and solving by measuring the latent transferability of object noise.
|
|
SCAN: Cross Domain Object Detection with Semantic Conditioned
Adaptation
The Association for the Advance of Artificial Intelligence (AAAI), 2022,
ORAL
Wuyang Li, Xinyu Liu, Xiwen Yao, Yixuan Yuan
paper /
codes
Key Words: Open Set Domain Adaptation; Graph-based Learning
TL;DR: Discovering the key factor leading to the perforrmance
drop in DAOD and addressing with cross-domain semantic-conditioned kernels.
|
|
Intervention & Interaction Federated Abnormality Detection with
Noisy
Clients
Medical Image Computing and Computer Assisted Intervention (MICCAI),
2022,
ORAL, Early Accept
Xinyu Liu, Wuyang Li, Yixuan Yuan
paper
/
codes
Key Words: Federated Learning; Noisy Label; Causal Theory
TL;DR: Studying the recognition bias in federated object
detection under noisy annotations, and addressing the bias with a novel causal
intervention.
|
|
SIGMA++: Improved Semantic-complete Graph Matching for Domain
Adaptive Object Detection
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI),
2023,
IF:
24.314
Wuyang Li, Xinyu Liu, Yixuan Yuan
paper /
codes
Key Words: Domain Adaptive Object Detection; Hypergraph
Matching
TL;DR: Improving pair-wise SIGMA with high-order hypergraph,
thereby addressing the domain misalignment with group-level adaptation.
|
|
HTD: Heterogeneous Task Decoupling for Two-Stage Object Detection
IEEE Transactions on Image Processing (TIP), 2022, IF: 11.041
Wuyang Li, Zhen Chen, Baopu Li, Dingwen Zhang, Yixuan Yuan
paper
/
codes
Key Words: Generic Object Detection; Graph-based Learning
TL;DR: Discovering the heterogeneous feature demands between
the classification and regression, and solving via task-decoupled designs.
|
|
Decoupled Unbiased Teacher for Source-Free Domain Adaptive Medical
Object
Detection
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Xinyu Liu, Wuyang Li, Yixuan Yuan
paper /
codes
Key Words: Source-free Domain Adaptive Object Detection;
Causal Theory
TL;DR: Identifying the bias at the data, model and prediction
levels in SFDA, and sovling with causal intervention.
|
π Other Publication
Image/Video Generative Models
-
Endora: Video Generation Models as Endoscopy
Simulators
Chenxin Li, Hengyu Liu, Yifan Liu, Brandom Feng, Wuyang
Li, Xinyu Liu, Zhen Chen, Jing Shao, Yixuan Yuan
Medical Image Computing and Computer Assisted Intervention (MICCAI),
2024
-
DiffRect: Latent Diffusion Label Rectification for Semi-supervised
Medical Image Segmentation
Xinyu Liu, Wuyang Li, Yixuan Yuan
Medical Image Computing and Computer Assisted Intervention (MICCAI),
2024
Brain MRI Analysis
-
FM-APP: Foundation Model for Any Phenotype Prediction via fMRI to
sMRI Knowledge Transfer
Zhibin He, Wuyang Li, Yifan Liu, Xinyu Liu, Junwei Han,
Tuo Zhang, Yixuan Yuan
IEEE Transactions on Medical Imaging (TMI), 2024, Under Review
-
F2TNet: FMRI to T1w MRI Knowledge Transfer Network for Brain
Multi-phenotype Prediction
Zhibin He, Wuyang Li, Yu Jiang, Zhihao Peng, Pengyu Wang,
Xiang Li, Tianming Liu, Junwei Han, et al.
Medical Image Computing and Computer Assisted Intervention (MICCAI),
2024
-
H2GM: A Hierarchical Hypergraph Matching Framework for Brain
Landmark Alignment
Zhibin He, Wuyang Li, Tuo Zhang, Yixuan Yuan
Medical Image Computing and Computer Assisted Intervention (MICCAI),
2023
LLM Agent
-
LLM-guided Decoupled Probabilistic Prompt for Continual Learning
in Medical Image Diagnosis
Yiwen Luo, Wuyang Li, Chen Cheng, Xiang Li, Tianming Liu,
Yixuan Yuan
IEEE Transactions on Medical Imaging (TMI), 2024, Under Revision
3D Vision
-
LGS: A Light-weight 4D Gaussian Splatting for Efficient Surgical
Scene Reconstruction
Hengyu Liuβ, Yifan Liuβ, Chenxin Liβ, Wuyang Li, Yixuan
Yuan
Medical Image Computing and Computer Assisted Intervention (MICCAI),
2024
-
Stereo Vision Meta-Lens-Assisted Driving Vision
Xiaoyuan Liu, Wuyang Li, Takeshi Yamaguchi, Zihan Geng,
Takuo Tanaka, Din Ping Tsai, Mu Ku Chen
ACS Photonics, 2024, Selected as Cover Paper
-
GRAB-Net: Graph-based Boundary-aware Network for Medical Point
Cloud Segmentation
Yifan Liu, Wuyang Li, Jie Liu, Hui Chen, Yixuan Yuan
IEEE Transactions on Medical Imaging (TMI), 2023
Graph-based Learning
-
Medical Federated Learning with Joint Graph Purification for Noisy
Label Learning
Zhen Chen, Wuyang Li, Xiaohan Xing, Yixuan Yuan
IEEE Transactions on Neural Networks and Learning Systems (TNNLS),
2023
-
SCAN++: Enhanced Semantic Conditioned Adaptation for Domain
Adaptive Object Detection
Wuyang Li, Xinyu Liu, Yixuan Yuan
IEEE Transactions on Multimedia (TMM), 2022
Medical Imaging Analysis
-
U-KAN Makes Strong Backbone for Medical Image Segmentation and
Generation
Chenxin Liβ, Xinyu Liuβ, Wuyang Liβ, Cheng Wangβ, Hengyu
Liu, Yixuan Yuan (β Equal Contribution)
ArXiv preprint arXiv:2406.02918, 2024
-
Joint polyp detection and segmentation with heterogeneous
endoscopic data
Wuyang Li, Chen Yang, Jie Liu, Xiaoqing Guo, Yixuan
Yuan
International Symposium on Biomedical Imaging (ISBI) Workshop,
2021
|
π‘ Service
Journals Reviewer
- IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
- International Journal of Computer Vision (IJCV)
- IEEE Transactions on Neural Networks and Learning Systems (TNNLS)
- IEEE Transactions on Image Processing (TIP)
- IEEE Transactions on Automation Science and Engineering (TASE)
- Pattern Recognition (PR)
Conferences Reviewer
- Conference on Neural Information Processing Systems (neurIPS), 2024
- IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023, 2024
- IEEE International Conference on Computer Vision (ICCV), 2023
- AAAI Conference on Artificial Intelligence (AAAI), 2023, 2024
|
π
Selected Honors
- [2023] Outstanding Academic Performance Award (OAPA), CityU
- [2023] Research Tuition Scholarship (RTS), CityU
- [2022] Outstanding Academic Performance Award (OAPA), CityU
- [2022] Research Tuition Scholarship (RTS), CityU
- [2018] National Scholarship (Top 2% student)
- [2017] National Scholarship (Top 2% student)
- [2017] Tianjin Mathematical Competition (Second Prize)
- [2017-2020] Outstanding Student Scholarship (Top 10% student)
|
|
Tianjin University (TJU), China
Sep. 2016 - Jun. 2020οΌ Bachelor's degree of Communication Engineering.
GPA: 3.83/4.00, 91.3/100, Ranking 6/120
|
|
NUS (Suzhou) Research Institute (NUSRI), China
Sep. 2019 - Jun. 2020: Exchanging program of Electrical and Computer
Engineering.
Supervisors: Prof. Zhiying Zhou
Complete the project: Towards Webpage-based Augmentation Reality (AR)
|
|
City University of Hong Kong (CityU), China
Sep. 2020 - Dec. 2023: Ph.D Study of Electrical Engineering.
Supervisors: Prof. Yixuan Yuan
|
|
Founder & Director, ScholaGO Education Technology Company Limited
Jun. 2024 - Present: Founder & Director of the startup company
I am fortunate to work with other co-founders to develop our product.
|
π¨ Personal Interests
- Painting and Designing: I used to do sketch training with art
candidates
and
have a
a certain level of graphic design foundation. I have a strong
interest
in user
needs
analysis and product design.
- I am looking for the opportunity to establish a start-up team
and
create
some
awesome high-tech products.
|
|