I am a Ph.D. student at the College of Computer and Information Technology, Beijing Jiaotong University(BJTU). I am supervised by Prof. Dongxia Chang in the Center for Digital Media Information Processing Lab (Mepro). I have published several papers in SCI/CCF conferences and journals, including ACM MM, TKDE, TMM, TCSVT, and PR. (Resume: EN/中文)
My research interests include multi-view/multi-modal representation learning, deep clustering, self-supervised learning, and contrastive learning. In particular, I focus on:
- 🔍 Contrastive Multi-view Clustering
- 🧠 Incremental Multi-view/Multi-Modal Representation Learning
- 🌐 Self-supervised Multi-view/Multi-Modal Representation Learning
🔥 News
- 2026.06: 🎉🎉 One paper has been accepted by IEEE Transactions on CSVT. Congratulations, Brother Kaixuan❗️
- 2026.05: 🎉🎉 One paper has been accepted by Pattern Recognition. Congratulations, Brother Shaohan❗️
- 2026.05: 🎉🎉 One paper has been accepted by IEEE Transactions on CSVT. Congratulations, Brother Linhua❗️
- 2026.03: 🎉🎉 One paper has been accepted by IEEE Transactions on Multimedia. Congratulations, Brother Zisen❗️
- 2026.03: 🎉🎉 One paper has been accepted by ICME 2026. Congratulations, Brother Zechang❗️
- 2026.02: 🎉🎉 One paper has been accepted by IEEE Transactions on Knowledge and Data Engineering. Congratulations, Brother Zisen❗️
- 2026.01: 🎉🎉 One paper has been accepted by IEEE Transactions on Knowledge and Data Engineering.
- 2025.12: 🎉🎉 One paper has been accepted by the Journal of Dental Research. Congratulations, Brother Aobo❗️
- 2025.10: 🎉🎉 One paper has been accepted by Neurocomputing. Congratulations, Brother Zisen❗️
- 2025.09: 🎉🎉 One paper has been accepted by Neurocomputing. Congratulations, Brother Teng❗️
- 2025.07: 🎉🎉 One paper has been accepted by IEEE Transactions on Multimedia.
- 2025.07: 🎉🎉 One paper has been accepted by ACM MM 2025.
- 2025.03: 🎉🎉 One paper has been accepted by Neurocomputing.
📝 Publications
i † means equal contribution (Co-First Author)
🎓 First-author Publications

Beyond One-Layer Decision: Hierarchical Multi-view Clustering with Progressive Cross-Layer Fusion
Kaixuan Zhou†, Pengyuan Li†, Jiahui Zhang, Dongxia Chang*, Yiming Wang, Yao Zhao
- We propose a novel Hierarchical-Aware Multi-view Clustering framework. Unlike conventional approaches restricted to single-layer semantics, this framework establishes a holistic learning paradigm that transcends the representation bottleneck by simultaneously exploiting intra-view hierarchy and inter-view consensus.

Disentangled Contrastive Multi-view Clustering via Semantic Relevance Invariance
Pengyuan Li, Dongxia Chang*, Yiming Wang, Zisen Kong, Linhua Kong, Yao Zhao
- We propose a disentangled contrastive multi-view clustering via semantic relevance invariance, which achieves intra-view and inter-view disentanglement and thus a more discriminative representation. The method not only makes disentangled representations containing different underlying information but also ensures their semantic relevance consistency.

Deep Multi-view Clustering with Intra-view Similarity and Cross-view Correlation Learning
Pengyuan Li, Dongxia Chang*, Yiming Wang, Man Liu, Zisen Kong, Linhua Kong, Yao Zhao
- We present a novel deep learning framework that mitigates view bias through joint optimization of intra-view similarity and cross-view correlation. The proposed method enhances fine-grained structures within each view and adaptively balances diverse information across views, ultimately improving clustering performance.

AEMVC: Mitigate Imbalanced Embedding Space in Multi-view Clustering
Pengyuan Li†, Man Liu†, Dongxia Chang*, Yiming Wang, Zisen Kong, Yao Zhao
- We found that the embedding space learned using the encoder-decoder architecture cannot embrace the efficacy of different feature directions. Therefore, we propose a novel Activate-Then-Eliminate Strategy for Multi-View Clustering to adjust the contribution strength of different feature directions dynamically.

DCMVC: Dual Contrastive Multi-view Clustering
Pengyuan Li, Dongxia Chang*, Zisen Kong, Yiming Wang, Yao Zhao
- We propose a novel deep contrastive multi-view clustering method termed DCMVC. The dual contrastive mechanism can alleviate the constraints of a single positive sample on contrastive learning by incorporating category information to regularize the feature structure and fully explore the consistency of similar samples.

Deep Learning on Histology Images for Differentiating of Fibro-Osseous
Aobo Zhang†, Pengyuan Li†, Jiang Xue†, Jianyun Zhang, Zhu You, Shaohua Ge, Zhixiu Xu, Zhipeng Sun, Dongxia Chang*, Lisha Sun, Tiejun Li
- Our results demonstrate that integrating multi-slide and weakly supervised strategies significantly enhances diagnostic performance for fibro-osseous lesions. Compared to human pathologists, the multi-slide models achieved higher accuracy, whereas weakly supervised models consistently outperformed fully supervised models.
📑 Other-author Publications

Multi-Level Decoupled Trend Learning for GNN-Based Multivariate Time Series Prediction
Shaohan Li, Zhenfeng Zhu, Youru Li, Yeyu Yan, Shuai Zheng, Pengyuan Li, Yan Zhuang, Yao Zhao
- We propose a multi-level decoupled trend learning (MDTL) framework for MTS prediction, which decouples the complex MTS signals at trend and spatial-temporal dependency levels and then fuses them in a flexible way.

Revisiting Radar Camera Alignment by Contrastive Learning for 3D Object Detection
Linhua Kong, Dongxia Chang, Lian Liu, Zisen Kong, Pengyuan Li, Yao Zhao
- We propose a novel radar camera alignment model called RCAlign based on the sparse BEV alignment methods for 3D object detection.

ATMCA: Augmented Tensorized Consensus Learning for Multi-view Clustering with Anchor-Aligned
Zisen Kong, Pengyuan Li, Dongxia Chang, Yiming Wang, Yao Zhao
- We provide an intuitive solution to the Anchor-Unaligned Problem. The method introduces the reordering alignment mechanism and augmented tensorized consensus learning into the joint optimization framework.

Beyond Forced Modality Balance: Intrinsic Information Budgets for Multimodal Learning、
Zechang Xiong, Da Li*, Kexin Tang, Pengyuan Li, Wenkang Kong, Yulan Hu
- We argue that modality balance should be defined by an intrinsic equilibrium determined by the information capacity of each modality, rather than heuristic equalization. Therefore, we propose IIBalance, a multimodal learning framework for balanced learning under capacity-aware guidance.

Tensorial Multi-view Clustering via Alternative Rank Minimization and Inter-view Alignment
Zisen Kong, Dongxia Chang*, Yiming Wang, Pengyuan Li, Yao Zhao
- We propose a novel rank minimization strategy for tighter rank function approximation. The strategy can effectively utilize the low-rank structure and higher-order correlations embedded in different views, which helps to generate a discriminative consensus representation.

Bipartite Contrastive Multi-view Clustering with Singular Value Modulation
Teng Zhang, Pengyuan Li, Zisen Kong, Dongxia Chang*, Yao Zhao
- We reformulate contrastive learning as a binary classification problem, avoiding the limitation in previous contrastive methods that heavily depend on naturally paired data. By capturing sample-level and category-level consistency relationships among multiple views, the learned representations are further refined.

Local Geometry-Enhanced Anchor Learning for Multi-view Clustering
Zisen Kong, Zhiqiang Fu, Dongxia Chang*, Yiming Wang, Pengyuan Li, Yao Zhao
- We introduce a coarse-grained anchor learning mechanism that maps each view anchor to the consensus space, effectively improving the expressiveness and learning of the framework.
🎖 Honors and Awards
- 2025.11 First-class Academic Scholarship of Beijing Jiaotong University.
- 2025.11 Special Scholarship of Beijing Jiaotong University - Jiaokong Technology Scholarship.
- 2023.11 First-class Academic Scholarship of Beijing Jiaotong University.
- 2023.06 Outstanding Graduate Student of the School of Computer Science, Beijing Jiaotong University.
- 2022.10 National Bronze Award of the 2022 China University Computer Competition - Team Programming Ladder Competition.
- 2022.10 National Bronze Award of China Computer Design Contest 2022.
📌 Services
Conferences
- Reviewer: NeurIPS/ICML/AAAI/ACM MM
Journal
- Reviewer: IEEE TPAMI/TIP/TKDE/TCYB/TNNLS/TMM, Neurocomputing
📖 Educations
- 2024.06 - now, Ph.D. Student @ Beijing Jiaotong University, supervised by Prof. Dongxia Chang.
- 2023.09 - 2024.06, Master Student @ Beijing Jiaotong University, supervised by Prof. Dongxia Chang.
💻 Internships
- 2023.03 - 2023.06, PCITC, China.