I am a Ph.D. student at the College of Computer and Information Technology, Beijing Jiaotong University(BJTU). I am supervised by Prof. Dongxia Chang in the Center for Digital Media Information Processing Lab (Mepro). I have published several papers in SCI/CCF conferences and journals, including ACM MM, TKDE, TMM, TCSVT, and PR. (Resume: EN/中文)

My research interests include multi-view/multi-modal representation learning, deep clustering, self-supervised learning, and contrastive learning. In particular, I focus on:

  • 🔍 Contrastive Multi-view Clustering
  • 🧠 Incremental Multi-view/Multi-Modal Representation Learning
  • 🌐 Self-supervised Multi-view/Multi-Modal Representation Learning

🔥 News

  • 2026.06:  🎉🎉 One paper has been accepted by IEEE Transactions on CSVT. Congratulations, Brother Kaixuan❗️
  • 2026.05:  🎉🎉 One paper has been accepted by Pattern Recognition. Congratulations, Brother Shaohan❗️
  • 2026.05:  🎉🎉 One paper has been accepted by IEEE Transactions on CSVT. Congratulations, Brother Linhua❗️
  • 2026.03:  🎉🎉 One paper has been accepted by IEEE Transactions on Multimedia. Congratulations, Brother Zisen❗️
  • 2026.03:  🎉🎉 One paper has been accepted by ICME 2026. Congratulations, Brother Zechang❗️
  • 2026.02:  🎉🎉 One paper has been accepted by IEEE Transactions on Knowledge and Data Engineering. Congratulations, Brother Zisen❗️
  • 2026.01:  🎉🎉 One paper has been accepted by IEEE Transactions on Knowledge and Data Engineering.
  • 2025.12:  🎉🎉 One paper has been accepted by the Journal of Dental Research. Congratulations, Brother Aobo❗️
  • 2025.10:  🎉🎉 One paper has been accepted by Neurocomputing. Congratulations, Brother Zisen❗️
  • 2025.09:  🎉🎉 One paper has been accepted by Neurocomputing. Congratulations, Brother Teng❗️
  • 2025.07:  🎉🎉 One paper has been accepted by IEEE Transactions on Multimedia.
  • 2025.07:  🎉🎉 One paper has been accepted by ACM MM 2025.
  • 2025.03:  🎉🎉 One paper has been accepted by Neurocomputing.

📝 Publications

i † means equal contribution (Co-First Author)

🎓 First-author Publications

TCSVT 2026
sym

Beyond One-Layer Decision: Hierarchical Multi-view Clustering with Progressive Cross-Layer Fusion

TCSVT CCF B JCR Q1 IF 10.8

Kaixuan Zhou†, Pengyuan Li†, Jiahui Zhang, Dongxia Chang*, Yiming Wang, Yao Zhao

Paper | Code

  • We propose a novel Hierarchical-Aware Multi-view Clustering framework. Unlike conventional approaches restricted to single-layer semantics, this framework establishes a holistic learning paradigm that transcends the representation bottleneck by simultaneously exploiting intra-view hierarchy and inter-view consensus.
TKDE 2026
sym

Disentangled Contrastive Multi-view Clustering via Semantic Relevance Invariance

IEEE Transactions on Knowledge and Data Engineering CCF A JCR Q1 IF 11.6

Pengyuan Li, Dongxia Chang*, Yiming Wang, Zisen Kong, Linhua Kong, Yao Zhao

Paper | Code

  • We propose a disentangled contrastive multi-view clustering via semantic relevance invariance, which achieves intra-view and inter-view disentanglement and thus a more discriminative representation. The method not only makes disentangled representations containing different underlying information but also ensures their semantic relevance consistency.
TMM 2026
sym

Deep Multi-view Clustering with Intra-view Similarity and Cross-view Correlation Learning

IEEE Transactions on Multimedia CCF A JCR Q1 IF 9.9

Pengyuan Li, Dongxia Chang*, Yiming Wang, Man Liu, Zisen Kong, Linhua Kong, Yao Zhao

Paper | Code

  • We present a novel deep learning framework that mitigates view bias through joint optimization of intra-view similarity and cross-view correlation. The proposed method enhances fine-grained structures within each view and adaptively balances diverse information across views, ultimately improving clustering performance.
ACM MM 2025
sym

AEMVC: Mitigate Imbalanced Embedding Space in Multi-view Clustering

ACM Multimedia CCF A

Pengyuan Li†, Man Liu†, Dongxia Chang*, Yiming Wang, Zisen Kong, Yao Zhao

Paper | Code

  • We found that the embedding space learned using the encoder-decoder architecture cannot embrace the efficacy of different feature directions. Therefore, we propose a novel Activate-Then-Eliminate Strategy for Multi-View Clustering to adjust the contribution strength of different feature directions dynamically.
Neurocomputing 2025
sym

DCMVC: Dual Contrastive Multi-view Clustering

Neurocomputing CCF C JCR Q2 IF 6.7

Pengyuan Li, Dongxia Chang*, Zisen Kong, Yiming Wang, Yao Zhao

Paper | Code

  • We propose a novel deep contrastive multi-view clustering method termed DCMVC. The dual contrastive mechanism can alleviate the constraints of a single positive sample on contrastive learning by incorporating category information to regularize the feature structure and fully explore the consistency of similar samples.
JDR 2025
sym

Deep Learning on Histology Images for Differentiating of Fibro-Osseous

Journal of Dental Research JCR Q1 IF 6.6

Aobo Zhang†, Pengyuan Li†, Jiang Xue†, Jianyun Zhang, Zhu You, Shaohua Ge, Zhixiu Xu, Zhipeng Sun, Dongxia Chang*, Lisha Sun, Tiejun Li

Paper | Code

  • Our results demonstrate that integrating multi-slide and weakly supervised strategies significantly enhances diagnostic performance for fibro-osseous lesions. Compared to human pathologists, the multi-slide models achieved higher accuracy, whereas weakly supervised models consistently outperformed fully supervised models.

📑 Other-author Publications

PR 2026
sym

Multi-Level Decoupled Trend Learning for GNN-Based Multivariate Time Series Prediction

Pattern Recognition CCF B JCR Q1 IF 9.1

Shaohan Li, Zhenfeng Zhu, Youru Li, Yeyu Yan, Shuai Zheng, Pengyuan Li, Yan Zhuang, Yao Zhao

Paper | Code

  • We propose a multi-level decoupled trend learning (MDTL) framework for MTS prediction, which decouples the complex MTS signals at trend and spatial-temporal dependency levels and then fuses them in a flexible way.
TCSVT 2026
sym

Revisiting Radar Camera Alignment by Contrastive Learning for 3D Object Detection

TCSVT CCF B JCR Q1 IF 10.8

Linhua Kong, Dongxia Chang, Lian Liu, Zisen Kong, Pengyuan Li, Yao Zhao

Paper | Code

  • We propose a novel radar camera alignment model called RCAlign based on the sparse BEV alignment methods for 3D object detection.
TMM 2026
sym

ATMCA: Augmented Tensorized Consensus Learning for Multi-view Clustering with Anchor-Aligned

IEEE Transactions on Multimedia CCF A JCR Q1 IF 9.9

Zisen Kong, Pengyuan Li, Dongxia Chang, Yiming Wang, Yao Zhao

Paper | Code

  • We provide an intuitive solution to the Anchor-Unaligned Problem. The method introduces the reordering alignment mechanism and augmented tensorized consensus learning into the joint optimization framework.
ICME 2026
sym

Beyond Forced Modality Balance: Intrinsic Information Budgets for Multimodal Learning

IEEE International Conference on Multimedia and Expo 2026 CCF B Spotlight

Zechang Xiong, Da Li*, Kexin Tang, Pengyuan Li, Wenkang Kong, Yulan Hu

Paper | Code

  • We argue that modality balance should be defined by an intrinsic equilibrium determined by the information capacity of each modality, rather than heuristic equalization. Therefore, we propose IIBalance, a multimodal learning framework for balanced learning under capacity-aware guidance.
TKDE 2026
sym

Tensorial Multi-view Clustering via Alternative Rank Minimization and Inter-view Alignment

IEEE Transactions on Knowledge and Data Engineering CCF A JCR Q1 IF 11.6

Zisen Kong, Dongxia Chang*, Yiming Wang, Pengyuan Li, Yao Zhao

Paper | Code

  • We propose a novel rank minimization strategy for tighter rank function approximation. The strategy can effectively utilize the low-rank structure and higher-order correlations embedded in different views, which helps to generate a discriminative consensus representation.
Neurocomputing 2025
sym

Bipartite Contrastive Multi-view Clustering with Singular Value Modulation

Neurocomputing CCF C JCR Q2 IF 6.7

Teng Zhang, Pengyuan Li, Zisen Kong, Dongxia Chang*, Yao Zhao

Paper | Code

  • We reformulate contrastive learning as a binary classification problem, avoiding the limitation in previous contrastive methods that heavily depend on naturally paired data. By capturing sample-level and category-level consistency relationships among multiple views, the learned representations are further refined.
Neurocomputing 2025
sym

Local Geometry-Enhanced Anchor Learning for Multi-view Clustering

Neurocomputing CCF C JCR Q2 IF 6.7

Zisen Kong, Zhiqiang Fu, Dongxia Chang*, Yiming Wang, Pengyuan Li, Yao Zhao

Paper | Code

  • We introduce a coarse-grained anchor learning mechanism that maps each view anchor to the consensus space, effectively improving the expressiveness and learning of the framework.

🎖 Honors and Awards

  • 2025.11 First-class Academic Scholarship of Beijing Jiaotong University.
  • 2025.11 Special Scholarship of Beijing Jiaotong University - Jiaokong Technology Scholarship.
  • 2023.11 First-class Academic Scholarship of Beijing Jiaotong University.
  • 2023.06 Outstanding Graduate Student of the School of Computer Science, Beijing Jiaotong University.
  • 2022.10 National Bronze Award of the 2022 China University Computer Competition - Team Programming Ladder Competition.
  • 2022.10 National Bronze Award of China Computer Design Contest 2022.

📌 Services

Conferences

  • Reviewer: NeurIPS/ICML/AAAI/ACM MM

Journal

  • Reviewer: IEEE TPAMI/TIP/TKDE/TCYB/TNNLS/TMM, Neurocomputing

📖 Educations

  • 2024.06 - now, Ph.D. Student @ Beijing Jiaotong University, supervised by Prof. Dongxia Chang.
  • 2023.09 - 2024.06, Master Student @ Beijing Jiaotong University, supervised by Prof. Dongxia Chang.

💻 Internships

  • 2023.03 - 2023.06, PCITC, China.