This project develops new mathematical and computational tools for integrating high-dimensional datasets with partially shared structures, a challenge that arises across various fields, including molecular biology, precision medicine, business analytics, and economics. When data are collected from multiple sources—such as different individuals, experimental conditions, or technologies—joint analysis can reveal complex patterns that would be missed if each dataset were analyzed in isolation. However, existing methods often struggle to distinguish meaningful signals from noise, particularly when the data are high-dimensional and heterogeneous. This project addresses these limitations by creating a principled framework to uncover and align shared low-dimensional structures across datasets, ultimately enabling more accurate, interpretable, and biologically relevant insights. The project will also contribute to the broader community by developing open-source software tools and offering interdisciplinary training opportunities for students at various levels. This project will build new theoretical foundations and methods at the intersection of random matrix theory, manifold learning, and high-dimensional statistics, and it is closely related to artificial intelligence. Key contributions include new results in random matrix theory for composite and kernel matrices formed from multiple datasets, a Procrustes-based framework for aligning low-dimensional structures in high-dimensio