# A summary-data-based Mendelian randomization method with application to correlated lipidomic data

> **NIH NIH R21** · UNIV OF NORTH CAROLINA CHAPEL HILL · 2022 · $214,259

## Abstract

PROJECT SUMMARY
In recent years, state-of-the-art technologies have allowed for high-throughput profilings of -omics markers, and
these emerging data have been powerful tools to understand complex biological systems. Meanwhile, genome-
wide association studies (GWAS) with -omics markers have been conducted, and the emerging summary
statistics has provided evaluable opportunity to examine causality between -omics markers and disease
outcomes using Mendelian randomization (MR). However, most of the time, these novel -omics data are high-
dimensional and correlated, imposing significant challenges in use of MR. We will develop a summary-data-
based
MR method for use with high-dimensional correlated data. Our mode is a multivariable linear mixed model,
and the innovation lies in that we will account for random errors in both independent variables and dependent
variable under fixed- and random-effects models. Our model allows for multiple -omics markers as independent
variables, and boosts power by including variants with less stringent P value as IVs and implementing stepwise
selection which retains the most important -omics markers. We will conduct stratified analysis by tissue
expression and biological function
of genetic variants to test for robustness of causal associations. W
e will
develop an R script and share it with the whole research community in Github (aim 1). We will conduct a
simulation study to compare the performance of our MR model to previous summary-data-based-MR methods
(aim 2). High-density lipoprotein cholesterol (HDL-c) levels have long been recognized an important inverse
predictor of coronary heart disease. However, clinical trials and Mendelian randomization suggest that level of
HDL-c is a poor biomarker of HDL function, and raise an important question about whether other novel measures
of HDL (i.e. lipids in HDLs of different particle sizes) may have causal effects on cardiometabolic disease. HDL
is a collection of particles with different sizes highly heterogeneous in lipid composition. Small HDL that exerts
stronger cholesterol efflux capacity has higher contents of
phospholipids and lower triglycerides and cholesterols,
while large HDL has higher contents of triglycerides and cholesterols and lower phospholipids. Thus, it is
imperative to understand whether lipids in HDL lipoproteins are causally related to cardiometabolic disease.
We
will apply our MR method to examine causal effects of lipids (cholesterol, phospholipids, and triglycerides) in
HDL particles (very large, large, medium, small, very small) on risk of cardiometabolic disease (coronary heart
disease, stroke, and type 2 diabetes) using the recently released lipidomics data in the UK Biobank and existing
summary statistics with lipid fractions from the
MAGNETIC consortium (aim 3).

## Key facts

- **NIH application ID:** 10424741
- **Project number:** 1R21HG012365-01
- **Recipient organization:** UNIV OF NORTH CAROLINA CHAPEL HILL
- **Principal Investigator:** Ming Ding
- **Activity code:** R21 (R01, R21, SBIR, etc.)
- **Funding institute:** NIH
- **Fiscal year:** 2022
- **Award amount:** $214,259
- **Award type:** 1
- **Project period:** 2022-09-23 → 2024-08-31

## Primary source

NIH RePORTER: https://reporter.nih.gov/project-details/10424741

## Citation

> US National Institutes of Health, RePORTER application 10424741, A summary-data-based Mendelian randomization method with application to correlated lipidomic data (1R21HG012365-01). Retrieved via AI Analytics 2026-05-23 from https://api.ai-analytics.org/grant/nih/10424741. Licensed CC0.

---

*[NIH grants dataset](/datasets/nih-grants) · CC0 1.0*