Shared and disease-specific pathways in frontotemporal dementia, Alzheimer’s and Parkinson’s diseases
This repository contains the code for bioinformatics analyses described in the article "Shared and disease-specific pathways in frontotemporal dementia, Alzheimer’s and Parkinson’s diseases".
This project investigated plasma proteomics data from the SomaScan 7K platform in over 10K individuals from the Global Neurodegeneration Proteomics Consortium (GNPC). This data is utilized to identify proteins associated with Alzheimer disease (AD), Parkinson disease (PD), and Frontotemporal dementia (FTD). Idnetified proteins were leveraged to create disease-spcific prediction models, uncover shared and distinct proteomic alterations across these diseases. We also performed biological pathways and cell type enrichment analyses to understand underlying disease biology.
The code covers the following main analysis steps:
- Data pre-processing: Proteomics data preparation, QC, and normalization
- Differential expression analyses
- Disease correlation analyses
- Pathway enrichment analyses
- Cell type enrichment analysis
- Network and perturbation analyses
- Prediction model development using Lasso regression
- Sensitivity analyses
Pre-processed and normalized proteomic datasets used in preparing this manuscript will be shared at the Global Neurodegeneration Proteomics Consortium (https://www.neuroproteome.org/) website in July 2025.
The code is available under the MIT License.
The code was tested on R 4.3.0 on Linux operating systems, but should be compatible with later versions of R installed on current Linux, Mac, or Windows systems.
To run the code, the correct working directory containing the input data must be specified at the beginning of the R-scripts, otherwise the scripts can be run as-is.