GenBrain - A Generative Foundation Model of Multimodal Brain Imaging

The version corresponding to the submitted manuscript is tagged as v1.0. This repository is currently being organized and further updates will be added over time.

Overview

GenBrain is a generative foundation model designed for multimodal brain imaging.

GenBrain is pre-trained on large-scale neuroimaging data from the UK Biobank and evaluated on 81 heterogeneous datasets, demonstrating strong generalization across imaging modalities and downstream tasks.

This repository provides the core implementation of GenBrain, along with pretrained models for research and development in neuroimaging and AI-driven brain analysis.

System Requirements

OS: Ubuntu 22.04.5 LTS (kernel 3.10.0-1160)
Python: 3.10.12
Libraries: PyTorch 2.4.1+cu118, nibabel 5.3.2, more details see requirements.txt.
GPU: NVIDIA A100-SXM, 80 GB memory

Installation Guide

Install Instructions

# Create conda environment if needed.
conda create -n myenv python=3.10.12
conda activate myenv
# Install packages from requirements.txt
pip install -r requirements.txt

Install time on a "normal" desktop computer takes about 10~20 minutes.
A Docker image enabling direct execution of the code will be made available shortly.

Demo

A demo for image enhancement is provided.
Additional demos for other downstream tasks can be executed using the provided source code.
Model weights can be downloaded from Google Drive or obtained by contacting the corresponding author.

Instructions: We assume that the corrupted images (T1w/T2-FLAIR modalities) have been registered to the MNI152 2mm standard space, and that the corresponding label files and model weights are properly configured. The demo can be run using:
```
python run_inference.py
```
Expected output: An enhanced MRI image.
Runtime: On a standard desktop computer equipped with an NVIDIA GPU with more than 20 GB of memory, inference for a single sample using DDIM (50 steps) takes approximately 10–30 seconds.

Instructions for Use

GenBrain Pretraining Instructions:
1. Data Preprocessing: UKB multimodal brain image dataset are non-linearly registered to the MNI152 2mm standard space. Brain voxels are then extracted according to nonzero indices in the template and saved as .npy files (stored as 1d array, N_voxel=228,453). Note: (FSL software is recommended for preprocessing)
2. Configure Pretraining Settings and Files: Prepare model pretraining parameters, data file, and label file (including individual age, sex, and imaging modality), files are in labels directory. Phenotypic information and imaging modality details can be found in data_info.json.
3. Running the Training: Multi-GPU training requires support for Distributed Data Parallel (Recommend Nvidia A100 GPUs):
```
torchrun --nnodes=N_Node --nproc_per_node=N_GPU train.py
torchrun --nnodes=1 --nproc_per_node=6 train.py    # Our: 1 node with 6 Nvidia A100 GPUs
```
  Pretraining code is in pretraining_evaluation directory.
GenBrain Fine-tuning Instructions:
1. Data Preprocessing
  - Perform data preprocessing similar to the pretraining stage.
  - Ensure input formats, normalization, and any filtering steps match the pretraining pipeline.
2. Load Pretrained Weights and Adapt Model Architecture
  - Load GenBrain's pretrained weights.
  - Modify the architecture according to the target task:
    - Image-level tasks: e.g. adjust patchify layer.
    - Disease label fine-tune: e.g add disease-label embedder.
    - ...
3. Running the Fine-tuning and Inference:
```
torchrun --nnodes=N_Node --nproc_per_node=N_GPU finetune.py # fine-tune
python evaluate.py # inference
```
4. Note: Our downstream task code can be referenced for implementation details and examples.
Reproduction instructions: Follow the default hyper-parameter settings in the source code or according to our paper.

Support downstream tasks

Image Enhancement
- Denoising and motion correction.
- Fine-tuned GenBrain v1.0 Support T1w and T2-FLAIR modalities.
Cross-modality Synthesis
It can be used for :
- Structural Synthesis (e.g. T1w<->FLAIR)
- Functional Synthesis: rs-fMRI fc to task-based fMRI activations (e.g. 15 seed-based rs-fMRI fcs to "shapes" task contrast maps in UKB)
- Structure-Function Synthesis: dMRI scalar maps to rs-fMRI fc (e.g. 9 dMRI maps to Language Network-fc)
Cross-site Diagnosis
- Fine-tune GenBrain on images with disease labels. Synthetic images and real images volumetric measures are extracted by WMH-SynthSeg. These volumetric features served as quantitative inputs for a machine learning classification model(LightGBM).
- By using synthetic images' features, machine learning classification model' generalizability can be improved in cross-site diagnosis. Specifically, this approach enhances the prediction of Schizophrenia and Alzheimer’s disease.
Improving BWAS Reliability
- Fine-tune GenBrain on images with disease labels (modalities like vbm, seed-based rs-fMRI fc).
- Leveraging the population-level prior learned by GenBrain, synthetic images generated by the model improve the reliability of brain-wide association studies.
- Diseases: Schizophrenia, Major Depressive Disorder, Autism Spectrum Disorder.
Clinical Application
- Fine-tune GenBrain on clinical-grade images with disease labels. Synthetic images are added to real images at varying ratios to train the predictive model, improving its diagnostic performance.
- Specifically, this approach enhances the prediction of acute stroke severity ds004889 and chronic aphasia ds004884.
Image Super-resolution
- Fine-tune GenBrain for image super-resolution task (e.g 2mm T1w -> 1mm T1w, operated in MNI152 standard space).
- The pipeline first applies nearest-neighbor interpolation to upsample the low-resolution image to the target resolution, followed by GenBrain for super-resolution.

Pretrained Weights

You can download the pretrained weight here: Google Drive Link

License

This project is licensed under the MIT License.

Citation

For usage of the code and associated manuscript, please cite GenBrain: A Generative Foundation Model of Multimodal Brain Imaging.

Contact

If you encounter any issues while using this code, please feel free to contact me. I will be happy to help.

Name		Name	Last commit message	Last commit date
Latest commit History 51 Commits
demo		demo
diffusion		diffusion
downstream_task_clinical_application		downstream_task_clinical_application
downstream_task_cross_modality_synthesis		downstream_task_cross_modality_synthesis
downstream_task_cross_site_diagnosis		downstream_task_cross_site_diagnosis
downstream_task_image_enhancement		downstream_task_image_enhancement
downstream_task_image_super_resolution		downstream_task_image_super_resolution
downstream_task_improving_bwas_reliability		downstream_task_improving_bwas_reliability
labels		labels
preprocess_file		preprocess_file
pretraining_evaluation		pretraining_evaluation
.DS_Store		.DS_Store
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GenBrain - A Generative Foundation Model of Multimodal Brain Imaging

Overview

System Requirements

Installation Guide

Demo

Instructions for Use

Support downstream tasks

Pretrained Weights

License

Citation

Contact

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

GenBrain - A Generative Foundation Model of Multimodal Brain Imaging

Overview

System Requirements

Installation Guide

Demo

Instructions for Use

Support downstream tasks

Pretrained Weights

License

Citation

Contact

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages