Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
tjiagoM authored Jun 2, 2021
1 parent aac963a commit 2411859
Showing 1 changed file with 30 additions and 10 deletions.
40 changes: 30 additions & 10 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,20 +1,19 @@
[![DOI](https://zenodo.org/badge/265633374.svg)](https://zenodo.org/badge/latestdoi/265633374)
[![GitHub license](https://img.shields.io/github/license/tjiagoM/gtex-transcriptome-modelling)](https://github.com/tjiagoM/gtex-transcriptome-modelling/blob/master/LICENSE)
[![DOI](https://img.shields.io/badge/DOI-10.1038/s41540--021--00186--6-blue.svg)](https://doi.org/10.1038/s41540-021-00186-6)

# Multilayer modelling of the human transcriptome and biological mechanisms of complex diseases and traits

# Multilayer modelling and analysis of the human transcriptome
*Tiago Azevedo, Giovanna Maria Dimitri, Pietro Lió, Eric R. Gamazon*

![image](https://user-images.githubusercontent.com/7031523/120554714-fc28d080-c3f1-11eb-90a6-df1962370bee.png)

This repository contains all the code necessary to run and further extend the experiments presented in the following paper accepted at _npj Systems Biology and Applications_: [https://doi.org/10.1038/s41540-021-00186-6](https://doi.org/10.1038/s41540-021-00186-6).

*Tiago Azevedo, Giovanna Maria Dimitri, Pietro Lio, Eric R. Gamazon*

This repository contains all the code necessary to run and further extend the experiments presented in the following bioRxiv preprint: [https://doi.org/10.1101/2020.05.21.109082](https://doi.org/10.1101/2020.05.21.109082)

## Abstract

Here, we performed a comprehensive intra-tissue and inter-tissue network analysis of the human transcriptome.
We generated an atlas of communities in co-expression networks in 49 tissues (GTEx v8), evaluated their tissue specificity, and investigated their methodological implications.
UMAP embeddings of gene expression from the communities (representing nearly 18% of all genes) robustly identified biologically-meaningful clusters.
Methodologically, integration of the communities into a transcriptome-wide association study of C-reactive protein (CRP) in 361,194 individuals in the UK Biobank identified genetically-determined expression changes associated with CRP and led to considerably improved performance.
Furthermore, a deep learning framework applied to the communities in nearly 11,000 tumours profiled by The Cancer Genome Atlas across 33 different cancer types learned biologically-meaningful latent spaces, representing metastasis (<img src="https://render.githubusercontent.com/render/math?math=p < 2.2 \times 10^{-16}">) and stemness (<img src="https://render.githubusercontent.com/render/math?math=p < 2.2 \times 10^{-16}">).
Our study provides a rich genomic resource to catalyse research into inter-tissue regulatory mechanisms and their downstream phenotypic consequences.
Here, we performed a comprehensive intra-tissue and inter-tissue multilayer network analysis of the human transcriptome. We generated an atlas of communities in gene co-expression networks in 49 tissues (GTEx v8), evaluated their tissue specificity, and investigated their methodological implications. UMAP embeddings of gene expression from the communities (representing nearly 18% of all genes) robustly identified biologically-meaningful clusters. Notably, new gene expression data can be embedded into our algorithmically derived models to accelerate discoveries in high-dimensional molecular datasets and downstream diagnostic or prognostic applications. We demonstrate the generalisability of our approach through systematic testing in external genomic and transcriptomic datasets. Methodologically, prioritisation of the communities in a transcriptome-wide association study of the biomarker C-reactive protein (CRP) in 361,194 individuals in the UK Biobank identified genetically-determined expression changes associated with CRP and led to considerably improved performance. Furthermore, a deep learning framework applied to the communities in nearly 11,000 tumors profiled by The Cancer Genome Atlas across 33 different cancer types learned biologically-meaningful latent spaces, representing metastasis (<img src="https://render.githubusercontent.com/render/math?math=p < 2.2 \times 10^{-16}">) and stemness (<img src="https://render.githubusercontent.com/render/math?math=p < 2.2 \times 10^{-16}">). Our study provides a rich genomic resource to catalyse research into inter-tissue regulatory mechanisms, and their downstream consequences on human disease.


## Repository Structure
Expand Down Expand Up @@ -140,6 +139,27 @@ The previous command will run the `02_01_correct_confounds.R` script and log the
The scripts for the multilayer modeling approach to TWAS/PrediXcan (CRP in UKB) and Variational Autoencoder model (TCGA) are in this [external repository](https://github.com/gamazonlab/MultilayerModelingTranscriptome).


## Citing this work

To cite our work, we provide the following BibTeX:

```bibtex
@article{Azevedo2021,
doi = {10.1038/s41540-021-00186-6},
url = {https://doi.org/10.1038/s41540-021-00186-6},
year = {2021},
month = may,
publisher = {Springer Science and Business Media {LLC}},
volume = {7},
number = {1},
pages={1--13},
author = {Tiago Azevedo and Giovanna Maria Dimitri and Pietro Li{\'{o}} and Eric R. Gamazon},
title = {Multilayer modelling of the human transcriptome and biological mechanisms of complex diseases and traits},
journal = {npj Systems Biology and Applications}
}
```


## Questions?

If you run into any problem or have a question, please just open an issue.

0 comments on commit 2411859

Please sign in to comment.