PolyQTL: A Bayesian method to detect multiple eQTL with control for population structure and relatedness

PolyQTL is a statistical method to perform multiple eQTL detection, and it has a control for population structure and relatedness with the available genetic relatedness matrix (GRM).

This repository contains source code, and sample data, and the step to run it. If you have any questions or comments, please contact bzeng30@gatech.edu or greg.gibson@biology.gatech.edu

License

Software distributed under the terms of the GNU General Public License as published by the Free Software Foundation.

Installation

GCC >=4.7.0, C++ library openmp are needed.

There is a Makefile, and you can just run "make" to install the package.

Running

Genetic relatedness matrix (GRM) is needed to be used to control for population structure and relatedness. You can calculate it with external packages, like GCTA, GEMMA.

There are two modes to run the package: 1. Conditional analysis. In this mode, conditional analysis was firstly conducted, and in each iteration, mixed linear model component of GEMMA package was used to detect peak signal. For each detected peak signal, all variants locating in high LD (r2>=0.3) were extracted, and sampling of causal states was run to estimate the importance of explored variants; 2. One-step. In this mode, the step to detect peak signal was skipped, and importance of each variant was estimated.

To run PolyQTL in conditional analysis mode, three files are wanted: 1. GRM; 2. phenotype-genotype file plink bfile format for genotype; 3. binary plink format genotype.

For the phenotype-genotype file, it should be in the format:

Ind phe1 phe2 ... phen variant1 variant2 variant3 ... variantM

Ind1 a11 a12  ... p11 p12 p13 ... p1M

Ind2 a21 a22  ... p21 p22 p23 ... p2M

.

.

.

Indn an1 an2   ... pn1 pn2 pn3 ... pnM

The package needs some informations to recognise phenotypes, and currently, the phenotype name should be in the format "phe_*".

To run PolyQTL in one-step mode, three files are needed: 1. GRM; 2. phenotype; 3. genotype dosage.

For one-column phenotype, it should be one-column, and contains the phenotype value

For genotype dosage, it should be in the format below:

rs1	rs2	rs3	rs4
1	2	2	2
1	1	1	1
0	2	2	2
0	2	0	0
1	2	1	1
1	2	2	2
1	2	2	2
0	1	1	1
2	2	2	2

Examples

You can run the following command to have a sense of how PolyQTL works.

Before running the command, pleas unzip the GRM file: unzip data/conditional_analysis/GRM_for_1000G_1843_individual.zip, unzip data/one-step/GRM_for_1000G_1843_individual.zip

Conditional-analysis mode

./PolyQTL -t 1 -P data/conditional_analysis/CATSPER1_genotype_phenotype -T CATSPER1 -G data/conditional_analysis/GRM_for_1000G_1843_individual -o output_test -Z data/conditional_analysis/genotype_CATSPER1

In this simulation, two variants rs11227309 rs77836214 were chosen to be causal variants, and explain 4%~8% of phenotype variance. Heritability was set to be 0.6, and Fst=0.2
One-step mode

./PolyQTL -o output_PolyQTL -p data/one-step/CATSPER1.phe -c 1 -t 1 -x data/one-step/CATSPER1.geno -G data/one-step/GRM_for_1000G_1843_individual

In this simulation, one variant, rs11227309 was chosen to be causal and explains 5% of the phenotype variance. Heritability was set to be 0.6, and Fst=0.2.

Name		Name	Last commit message	Last commit date
Latest commit History 100 Commits
armadillo-8.100.1		armadillo-8.100.1
data		data
image		image
Makefile		Makefile
MeQTLPolyG.cpp		MeQTLPolyG.cpp
MeQTLPolyGModel.h		MeQTLPolyGModel.h
PolyQTL		PolyQTL
PostCal.cpp		PostCal.cpp
PostCal.h		PostCal.h
README.md		README.md
TopKSNP.cpp		TopKSNP.cpp
TopKSNP.h		TopKSNP.h
Util.cpp		Util.cpp
Util.h		Util.h
conditional_function.cpp		conditional_function.cpp
conditional_function.h		conditional_function.h
gemma.cpp		gemma.cpp
gemma.h		gemma.h
gemma_gzstream.cpp		gemma_gzstream.cpp
gemma_gzstream.h		gemma_gzstream.h
gemma_io.cpp		gemma_io.cpp
gemma_io.h		gemma_io.h
gemma_lapack.cpp		gemma_lapack.cpp
gemma_lapack.h		gemma_lapack.h
gemma_lmm.cpp		gemma_lmm.cpp
gemma_lmm.h		gemma_lmm.h
gemma_mathfunc.cpp		gemma_mathfunc.cpp
gemma_mathfunc.h		gemma_mathfunc.h
gemma_param.cpp		gemma_param.cpp
gemma_param.h		gemma_param.h

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PolyQTL: A Bayesian method to detect multiple eQTL with control for population structure and relatedness

License

Installation

Running

Examples

About

Releases

Packages

Languages

jxzb1988/PolyQTL

Folders and files

Latest commit

History

Repository files navigation

PolyQTL: A Bayesian method to detect multiple eQTL with control for population structure and relatedness

License

Installation

Running

Examples

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages