-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
TAD_pathways #19
base: master
Are you sure you want to change the base?
TAD_pathways #19
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you for your interest and contributions to this project @Colin-Cheng ! Many of these changes are exciting. I have made several in line comments that must be addressed before the changes are merged in.
One thing that is not mentioned in the in line comments: Please add *.pyc
to the .gitignore
file and remove all .pyc
files from this Pull Request.
@@ -33,7 +33,7 @@ experimental validation at | |||
First, clone the repository and navigate into the top directory: | |||
|
|||
```bash | |||
git clone git@github.com:greenelab/tad_pathways_pipeline.git | |||
git clone https://github.com/marislab/tad_pathways_pipeline.git |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
this is not an acceptable change to the master pipeline. This change can live in the marislab fork.
#-s, --SNP_File path to the snp file | ||
|
||
# process arguments | ||
function usage() |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I love the resourcefulness here and am a big fan of the essence of these modifications.
However, can you port this logic into a python script with arguments defined by argparse
? I would much prefer to keep run_pipeline.sh
unchanged since it will impact the original analysis.
scripts/build_custom_tad_genelist.py
Outdated
@@ -16,6 +16,7 @@ | |||
--snp_data_file The file name of where SNP data is stored. SNP data is | |||
generated by `scripts/build_snp_list.R` | |||
--output_file The name of the results file to build | |||
--TAD-Boundary The name of tad cell to use |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can we make this argument lowercase and add a suffix: --tad-boundary-file
scripts/build_custom_tad_genelist.py
Outdated
@@ -30,6 +31,7 @@ | |||
parser = argparse.ArgumentParser() | |||
parser.add_argument("-s", "--snp_data_file", help="Location of SNP data") | |||
parser.add_argument("-o", "--output_file", help="Name of the output file") | |||
parser.add_argument("-t", "--TAD-Boundary", help="Name of the tad cell") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
see above comment regarding the name of the argument. Lets also add a default.
parser.add_argument("-t", "--tad-boundary-file", help="Name of the tad cell",
default='data/GENE_index_hg19_hESC.tsv.bz2')
scripts/build_custom_tad_genelist.py
Outdated
@@ -59,12 +61,13 @@ def assign_custom_snp_to_tad(snp_signal, tad_boundary): | |||
# Load Constants | |||
snp_data_file = args.snp_data_file | |||
output_file = args.output_file | |||
tad_cell = args.TAD_Boundary |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Great! (except it now needs to be updated to lowercase)
tad_boundary_file = args.tad-boundary-file
@@ -0,0 +1,677 @@ | |||
""" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
remove from PR
@@ -0,0 +1,234 @@ | |||
""" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
remove from PR
@@ -0,0 +1,313 @@ | |||
""" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
remove from PR
@@ -0,0 +1,2819 @@ | |||
chromosome db type start stop strand gene_type gene_name |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
remove from PR
@@ -0,0 +1,46 @@ | |||
gene_type gene_class hg19 mm9 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
remove from PR
This version of TAD_pathways enable users to just input the TAD domain file and SNPs file into the pipeline. And the program will go through the tad pathways analysis.