This is the repository for supplementary materials for the upcoming publication
Files:
- 📑
04_Lab_journal.ipynb
- laboratory journal with commands to reproduce pipeline - ⚙️
VirExtr_pipeline
-snakemake
pipeline to extract densoviral sequences from MAGs using customBLAST
database,scripts/extract_seqs.py
script andVirSorter2
- ⚙️
GA_pipeline
-snakemake
pipeline to annotate densoviral genomes usingprokka
- 📁
scripts
- folder with miscellaneous scripts used for data analysis:- 📝
extract_seqs.py
- script to extract longest alignment sequences according toBLAST
results from filtered MAGs - 📝
rename_seqs.py
- script to rename sequences from>NODE_632_length_4705_cov_657475.178280
toDensovirinae_sp._isolate_RBHC_VM_#
etc. - 📝
add_space_gbk.py
- script that fixesprokka
output by adding space between seq name and seq length making it eligable for input inProksee
- 📝
midpoint_root.py
- script to root phylogenetic tree at mid point
- 📝
- 📁
BLAST_vs_Densoviruses
- folder withBLAST
outputs of running filtered MAGs against customBLAST
database containing densoviruses complete genomes fromRefSeq
- 📑
denso.yaml
- conda environment - 📑
vs2.yaml
- conda environment
Instruction:
- Create new environment
denso
conda env create -f denso.yaml
- Create new environment
vs2
conda env create -f vs2.yaml
For running lab journal use denso.yaml
environment. vs2.yaml
environment will be automatically used when running VirExtr_pipeline
snakemake pipeline
This part of work was done using:
- OS: Ubuntu 22.04 (WSL2 on Windows 11 22H2)
- RAM: 32GB (16 for WSL2)
- CPU: Intel Xeon E5-2670v3