Download the SANPRO programs from the GitHub repository located at
https://github.com/CNIC-Proteomics/SANPRO.
Next, decompress the zip file.
You need to install the Python programming language and the following packages. To install these packages, you can use the pip module in Python:
pip install -r python_requirements.txt
The "get_appris" service requires the APPRIS data files. To obtain these files for a specific species, you need to execute the following bash script:
./create_appris_dbs.sh
This bash script contains the following programs:
The following script downloads annotations for the APPRIS methods that locate annotations in specific regions of the protein:
python download_appris.py -s human -o test/appris -vv &> logs/download_apprishuman.log
Additionally, to convert the method annotations in GTF format to another GTF format that references the protein region, you can use:
python convert_appris.py -ia "test/appris/*.gtf" -iu "test/human_202306.uniprot.tsv" -o "test/appris/human_202306.appris.gtf"