Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
Old-Shatterhand authored Nov 29, 2024
1 parent 88f750c commit 5899a74
Showing 1 changed file with 2 additions and 1 deletion.
3 changes: 2 additions & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -46,6 +46,7 @@ pip install grakel

to install DataSAIL in an already existing environment. Alternatively, one can install DataSAIL-lite from conda.
DataSAIL-lite is a version of DataSAIL that does not install all clustering algorithms as the standard DataSAIL.
Installing either package usually takes less than 5 minutes.

DataSAIL is available for Python 3.8 and newer.

Expand All @@ -58,7 +59,7 @@ datasail --e-type P --e-data <path_to_fasta> --e-sim mmseqs --output <path_to_ou
````

to split a set of proteins that have been clustered using mmseqs. For a full list of arguments, run `datasail -h` and
checkout [ReadTheDocs](https://datasail.readthedocs.io/en/latest/index.html). There is a more detailed explanation of the arguments and example notebooks.
checkout [ReadTheDocs](https://datasail.readthedocs.io/en/latest/index.html). There is a more detailed explanation of the arguments and example notebooks. The runtime largy depends on the number and type of splits to be computed and the size of the dataset. For small datasets (less then 10k samples) DataSAIL finished within minutes. On large datasets (more than 100k samples) it can take several hours to complete.

## When to use DataSAIL and when not to use

Expand Down

0 comments on commit 5899a74

Please sign in to comment.