-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Missing annotations #4
Comments
The CAT pipeline was dependent on the Minigraph-Cactus graph, resulting in its applicability to only 44 samples (HG002, HG005, NA19240 were set aside to facilitate their use in benchmarking). Conversely, the Ensembl pipeline should include gene annotations for all 47 samples. The link to access the Ensembl gene annotations is: https://s3-us-west-2.amazonaws.com/human-pangenomics/index.html?prefix=submissions/8E6C4ACC-FEA9-4DD8-94A3-B92234206F95--Y1_ENSEMBL_V1/ @mhaukness-ucsc, could you please check if the above link is the version used in the HPRC marker paper? @juklucas, in your opinion, should we consider providing an index file for the Ensembl gene annotations as well? |
Thanks so much! I see the Ensembl annotations and will try them out. |
The above link should be correct for CAT for comparisons to marker paper results; however Ensembl should be used for new analysis. |
Do the coordinates of Ensembl gene annotations in this link match the assembly version in assembly_index/Year1_assemblies_v2_genbank.index? |
Hi,
Thank you for this fantastic resource!
The CAT genes index does not appear to have annotation entries for 3 samples:
HG002
HG005
NA19240
https://github.com/human-pangenomics/HPP_Year1_Assemblies/blob/main/annotation_index/Year1_assemblies_v2_genbank_CAT_genes.index
Are the gene annotations for these 3 samples available elsewhere?
Thanks!
The text was updated successfully, but these errors were encountered: