Metadata table #1

Open
mrvollger opened this issue Feb 23, 2021 · 0 comments

Hi,

I have a suggestion for this GitHub repository. I think it would be an important resource to maintain a metadata table with all the information we have for each of these samples. Some things I can think of off the top of my head:

  1. Sample
  2. Gender
  3. Super population
  4. Population
  5. Mother
  6. Family ID
  7. Father
  8. Siblings
  9. Phenotype
  10. AWS link for HiFi
  11. AWS link for ONT
  12. AWS link for Hi-C
  13. AWS link for Strand-Seq
  14. AWS link for BioNano
  15. Coverage statistics for all read technologies
  16. AWS link for maternal Illumina reads
  17. AWS link for paternal Illumina reads
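
To make the proposal concrete, here is a minimal sketch of how such a table could be laid out as a tab-separated file. The column names are my own hypothetical spellings of the fields listed above, the `make_row` and `to_tsv` helpers are illustrative only, and `NA` as the placeholder for unknown values is an assumption, not an agreed convention.

```python
import csv
import io

# Hypothetical column names, one per field in the list above.
COLUMNS = [
    "sample", "gender", "super_population", "population",
    "mother", "family_id", "father", "siblings", "phenotype",
    "hifi_aws", "ont_aws", "hic_aws", "strandseq_aws", "bionano_aws",
    "coverage", "maternal_illumina_aws", "paternal_illumina_aws",
]


def make_row(sample, **known):
    """Build one row, filling every field we do not know yet with 'NA'."""
    unexpected = set(known) - set(COLUMNS)
    if unexpected:
        raise KeyError(f"unexpected columns: {sorted(unexpected)}")
    row = {col: "NA" for col in COLUMNS}
    row["sample"] = sample
    row.update(known)
    return row


def to_tsv(rows):
    """Serialize rows to a tab-separated string with a header line."""
    buf = io.StringIO()
    writer = csv.DictWriter(
        buf, fieldnames=COLUMNS, delimiter="\t", lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


# Two rows taken from the attached population table; all other fields stay NA.
rows = [
    make_row("HG00438", super_population="EAS",
             population="Southern Han Chinese"),
    make_row("HG00733", super_population="AMR",
             population="Puerto Ricans from Puerto Rico"),
]
print(to_tsv(rows))
```

A flat TSV keeps the table easy to diff in pull requests and easy to load in R or pandas, which seems like a reasonable starting point while the missing fields are filled in.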

This will probably be a tedious task, but I think it will improve access to the data. I have attached a table that I have been maintaining with population and super population information for all the HiFi samples I know of, in case it is helpful in getting started.

Let me know what you think.

Best,
Mitchell

| # | Sample | Super Population | Population |
|---|--------|------------------|------------|
| 1 | HG00438 | EAS | Southern Han Chinese |
| 2 | HG00514 | EAS | Southern Han Chinese |
| 3 | HG00621 | EAS | Southern Han Chinese |
| 4 | HG00673 | EAS | Southern Han Chinese |
| 5 | HG00733 | AMR | Puerto Ricans from Puerto Rico |
| 6 | HG00735 | AMR | Puerto Ricans from Puerto Rico |
| 7 | HG00741 | AMR | Puerto Ricans from Puerto Rico |
| 8 | HG01071 | AMR | Puerto Ricans from Puerto Rico |
| 9 | HG01106 | AMR | Puerto Ricans from Puerto Rico |
| 10 | HG01109 | AMR | Puerto Ricans from Puerto Rico |
| 11 | HG01123 | AMR | Colombians from Medellin, Colombia |
| 12 | HG01175 | AMR | Puerto Ricans from Puerto Rico |
| 13 | HG01243 | AMR | Puerto Ricans from Puerto Rico |
| 14 | HG01258 | AMR | Colombians from Medellin, Colombia |
| 15 | HG01358 | AMR | Colombians from Medellin, Colombia |
| 16 | HG01361 | AMR | Colombians from Medellin, Colombia |
| 17 | HG01891 | AFR | African Caribbeans in Barbados |
| 18 | HG01928 | AMR | Peruvians from Lima, Peru |
| 19 | HG01952 | AMR | Peruvians from Lima, Peru |
| 20 | HG01978 | AMR | Peruvians from Lima, Peru |
| 21 | HG02080 | EAS | Kinh in Ho Chi Minh City, Vietnam |
| 22 | HG02148 | AMR | Peruvians from Lima, Peru |
| 23 | HG02257 | AFR | African Caribbeans in Barbados |
| 24 | HG02486 | AFR | African Caribbeans in Barbados |
| 25 | HG02559 | AFR | African Caribbeans in Barbados |
| 26 | HG02572 | AFR | Gambian in Western Divisions in the Gambia |
| 27 | HG02622 | AFR | Gambian in Western Divisions in the Gambia |
| 28 | HG02630 | AFR | Gambian in Western Divisions in the Gambia |
| 29 | HG02717 | AFR | Gambian in Western Divisions in the Gambia |
| 30 | HG02723 | AFR | Gambian in Western Divisions in the Gambia |
| 31 | HG02818 | AFR | Gambian in Western Divisions in the Gambia |
| 32 | HG02886 | AFR | Gambian in Western Divisions in the Gambia |
| 33 | HG03125 | AFR | Esan in Nigeria |
| 34 | HG03453 | AFR | Mende in Sierra Leone |
| 35 | HG03486 | AFR | Mende in Sierra Leone |
| 36 | HG03492 | SAS | Punjabi from Lahore, Pakistan |
| 37 | HG03516 | AFR | Esan in Nigeria |
| 38 | HG03540 | AFR | Gambian in Western Divisions in the Gambia |
| 39 | HG03579 | AFR | Mende in Sierra Leone |
| 40 | NA12878 | EUR | Utah Residents (CEPH) with Northern and Western European Ancestry |
| 41 | HG002 | EUR | Ashkenazi |
| 42 | NA18906 | AFR | Yoruba in Ibadan, Nigeria |
| 43 | NA19240 | AFR | Yoruba in Ibadan, Nigeria |
| 44 | NA20129 | AFR | Americans of African Ancestry in SW USA |
| 45 | NA21309 | AFR | Massai, Kenya |
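
As a quick sanity check once the table is in a machine-readable form, something like the following could tally samples per super population. This is only a sketch: the `SAMPLES` list holds a handful of rows copied from the table above rather than the full set, and `count_by_super_population` is a hypothetical helper name.

```python
from collections import Counter

# A few (sample, super population) pairs copied from the table above.
SAMPLES = [
    ("HG00438", "EAS"),
    ("HG00733", "AMR"),
    ("HG01891", "AFR"),
    ("HG03492", "SAS"),
    ("NA12878", "EUR"),
    ("NA19240", "AFR"),
]


def count_by_super_population(samples):
    """Count how many samples fall under each super population code."""
    return Counter(code for _, code in samples)


counts = count_by_super_population(SAMPLES)
for code, n in sorted(counts.items()):
    print(code, n)
```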