Recently I have been using the Complete Genomics high coverage sequencing data from the 1000 Genomes project. Its a bit difficult to find information on the populations, samples, and available sequencing data since they are all stored in different places on their ftp server. I decided to make a post that tried to combine all the useful information into one spot. Here are links to files that may be of interest.
population file: gives a key of abbreviations used for the 1000 Genomes populations
pedigree file: provides relationship information on 1000 Genomes individuals that are related as well as gender and 1000 Genomes population
sample file: detailed spreadsheet that offers information on sample id, accession number, population, family, gender, relationship, sequencing center, and coverage for each 1000G sample.
Welcome to the Genome Toolbox! I am glad you navigated to the blog and hope you find the contents useful and insightful for your genomic needs. If you find any of the entries particularly helpful, be sure to click the +1 button on the bottom of the post and share with your colleagues. Your input is encouraged, so if you have comments or are aware of more efficient tools not included in a post, I would love to hear from you. Enjoy your time browsing through the Toolbox.