biomedical and healthCAre Data Discovery Index Ecosystem
Title: Supporting data for the paper: "An integrated catalog of reference genes in the human gut microbiome"
dateReleased:
07-02-2014
privacy:
not applicable
aggregation:
instance of dataset
dateCreated:
06-23-2014
refinement:
curated
ID:
doi:10.5524/100064
creators:
Li, J
Jia, H
Cai, X
Zhong, H
Feng, Q
Sunagawa, S
Arumugam, M
Kultima,
Prifti, E
Nielsen, T
Juncker, A
Manichanh, C
Chen, B
Zhang, W
Levenez, F
Wang, J
Xu, X
Xiao, L
Liang, S
Zhang, D
Zhang, Z
Chen, W
Zhao, H
Al-Aama, JY
Edris, S
Yang, H
Hansen, T
Nielsen, HB
Brunak, S
Kristiansen, K
Guarner, F
Pedersen, O
Doré, J
Ehrlich, SD
,
Bork, P
availability:
available
types:
sequence
description:
Here we sequenced 249 fecal samples from European adults, bringing the total to 760 samples in the Metagenomics of the Human Intestinal Tract (MetaHIT) project. All 6.4 TB of whole-genome shotgun sequencing data from 1267 fecal samples in MetaHIT, the Human Microbiome Project (HMP), and our diabetes study on Chinese adults were processed with the MOCAT pipeline. The resulting gene catalogs were merged using CD-HIT and complemented with genes from 511 sequenced human gut-related prokaryotic genomes that were present in our gut metagenomes. The final high-quality integrated reference catalog of the human gut microbiome contains 9,879,896 non-redundant genes. The genes were phylogenetically annotated against 3449 bacterial and archaeal genomes and draft genomes from NCBI, and functionally annotated using orthologous groups from the Kyoto Encyclopedia of Genes and Genomes (KEGG) and the evolutionary genealogy of genes: Non-supervised Orthologous Groups (eggNOG) databases. In addition, 11 samples from the Chinese cohort were re-extracted using the MetaHIT DNA extraction protocol and shotgun-sequenced for comparison with the original data, which were generated with a slightly different DNA extraction protocol.
accessURL: https://doi.org/10.5524/100064
storedIn:
GigaScience Database
qualifier:
not compressed
format:
HTML
accessType:
landing page
primary:
true
authentication:
none
authorization:
none
abbreviation:
GigaDB
homePage: http://gigadb.org/
ID:
SCR:006565
name:
GigaScience Database
