Mountain View
biomedical and healthCAre Data Discovery Index Ecosystem
help Advanced Search
Title: The evolution of lncRNA repertoires and expression patterns in tetrapods      
dateReleased:
01-19-2014
description:
Only a minuscule fraction of long non-coding RNAs (lncRNAs) are well characterized. The evolutionary history of lncRNAs can provide insights into their functionality, but comparative analyses have been precluded by our ignorance of lncRNAs in non-model organisms. Here, we use RNA sequencing to identify lncRNAs in eleven tetrapod species and we present the first large-scale evolutionary study of lncRNA repertoires and expression patterns. We identify ~11,000 primate- specific lncRNA families, which show evidence for selective constraint during recent evolution, and ~2,400 highly conserved lncRNAs (including ~400 genes that likely originated more than 300 million years ago). We find that lncRNAs, in particular ancient ones, are generally actively regulated and may predominantly function in embryonic development. lncRNA X-inactivation patterns reveal an extremely female-biased monotreme-specific lncRNA, which may partially compensate X-dosage in this lineage. Most lncRNAs evolve rapidly in terms of sequence and expression levels, but global patterns like tissue specificities are often conserved. We compared expression patterns of homologous lncRNA and protein-coding families across tetrapods to reconstruct an evolutionarily conserved co-expression network. This network, which surprisingly contains many lncRNA hubs, suggests potential functions for lncRNAs in fundamental processes like spermatogenesis or synaptic transmission, but also in more specific mechanisms such as placenta growth suppression through miRNA production. [Batch 1 and 2] To broaden our understanding of lncRNA evolution, we used an extensive RNA-seq dataset to establish lncRNA repertoires and homologous gene families in 11 tetrapod species. We analyzed the poly- adenylated transcriptomes of 8 organs (cortex/whole brain without cerebellum, cerebellum, heart, kidney, liver, placenta, ovary and testis) and 11 species (human, chimpanzee, bonobo, gorilla, orangutan, macaque, mouse, opossum, platypus, chicken and the frog Xenopus tropicalis), which shared a common ancestor ~370 millions of years (MY) ago. Our dataset included 47 strand-specific samples, which allowed us to confirm the orientation of gene predictions and to address the evolution of sense-antisense transcripts. See also GSE43721 (Soumillon et al, Cell Reports, 2013) for three strand-specific samples for mouse brain, liver and testis.
privacy:
not applicable
aggregation:
instance of dataset
ID:
E-GEOD-43520
refinement:
raw
alternateIdentifiers:
43520
keywords:
functional genomics
dateModified:
05-31-2014
availability:
available
types:
gene expression
accessURL: https://www.ebi.ac.uk/arrayexpress/files/E-GEOD-43520/E-GEOD-43520.raw.1.zip
storedIn:
ArrayExpress
qualifier:
gzip compressed
format:
TXT
accessType:
download
authentication:
none
authorization:
none
accessURL: https://www.ebi.ac.uk/arrayexpress/files/E-GEOD-43520/E-GEOD-43520.processed.1.zip
storedIn:
ArrayExpress
qualifier:
gzip compressed
format:
TXT
accessType:
download
authentication:
none
authorization:
none
accessURL: https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE43520
storedIn:
Gene Expression Omnibus
qualifier:
not compressed
format:
HTML
accessType:
landing page
primary:
true
authentication:
none
authorization:
none
abbreviation:
EBI
homePage: http://www.ebi.ac.uk/
ID:
SCR:004727
name:
European Bioinformatics Institute
homePage: https://www.ebi.ac.uk/arrayexpress/
ID:
SCR:002964
name:
ArrayExpress

Feedback?

If you are having problems using our tools, or if you would just like to send us some feedback, please post your questions on GitHub.