TitleTranscriptome sequencing reveals thousands of novel long non-coding RNAs in B cell lymphoma.
Publication TypeJournal Article
Year of Publication2015
AuthorsVerma, Akanksha, Jiang Yanwen, Du Wei, Fairchild Lauren, Melnick Ari, and Elemento Olivier
JournalGenome Med
Volume7
Pagination110
Date Published2015 Nov 01
ISSN1756-994X
KeywordsAnimals, B-Lymphocytes, Cell Line, Tumor, Dogs, Humans, Lymphoma, Large B-Cell, Diffuse, RNA, Long Noncoding, Sequence Analysis, RNA, Transcriptome
Abstract

<p><b>BACKGROUND: </b>Gene profiling of diffuse large B cell lymphoma (DLBCL) has revealed broad gene expression deregulation compared to normal B cells. While many studies have interrogated well known and annotated genes in DLBCL, none have yet performed a systematic analysis to uncover novel unannotated long non-coding RNAs (lncRNA) in DLBCL. In this study we sought to uncover these lncRNAs by examining RNA-seq data from primary DLBCL tumors and performed supporting analysis to identify potential role of these lncRNAs in DLBCL.</p><p><b>METHODS: </b>We performed a systematic analysis of novel lncRNAs from the poly-adenylated transcriptome of 116 primary DLBCL samples. RNA-seq data were processed using de novo transcript assembly pipeline to discover novel lncRNAs in DLBCL. Systematic functional, mutational, cross-species, and co-expression analyses using numerous bioinformatics tools and statistical analysis were performed to characterize these novel lncRNAs.</p><p><b>RESULTS: </b>We identified 2,632 novel, multi-exonic lncRNAs expressed in more than one tumor, two-thirds of which are not expressed in normal B cells. Long read single molecule sequencing supports the splicing structure of many of these lncRNAs. More than one-third of novel lncRNAs are differentially expressed between the two major DLBCL subtypes, ABC and GCB. Novel lncRNAs are enriched at DLBCL super-enhancers, with a fraction of them conserved between human and dog lymphomas. We see transposable elements (TE) overlap in the exonic regions; particularly significant in the last exon of the novel lncRNAs suggest potential usage of cryptic TE polyadenylation signals. We identified highly co-expressed protein coding genes for at least 88 % of the novel lncRNAs. Functional enrichment analysis of co-expressed genes predicts a potential function for about half of novel lncRNAs. Finally, systematic structural analysis of candidate point mutations (SNVs) suggests that such mutations frequently stabilize lncRNA structures instead of destabilizing them.</p><p><b>CONCLUSIONS: </b>Discovery of these 2,632 novel lncRNAs in DLBCL significantly expands the lymphoma transcriptome and our analysis identifies potential roles of these lncRNAs in lymphomagenesis and/or tumor maintenance. For further studies, these novel lncRNAs also provide an abundant source of new targets for antisense oligonucleotide pharmacology, including shared targets between human and dog lymphomas.</p>

DOI10.1186/s13073-015-0230-7
Alternate JournalGenome Med
PubMed ID26521025
PubMed Central IDPMC4628784