Nanopore Full-length transcriptome overview
What’s new of full-length transcriptome ?Traditional 2nd generation transcriptome sequencing can only analyze the regulatory mechanism at the gene level to find the key genes related to traits. Genes can produce multiple transcripts at certain time or situation, the richness and complexity of the transcripts are the direct cause of protein diversity, which will eventually lead to a variety of phenotypes. The full-length transciptome sequencing on 3rd generation platform does not need to break mRNA randomly, the transcript can be sequenced from 5' end to 3' UTR region at once. 3rd generation full-length transcriptome sequencing can tell the complex transcription in organisms, it can reveal the real structure of sequences during transcription, such as alternative splicing, APA, fusion genes, etc.
High cost performance, high throughput, accurate quantification at transcriptome level, low efficiency of multiple alignment
No need to break sequences and gene structure, alternative splicing, fusion gene and other structural characteristics can be identified accurately
Long sequencing reads, no GC specificity amd bases bias
Identification of differential expressed genes (DEGs) and differential expressed transcripts (DETs), functional annotation analysis, in-depth exploration of the regulatory mechanism of functional genes and key pathways
Gene structure identification
Alternative splicing, non-coding RNA, gene family, evolution relationship
Genome annotation quality promotion
Novel genes, gene structure of new alternative splicesome
Practical data display
Sequencing amount is around 2 Gb~20 Gb, the N50 length is around 1500 bp, the mean sequencing length is 1~2 kb and the mean Q score is above Q10.
|Species||Sample Number||Reads Number||Base Number||N50||Average Length||Max Length||Data Quality control|
True cases of Nanopore full-length transcriptome data
Data saturation: Compared to NGS, Nanopore technology needs less data amount to cover the same amount transcripts.
Accurate quantification, low GC bias, low multiple alignment, differential expressed genes (DEG) and differential expressed transcripts (DET) can be handled at once
2 Gb Nanopore data and 6 Gb Illumina data have almost the same count of detected genes, the shared identified differential expressed genes have the same up & down-regulation relationships
|Background||Data amount||Common DEGs||Consensus up-regulated genes||Consensus down-regulated genes|
|Plant with 27,628 genes and 48,332 transcipts||2 Gb||628||174||454|
DEGs identification on ONT and Illumina platforms with the same amount of data
Transcripts number and species identification
|Species||Full-length Rate||Mapping Rate||Known Transcripts||New Transcripts||Known Genes||New Transcripts for
|New Genes||Transcripts for
Full-length transcripts identification of different species
Gene structure identification
Comparison of transcriptome data between Nanopore and Pacbio sequencing platforms
|Background: A Plant with 27,628 genes and 48,332 transcripts|
|Data amount||Identified full-length sequences||Redundant-removed transcripts||Known full-length transcripts||Novel full-length transcripts||Identified genes||Identified
|Comparison of full-length transcripts on ONT and PB platforms|
Analysis of classic cases
Complete genomic and transcriptional landscape analysis using third-generation sequencing: a case study of Saccharomyces cerevisiae CEN。PK113-7D。 (Nucleic Acids Research, 2018, IF=11。561)
1 Use Nanopore to do full-length transcriptome sequencing for Saccharomyces cerevisiae
2 ~509 MB (59X) data were obtained from yeast under glucose condition. Total ~623 MB (72X) data were obtained from yeast under alcohol condition. MinION sequencing depth 64X, Illumina sequencing depth 118X. Mean coverage is very similar between the two platforms.
3 Differential expression and functional enrichment analysis show that the up-regulated genes in glucose culture were enriched to terms related to transcription and translation processes, which was consistent with the phenotype of faster growth in glucose culture. Under the condition of ethanol culture, the up-regulated genes were mainly enriched in TCA cycle, glyoxylic acid pathway and mitochondrial electron transport.
Figure 3。 Summary of the direct RNA sequencing data。 (A) The histogram plot shows the distribution of read length of high quality reads obtained from yeast cell growth ethanol (magenta) and glucose (cyan), respectively, with the distribution of expected transcript lengths derived fromthe ORFs annotation。 (B) Bar plots of the detected highly expressed transcripts are presented as an average normalized count with standard error over four biological replicates for each growth condition。 The constitutively expressed, highly expressed in ethanol growth and highly expressed in glucose growth are illustrated in the left middle and right box, respectively。 (C) The bubble scatter plots show the relationship between the fraction of detected full-length transcripts by the direct RNA sequencing with the transcript length and the level transcript expression。 The violin-boxplots on the right show the overall distribution of the fraction of detected full-length transcripts。
BMKCloud developed by Biomarker Co。 Ltd。 and is an open cloud platform for big biological data analysis。 It has the largest professional user and developer user groups in China。 It provides users with comprehensive bioinformatics analysis including bioinformatics analysis platform, computing resources, public data, information analysis training, social platform, and how to integrate and utilize public data。
Flow chart for Genome-guided Full-length Transcriptome Analysis Platform (GFTAP)
Easy to use
* Online graphical operation and no need to know Linux/programming language
* Tasks can be delivered in 1 minute and be delivered in anywhere with Internet
* A single flow for all intergrated analysis
* Multi-groups-analysis can be delivered in a single submission
* Reference genomes be updated weekly and personalized references are surported
Fast and efficient
* 6 samples be analyzed in 24 hours (gold account)
* 18 main analysis items were integrated in a single flow
* Both NGS and Nanopore fastq data are supported
* DEG, PCA, WGCNA and more analysis items were integrated in a single flow
* Both private and public data are supported
Friendly to personalized demands
* Personalized parameters are supported in tasks submission
* Majority (90%+) of the parameters are open after reports been generated
* Continuous updating and results can be updated after APP been updated
1) Bayega A, Oikonomopoulos S, Zorbas E, et al。 Transcriptome landscape of the developing olive fruit fly embryo delineated by Oxford Nanopore long-read RNA-Seq[J]。 bioRxiv, 2018。
2) Benetta E D, Antoshechkin I, Yang T, et al. Genome Elimination Mediated by Gene Expression from a Selfish Chromosome[J]. bioRxiv, 2019.
3) Byrne A, Supple M A, Volden R, et al。 Depletion of Hemoglobin Transcripts and Long-Read Sequencing Improves the Transcriptome Annotation of the Polar Bear (Ursus maritimus)[J]。 Frontiers in Genetics, 2019。
4) Chuang T, Chen Y, Chen C, et al. Integrative transcriptome sequencing reveals extensive alternative trans-splicing and cis-backsplicing in human cells.[J]. Nucleic Acids Research, 2018, 46(7): 3671-3691.
5) Cruzgarcia L, Obrien G, Sipos B, et al. Generation of a Transcriptional Radiation Exposure Signature in Human Blood Using Long-Read Nanopore Sequencing[J]. Radiation Research, 2019, 193(2).
6) Fleming M B, Patterson E L, Reeves P A, et al. Exploring the fate of mRNA in aging seeds: protection, destruction, or slow decay?[J]. Journal of Experimental Botany, 2018, 69(18): 4309-4321.
7) Garalde D R, Snell E A, Jachimowicz D, et al. Highly parallel direct RNA sequencing on an array of nanopores[J]. Nature Methods, 2018, 15(3): 201-206.
8) Grunberger F, Knuppel R, Juttner M, et al。 Nanopore-based native RNA sequencing provides insights into prokaryotic transcription, operon structures, rRNA maturation and modifications[J]。 bioRxiv, 2019。
9) Gupta I, Collier P G, Haase B, et al. Single-cell isoform RNA sequencing (ScISOr-Seq) across thousands of cells reveals isoforms of cerebellar cell types.[J]. bioRxiv, 2018.
10) Hardwick S A, Bassett S D, Kaczorowski D C, et al. Targeted, High-Resolution RNA Sequencing of Non-coding Genomic Regions Associated With Neuropsychiatric Functions[J]. Frontiers in Genetics, 2019.
11) Lea W A, Parnell S C, Wallace D P, et al. Human-Specific Abnormal Alternative Splicing of Wild-Type PKD1 Induces Premature Termination of Polycystin-1[J]. Journal of The American Society of Nephrology, 2018, 29(10): 2482-2492.
12) Li R, Ren X, Ding Q, et al. Direct full-length RNA sequencing reveals unexpected transcriptome complexity during Caenorhabditis elegans development[J]. Genome Research, 2020, 30(2): 287-298.
13) Ono H, Yoshida M. Direct RNA sequencing approach to compare non-model mitochondrial transcriptomes: an application to a cephalopod host and its mesozoan parasite.[J]. Methods, 2020.
14) Panda K, Slotkin R K。 Long-Read cDNA Sequencing Enables a 'Gene-Like' Transcript Annotation of Arabidopsis Transposable Elements[J]。 bioRxiv, 2020。
15) Parker M T, Knop K, Sherwood A, et al. Nanopore direct RNA sequencing maps the complexity of Arabidopsis mRNA processing and m6A modification[J]. eLife, 2020: 1-35
16) Piroon J , Thidathip W , Rui P , et al. Complete genomic and transcriptional landscape analysis using third-generation sequencing: a case study of Saccharomyces cerevisiae CEN. PK113-7D[J]. Nucleic Acids Research(7):7.
17) Roach N P, Sadowski N, Alessi A F, et al. The full-length transcriptome of C. elegans using direct RNA sequencing[J]. bioRxiv, 2019.
18) Sessegolo C, Cruaud C, Silva C D, et al. Transcriptome profiling of mouse samples using nanopore sequencing of cDNA and RNA molecules[J]. Scientific Reports, 2019, 9(1).
19) Tang A D, Soulette C M, Van Baren M J, et al。 Full-length transcript characterization of SF3B1 mutation in chronic lymphocytic leukemia reveals downregulation of retained introns[J]。 bioRxiv, 2018。
20) Workman R E, Tang A D, Tang P S, et al. Nanopore native RNA sequencing of a human poly(A) transcriptome[J]. bioRxiv, 2018.