Comparative assessment of methods for the computational inference of transcript isoform abundance from RNA-seq data
Comparative assessment of methods for the computational inference of transcript isoform abundance from RNA-seq data
Loading...
Date
2015
Authors
Editors
Journal ISSN
Electronic ISSN
ISBN
Bibliographical data
Publisher
Series
URI (citable link)
DOI (citable link)
International patent number
Link to the license
EU project number
Project
Open Access publication
Collections
Title in another language
Publication type
Journal article
Publication status
Published
Published in
Genome Biology ; 16 (2015), 1. - 150. - BioMed Central. - ISSN 1465-6906. - eISSN 1465-6914
Abstract
Background
Understanding the regulation of gene expression, including transcription start site usage, alternative splicing, and polyadenylation, requires accurate quantification of expression levels down to the level of individual transcript isoforms. To comparatively evaluate the accuracy of the many methods that have been proposed for estimating transcript isoform abundance from RNA sequencing data, we have used both synthetic data as well as an independent experimental method for quantifying the abundance of transcript ends at the genome-wide level.
Results
We found that many tools have good accuracy and yield better estimates of gene-level expression compared to commonly used count-based approaches, but they vary widely in memory and runtime requirements. Nucleotide composition and intron/exon structure have comparatively little influence on the accuracy of expression estimates, which correlates most strongly with transcript/gene expression levels. To facilitate the reproduction and further extension of our study, we provide datasets, source code, and an online analysis tool on a companion website, where developers can upload expression estimates obtained with their own tool to compare them to those inferred by the methods assessed here.
Conclusions
As many methods for quantifying isoform abundance with comparable accuracy are available, a user’s choice will likely be determined by factors such as the memory and runtime requirements, as well as the availability of methods for downstream analyses. Sequencing-based methods to quantify the abundance of specific transcript regions could complement validation schemes based on synthetic data and quantitative PCR in future or ongoing assessments of RNA-seq analysis methods.
Understanding the regulation of gene expression, including transcription start site usage, alternative splicing, and polyadenylation, requires accurate quantification of expression levels down to the level of individual transcript isoforms. To comparatively evaluate the accuracy of the many methods that have been proposed for estimating transcript isoform abundance from RNA sequencing data, we have used both synthetic data as well as an independent experimental method for quantifying the abundance of transcript ends at the genome-wide level.
Results
We found that many tools have good accuracy and yield better estimates of gene-level expression compared to commonly used count-based approaches, but they vary widely in memory and runtime requirements. Nucleotide composition and intron/exon structure have comparatively little influence on the accuracy of expression estimates, which correlates most strongly with transcript/gene expression levels. To facilitate the reproduction and further extension of our study, we provide datasets, source code, and an online analysis tool on a companion website, where developers can upload expression estimates obtained with their own tool to compare them to those inferred by the methods assessed here.
Conclusions
As many methods for quantifying isoform abundance with comparable accuracy are available, a user’s choice will likely be determined by factors such as the memory and runtime requirements, as well as the availability of methods for downstream analyses. Sequencing-based methods to quantify the abundance of specific transcript regions could complement validation schemes based on synthetic data and quantitative PCR in future or ongoing assessments of RNA-seq analysis methods.
Summary in another language
Subject (DDC)
570 Biosciences, Biology
Keywords
Bias Correction, Differential Expression Analysis, Expression Estimate, Read Alignment, Alignment File
Conference
Review
undefined / . - undefined, undefined. - (undefined; undefined)
Cite This
ISO 690
KANITZ, Alexander, Foivos GYPAS, Andreas J. GRUBER, Andreas R. GRUBER, Georges MARTIN, Mihaela ZAVOLAN, 2015. Comparative assessment of methods for the computational inference of transcript isoform abundance from RNA-seq data. In: Genome Biology. BioMed Central. 16(1), 150. ISSN 1465-6906. eISSN 1465-6914. Available under: doi: 10.1186/s13059-015-0702-5BibTex
@article{Kanitz2015-12Compa-50944, year={2015}, doi={10.1186/s13059-015-0702-5}, title={Comparative assessment of methods for the computational inference of transcript isoform abundance from RNA-seq data}, number={1}, volume={16}, issn={1465-6906}, journal={Genome Biology}, author={Kanitz, Alexander and Gypas, Foivos and Gruber, Andreas J. and Gruber, Andreas R. and Martin, Georges and Zavolan, Mihaela}, note={Article Number: 150} }
RDF
<rdf:RDF xmlns:dcterms="http://purl.org/dc/terms/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:bibo="http://purl.org/ontology/bibo/" xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#" xmlns:foaf="http://xmlns.com/foaf/0.1/" xmlns:void="http://rdfs.org/ns/void#" xmlns:xsd="http://www.w3.org/2001/XMLSchema#" > <rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/50944"> <dc:creator>Gruber, Andreas R.</dc:creator> <dcterms:abstract xml:lang="eng">Background<br />Understanding the regulation of gene expression, including transcription start site usage, alternative splicing, and polyadenylation, requires accurate quantification of expression levels down to the level of individual transcript isoforms. To comparatively evaluate the accuracy of the many methods that have been proposed for estimating transcript isoform abundance from RNA sequencing data, we have used both synthetic data as well as an independent experimental method for quantifying the abundance of transcript ends at the genome-wide level.<br /><br />Results<br />We found that many tools have good accuracy and yield better estimates of gene-level expression compared to commonly used count-based approaches, but they vary widely in memory and runtime requirements. Nucleotide composition and intron/exon structure have comparatively little influence on the accuracy of expression estimates, which correlates most strongly with transcript/gene expression levels. To facilitate the reproduction and further extension of our study, we provide datasets, source code, and an online analysis tool on a companion website, where developers can upload expression estimates obtained with their own tool to compare them to those inferred by the methods assessed here.<br /><br />Conclusions<br />As many methods for quantifying isoform abundance with comparable accuracy are available, a user’s choice will likely be determined by factors such as the memory and runtime requirements, as well as the availability of methods for downstream analyses. Sequencing-based methods to quantify the abundance of specific transcript regions could complement validation schemes based on synthetic data and quantitative PCR in future or ongoing assessments of RNA-seq analysis methods.</dcterms:abstract> <dc:contributor>Gruber, Andreas J.</dc:contributor> <dc:contributor>Zavolan, Mihaela</dc:contributor> <dcterms:rights rdf:resource="http://creativecommons.org/licenses/by/4.0/"/> <dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2020-09-22T08:16:11Z</dc:date> <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/28"/> <dcterms:title>Comparative assessment of methods for the computational inference of transcript isoform abundance from RNA-seq data</dcterms:title> <dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2020-09-22T08:16:11Z</dcterms:available> <dc:contributor>Kanitz, Alexander</dc:contributor> <dcterms:issued>2015-12</dcterms:issued> <dc:contributor>Martin, Georges</dc:contributor> <dc:contributor>Gruber, Andreas R.</dc:contributor> <foaf:homepage rdf:resource="http://localhost:8080/"/> <dcterms:hasPart rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/50944/3/Gruber_2-eh7kcs5pe3pa6.pdf"/> <dc:language>eng</dc:language> <dspace:hasBitstream rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/50944/3/Gruber_2-eh7kcs5pe3pa6.pdf"/> <dc:creator>Zavolan, Mihaela</dc:creator> <dc:rights>Attribution 4.0 International</dc:rights> <dc:contributor>Gypas, Foivos</dc:contributor> <bibo:uri rdf:resource="https://kops.uni-konstanz.de/handle/123456789/50944"/> <dc:creator>Kanitz, Alexander</dc:creator> <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/> <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/28"/> <dc:creator>Gruber, Andreas J.</dc:creator> <dc:creator>Gypas, Foivos</dc:creator> <dc:creator>Martin, Georges</dc:creator> </rdf:Description> </rdf:RDF>
Internal note
xmlui.Submission.submit.DescribeStep.inputForms.label.kops_note_fromSubmitter
Examination date of dissertation
Method of financing
Comment on publication
Alliance license
Corresponding Authors der Uni Konstanz vorhanden
International Co-Authors
Bibliography of Konstanz
Refereed
Yes