Publikation: Comparative genomics approach to detecting split-coding regions in a low-coverage genome : lessons from the chimaera Callorhinchus milii (Holocephali, Chondrichthyes)
Dateien
Datum
Autor:innen
Herausgeber:innen
ISSN der Zeitschrift
Electronic ISSN
ISBN
Bibliografische Daten
Verlag
Schriftenreihe
Auflagebezeichnung
URI (zitierfähiger Link)
DOI (zitierfähiger Link)
Internationale Patentnummer
Link zur Lizenz
Angaben zur Forschungsförderung
Projekt
Open Access-Veröffentlichung
Sammlungen
Core Facility der Universität Konstanz
Titel in einer weiteren Sprache
Publikationstyp
Publikationsstatus
Erschienen in
Zusammenfassung
Recent development of deep sequencing technologies has facilitated de novo genome sequencing projects, now conducted even by individual laboratories. However, this will yield more and more genome sequences that are not well assembled, and will hinder thorough annotation when no closely related reference genome is available. One of the challenging issues is the identification of protein-coding sequences split into multiple unassembled genomic segments, which can confound orthology assignment and various laboratory experiments requiring the identification of individual genes. In this study, using the genome of a cartilaginous fish, Callorhinchus milii, as test case, we performed gene prediction using a model specifically trained for this genome. We implemented an algorithm, designated ESPRIT, to identify possible linkages between multiple protein-coding portions derived from a single genomic locus split into multiple unassembled genomic segments. We developed a validation framework based on an artificially fragmented human genome, improvements between early and recent mouse genome assemblies, comparison with experimentally validated sequences from GenBank, and phylogenetic analyses. Our strategy provided insights into practical solutions for efficient annotation of only partially sequenced (low-coverage) genomes. To our knowledge, our study is the first formulation of a method to link unassembled genomic segments based on proteomes of relatively distantly related species as references.
Zusammenfassung in einer weiteren Sprache
Fachgebiet (DDC)
Schlagwörter
Konferenz
Rezension
Zitieren
ISO 690
DESSIMOZ, Christophe, Stefan ZOLLER, Tereza MANOUSAKI, Huan QIU, Axel MEYER, Shigehiro KURAKU, 2011. Comparative genomics approach to detecting split-coding regions in a low-coverage genome : lessons from the chimaera Callorhinchus milii (Holocephali, Chondrichthyes). In: Briefings in Bioinformatics. 2011, 12(5), pp. 474-484. ISSN 1467-5463. eISSN 1477-4054. Available under: doi: 10.1093/bib/bbr038BibTex
@article{Dessimoz2011-09Compa-19455, year={2011}, doi={10.1093/bib/bbr038}, title={Comparative genomics approach to detecting split-coding regions in a low-coverage genome : lessons from the chimaera Callorhinchus milii (Holocephali, Chondrichthyes)}, number={5}, volume={12}, issn={1467-5463}, journal={Briefings in Bioinformatics}, pages={474--484}, author={Dessimoz, Christophe and Zoller, Stefan and Manousaki, Tereza and Qiu, Huan and Meyer, Axel and Kuraku, Shigehiro} }
RDF
<rdf:RDF xmlns:dcterms="http://purl.org/dc/terms/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:bibo="http://purl.org/ontology/bibo/" xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#" xmlns:foaf="http://xmlns.com/foaf/0.1/" xmlns:void="http://rdfs.org/ns/void#" xmlns:xsd="http://www.w3.org/2001/XMLSchema#" > <rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/19455"> <dc:contributor>Dessimoz, Christophe</dc:contributor> <dc:creator>Manousaki, Tereza</dc:creator> <dc:creator>Qiu, Huan</dc:creator> <dc:contributor>Zoller, Stefan</dc:contributor> <foaf:homepage rdf:resource="http://localhost:8080/"/> <dc:creator>Meyer, Axel</dc:creator> <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/> <bibo:uri rdf:resource="http://kops.uni-konstanz.de/handle/123456789/19455"/> <dc:contributor>Kuraku, Shigehiro</dc:contributor> <dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2012-06-13T09:42:33Z</dc:date> <dcterms:hasPart rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/19455/2/Dessimoz_194550.pdf"/> <dcterms:bibliographicCitation>Briefings in Bioinformatics ; 12 (2011), 5. - S. 474-484</dcterms:bibliographicCitation> <dc:creator>Zoller, Stefan</dc:creator> <dc:language>eng</dc:language> <dcterms:title>Comparative genomics approach to detecting split-coding regions in a low-coverage genome : lessons from the chimaera Callorhinchus milii (Holocephali, Chondrichthyes)</dcterms:title> <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/28"/> <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/28"/> <dcterms:abstract xml:lang="eng">Recent development of deep sequencing technologies has facilitated de novo genome sequencing projects, now conducted even by individual laboratories. However, this will yield more and more genome sequences that are not well assembled, and will hinder thorough annotation when no closely related reference genome is available. One of the challenging issues is the identification of protein-coding sequences split into multiple unassembled genomic segments, which can confound orthology assignment and various laboratory experiments requiring the identification of individual genes. In this study, using the genome of a cartilaginous fish, Callorhinchus milii, as test case, we performed gene prediction using a model specifically trained for this genome. We implemented an algorithm, designated ESPRIT, to identify possible linkages between multiple protein-coding portions derived from a single genomic locus split into multiple unassembled genomic segments. We developed a validation framework based on an artificially fragmented human genome, improvements between early and recent mouse genome assemblies, comparison with experimentally validated sequences from GenBank, and phylogenetic analyses. Our strategy provided insights into practical solutions for efficient annotation of only partially sequenced (low-coverage) genomes. To our knowledge, our study is the first formulation of a method to link unassembled genomic segments based on proteomes of relatively distantly related species as references.</dcterms:abstract> <dcterms:issued>2011-09</dcterms:issued> <dspace:hasBitstream rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/19455/2/Dessimoz_194550.pdf"/> <dc:rights>terms-of-use</dc:rights> <dc:contributor>Meyer, Axel</dc:contributor> <dc:contributor>Manousaki, Tereza</dc:contributor> <dc:creator>Kuraku, Shigehiro</dc:creator> <dcterms:rights rdf:resource="https://rightsstatements.org/page/InC/1.0/"/> <dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2012-06-13T09:42:33Z</dcterms:available> <dc:contributor>Qiu, Huan</dc:contributor> <dc:creator>Dessimoz, Christophe</dc:creator> </rdf:Description> </rdf:RDF>