Comparative genomics approach to detecting split-coding regions in a low-coverage genome : lessons from the chimaera Callorhinchus milii (Holocephali, Chondrichthyes)

dc.contributor.author: Dessimoz, Christophe [deu]
dc.contributor.author: Zoller, Stefan [deu]
dc.contributor.author: Manousaki, Tereza
dc.contributor.author: Qiu, Huan [deu]
dc.contributor.author: Meyer, Axel
dc.contributor.author: Kuraku, Shigehiro
dc.date.accessioned: 2012-06-13T09:42:33Z [deu]
dc.date.available: 2012-06-13T09:42:33Z [deu]
dc.date.issued: 2011-09
dc.description.abstract: Recent development of deep sequencing technologies has facilitated de novo genome sequencing projects, now conducted even by individual laboratories. However, this will yield more and more genome sequences that are not well assembled, and will hinder thorough annotation when no closely related reference genome is available. One of the challenging issues is the identification of protein-coding sequences split into multiple unassembled genomic segments, which can confound orthology assignment and various laboratory experiments requiring the identification of individual genes. In this study, using the genome of a cartilaginous fish, Callorhinchus milii, as test case, we performed gene prediction using a model specifically trained for this genome. We implemented an algorithm, designated ESPRIT, to identify possible linkages between multiple protein-coding portions derived from a single genomic locus split into multiple unassembled genomic segments. We developed a validation framework based on an artificially fragmented human genome, improvements between early and recent mouse genome assemblies, comparison with experimentally validated sequences from GenBank, and phylogenetic analyses. Our strategy provided insights into practical solutions for efficient annotation of only partially sequenced (low-coverage) genomes. To our knowledge, our study is the first formulation of a method to link unassembled genomic segments based on proteomes of relatively distantly related species as references. [eng]
dc.description.version: published
dc.identifier.citation: Briefings in Bioinformatics ; 12 (2011), 5. - S. 474-484 [deu]
dc.identifier.doi: 10.1093/bib/bbr038 [deu]
dc.identifier.pmid: 21712341
dc.identifier.ppn: 408154721 [deu]
dc.identifier.uri: http://kops.uni-konstanz.de/handle/123456789/19455
dc.language.iso: eng [deu]
dc.legacy.dateIssued: 2012-06-13 [deu]
dc.rights: terms-of-use [deu]
dc.rights.uri: https://rightsstatements.org/page/InC/1.0/ [deu]
dc.subject: Chondrichthyes [deu]
dc.subject: trained gene prediction [deu]
dc.subject: next generation sequencing [deu]
dc.subject: genome assembly [deu]
dc.subject: orthology [deu]
dc.subject.ddc: 570 [deu]
dc.title: Comparative genomics approach to detecting split-coding regions in a low-coverage genome : lessons from the chimaera Callorhinchus milii (Holocephali, Chondrichthyes) [eng]
dc.type: JOURNAL_ARTICLE [deu]
dspace.entity.type: Publication
kops.citation.bibtex:
@article{Dessimoz2011-09Compa-19455,
  year={2011},
  doi={10.1093/bib/bbr038},
  title={Comparative genomics approach to detecting split-coding regions in a low-coverage genome : lessons from the chimaera Callorhinchus milii (Holocephali, Chondrichthyes)},
  number={5},
  volume={12},
  issn={1467-5463},
  journal={Briefings in Bioinformatics},
  pages={474--484},
  author={Dessimoz, Christophe and Zoller, Stefan and Manousaki, Tereza and Qiu, Huan and Meyer, Axel and Kuraku, Shigehiro}
}
kops.citation.iso690: DESSIMOZ, Christophe, Stefan ZOLLER, Tereza MANOUSAKI, Huan QIU, Axel MEYER, Shigehiro KURAKU, 2011. Comparative genomics approach to detecting split-coding regions in a low-coverage genome : lessons from the chimaera Callorhinchus milii (Holocephali, Chondrichthyes). In: Briefings in Bioinformatics. 2011, 12(5), pp. 474-484. ISSN 1467-5463. eISSN 1477-4054. Available under: doi: 10.1093/bib/bbr038 [deu, eng]
kops.citation.rdf:
<rdf:RDF
    xmlns:dcterms="http://purl.org/dc/terms/"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:bibo="http://purl.org/ontology/bibo/"
    xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#"
    xmlns:foaf="http://xmlns.com/foaf/0.1/"
    xmlns:void="http://rdfs.org/ns/void#"
    xmlns:xsd="http://www.w3.org/2001/XMLSchema#" > 
  <rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/19455">
    <dc:contributor>Dessimoz, Christophe</dc:contributor>
    <dc:creator>Manousaki, Tereza</dc:creator>
    <dc:creator>Qiu, Huan</dc:creator>
    <dc:contributor>Zoller, Stefan</dc:contributor>
    <foaf:homepage rdf:resource="http://localhost:8080/"/>
    <dc:creator>Meyer, Axel</dc:creator>
    <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/>
    <bibo:uri rdf:resource="http://kops.uni-konstanz.de/handle/123456789/19455"/>
    <dc:contributor>Kuraku, Shigehiro</dc:contributor>
    <dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2012-06-13T09:42:33Z</dc:date>
    <dcterms:hasPart rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/19455/2/Dessimoz_194550.pdf"/>
    <dcterms:bibliographicCitation>Briefings in Bioinformatics ; 12 (2011), 5. - S. 474-484</dcterms:bibliographicCitation>
    <dc:creator>Zoller, Stefan</dc:creator>
    <dc:language>eng</dc:language>
    <dcterms:title>Comparative genomics approach to detecting split-coding regions in a low-coverage genome : lessons from the chimaera Callorhinchus milii (Holocephali, Chondrichthyes)</dcterms:title>
    <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/28"/>
    <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/28"/>
    <dcterms:abstract xml:lang="eng">Recent development of deep sequencing technologies has facilitated de novo genome sequencing projects, now conducted even by individual laboratories. However, this will yield more and more genome sequences that are not well assembled, and will hinder thorough annotation when no closely related reference genome is available. One of the challenging issues is the identification of protein-coding sequences split into multiple unassembled genomic segments, which can confound orthology assignment and various laboratory experiments requiring the identification of individual genes. In this study, using the genome of a cartilaginous fish, Callorhinchus milii, as test case, we performed gene prediction using a model specifically trained for this genome. We implemented an algorithm, designated ESPRIT, to identify possible linkages between multiple protein-coding portions derived from a single genomic locus split into multiple unassembled genomic segments. We developed a validation framework based on an artificially fragmented human genome, improvements between early and recent mouse genome assemblies, comparison with experimentally validated sequences from GenBank, and phylogenetic analyses. Our strategy provided insights into practical solutions for efficient annotation of only partially sequenced (low-coverage) genomes. To our knowledge, our study is the first formulation of a method to link unassembled genomic segments based on proteomes of relatively distantly related species as references.</dcterms:abstract>
    <dcterms:issued>2011-09</dcterms:issued>
    <dspace:hasBitstream rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/19455/2/Dessimoz_194550.pdf"/>
    <dc:rights>terms-of-use</dc:rights>
    <dc:contributor>Meyer, Axel</dc:contributor>
    <dc:contributor>Manousaki, Tereza</dc:contributor>
    <dc:creator>Kuraku, Shigehiro</dc:creator>
    <dcterms:rights rdf:resource="https://rightsstatements.org/page/InC/1.0/"/>
    <dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2012-06-13T09:42:33Z</dcterms:available>
    <dc:contributor>Qiu, Huan</dc:contributor>
    <dc:creator>Dessimoz, Christophe</dc:creator>
  </rdf:Description>
</rdf:RDF>
kops.description.openAccess: openaccessgreen
kops.flag.knbibliography: true
kops.identifier.nbn: urn:nbn:de:bsz:352-194550 [deu]
kops.sourcefield: Briefings in Bioinformatics. 2011, <b>12</b>(5), pp. 474-484. ISSN 1467-5463. eISSN 1477-4054. Available under: doi: 10.1093/bib/bbr038 [deu]
kops.sourcefield.plain: Briefings in Bioinformatics. 2011, 12(5), pp. 474-484. ISSN 1467-5463. eISSN 1477-4054. Available under: doi: 10.1093/bib/bbr038 [deu, eng]
kops.submitter.email: oleg.kozlov@uni-konstanz.de [deu]
relation.isAuthorOfPublication: 6edcfc22-c686-4e9d-a0e4-291d61adb1ff
relation.isAuthorOfPublication: 77c33793-52cc-44a7-9936-fec7d6e8d15c
relation.isAuthorOfPublication: 11ee28a4-06fc-4447-a5fb-95a044b3895a
relation.isAuthorOfPublication.latestForDiscovery: 6edcfc22-c686-4e9d-a0e4-291d61adb1ff
source.bibliographicInfo.fromPage: 474
source.bibliographicInfo.issue: 5
source.bibliographicInfo.toPage: 484
source.bibliographicInfo.volume: 12
source.identifier.eissn: 1477-4054
source.identifier.issn: 1467-5463
source.periodicalTitle: Briefings in Bioinformatics

Files

Original bundle

Name: Dessimoz_194550.pdf
Size: 304.71 KB
Format: Adobe Portable Document Format
Downloads: 407