Publikation:

Towards automatic detecting of overlapping genes : clustered BLAST analysis of viral genomes

Lade...
Vorschaubild

Dateien

Zu diesem Dokument gibt es keine Dateien.

Datum

2010

Autor:innen

Neuhaus, Klaus
Fürst, David
Scherer, Siegfried

Herausgeber:innen

Kontakt

ISSN der Zeitschrift

Electronic ISSN

ISBN

Bibliografische Daten

Verlag

Schriftenreihe

Auflagebezeichnung

ArXiv-ID

Internationale Patentnummer

Angaben zur Forschungsförderung

Projekt

Open Access-Veröffentlichung
Open Access Green
Core Facility der Universität Konstanz

Gesperrt bis

Titel in einer weiteren Sprache

Publikationstyp
Beitrag zu einem Konferenzband
Publikationsstatus
Published

Erschienen in

PIZZUTI, Clara, ed., Marylyn D. RITCHIE, ed., Mario GIACOBINI, ed.. Evolutionary Computation, Machine Learning and Data Mining in Bioinformatics. Berlin, Heidelberg: Springer Berlin Heidelberg, 2010, pp. 228-239. Lecture Notes in Computer Science. 6023. ISBN 978-3-642-12210-1. Available under: doi: 10.1007/978-3-642-12211-8_20

Zusammenfassung

Overlapping genes (encoded on the same DNA locus but in different frames) are thought to be rare and, therefore, were largely neglected in the past. In a test set of 800 viruses we found more than 350 potential overlapping open reading frames of >500 bp which generate BLAST hits, indicating a possible biological function. Interestingly, five overlaps with more than 2000 bp were found, the largest may even contain triple overlaps. In order to perform the vast amount of BLAST searches required to test all detected open reading frames, we compared two clustering strategies (BLASTCLUST and k-means) and queried the database with one representative only. Our results show that this approach achieves a significant speed-up while retaining a high quality of the results (>99% precision compared to single queries) for both clustering methods. Future wet lab experiments are needed to show whether the detected overlapping reading frames are biologically functional.

Zusammenfassung in einer weiteren Sprache

Fachgebiet (DDC)
004 Informatik

Schlagwörter

overlapping genes, clustering, BLAST analysis

Konferenz

Rezension
undefined / . - undefined, undefined

Forschungsvorhaben

Organisationseinheiten

Zeitschriftenheft

Zugehörige Datensätze in KOPS

Zitieren

ISO 690NEUHAUS, Klaus, Daniela OELKE, David FÜRST, Siegfried SCHERER, Daniel A. KEIM, 2010. Towards automatic detecting of overlapping genes : clustered BLAST analysis of viral genomes. In: PIZZUTI, Clara, ed., Marylyn D. RITCHIE, ed., Mario GIACOBINI, ed.. Evolutionary Computation, Machine Learning and Data Mining in Bioinformatics. Berlin, Heidelberg: Springer Berlin Heidelberg, 2010, pp. 228-239. Lecture Notes in Computer Science. 6023. ISBN 978-3-642-12210-1. Available under: doi: 10.1007/978-3-642-12211-8_20
BibTex
@inproceedings{Neuhaus2010Towar-12732,
  year={2010},
  doi={10.1007/978-3-642-12211-8_20},
  title={Towards automatic detecting of overlapping genes : clustered BLAST analysis of viral genomes},
  number={6023},
  isbn={978-3-642-12210-1},
  publisher={Springer Berlin Heidelberg},
  address={Berlin, Heidelberg},
  series={Lecture Notes in Computer Science},
  booktitle={Evolutionary Computation, Machine Learning and Data Mining in Bioinformatics},
  pages={228--239},
  editor={Pizzuti, Clara and Ritchie, Marylyn D. and Giacobini, Mario},
  author={Neuhaus, Klaus and Oelke, Daniela and Fürst, David and Scherer, Siegfried and Keim, Daniel A.}
}
RDF
<rdf:RDF
    xmlns:dcterms="http://purl.org/dc/terms/"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:bibo="http://purl.org/ontology/bibo/"
    xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#"
    xmlns:foaf="http://xmlns.com/foaf/0.1/"
    xmlns:void="http://rdfs.org/ns/void#"
    xmlns:xsd="http://www.w3.org/2001/XMLSchema#" > 
  <rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/12732">
    <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/>
    <dcterms:rights rdf:resource="https://rightsstatements.org/page/InC/1.0/"/>
    <dspace:hasBitstream rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/12732/1/Mansmannpdf.pdf"/>
    <dc:creator>Fürst, David</dc:creator>
    <dc:creator>Scherer, Siegfried</dc:creator>
    <dc:contributor>Scherer, Siegfried</dc:contributor>
    <dcterms:title>Towards automatic detecting of overlapping genes : clustered BLAST analysis of viral genomes</dcterms:title>
    <foaf:homepage rdf:resource="http://localhost:8080/"/>
    <dc:contributor>Oelke, Daniela</dc:contributor>
    <dc:creator>Oelke, Daniela</dc:creator>
    <dc:language>eng</dc:language>
    <dc:rights>terms-of-use</dc:rights>
    <dc:contributor>Fürst, David</dc:contributor>
    <bibo:uri rdf:resource="http://kops.uni-konstanz.de/handle/123456789/12732"/>
    <dc:contributor>Neuhaus, Klaus</dc:contributor>
    <dcterms:issued>2010</dcterms:issued>
    <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/>
    <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/>
    <dcterms:hasPart rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/12732/1/Mansmannpdf.pdf"/>
    <dc:contributor>Keim, Daniel A.</dc:contributor>
    <dcterms:bibliographicCitation>First publ. in: Evolutionary Computation, Machine Learning and Data Mining in Bioinformatics : 8th European conference, EvoBIO 2010, Istanbul, Turkey, April 7 - 9, 2010 ; proceedings / Clara Pizzuti... (Eds.). - Berlin : Springer, 2010. - pp. 228-239. - (Lecture Notes in Computer Science ; 6023). - ISBN 978-3-642-12210-1</dcterms:bibliographicCitation>
    <dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2011-07-12T08:24:53Z</dcterms:available>
    <dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2011-07-12T08:24:53Z</dc:date>
    <dc:creator>Keim, Daniel A.</dc:creator>
    <dc:creator>Neuhaus, Klaus</dc:creator>
    <dcterms:abstract xml:lang="eng">Overlapping genes (encoded on the same DNA locus but in different frames) are thought to be rare and, therefore, were largely neglected in the past. In a test set of 800 viruses we found more than 350 potential overlapping open reading frames of &gt;500 bp which generate BLAST hits, indicating a possible biological function. Interestingly, five overlaps with more than 2000 bp were found, the largest may even contain triple overlaps. In order to perform the vast amount of BLAST searches required to test all detected open reading frames, we compared two clustering strategies (BLASTCLUST and k-means) and queried the database with one representative only. Our results show that this approach achieves a significant speed-up while retaining a high quality of the results (&gt;99% precision compared to single queries) for both clustering methods. Future wet lab experiments are needed to show whether the detected overlapping reading frames are biologically functional.</dcterms:abstract>
  </rdf:Description>
</rdf:RDF>

Interner Vermerk

xmlui.Submission.submit.DescribeStep.inputForms.label.kops_note_fromSubmitter

Kontakt
URL der Originalveröffentl.

Prüfdatum der URL

Prüfungsdatum der Dissertation

Finanzierungsart

Kommentar zur Publikation

Allianzlizenz
Corresponding Authors der Uni Konstanz vorhanden
Internationale Co-Autor:innen
Universitätsbibliographie
Ja
Begutachtet
Diese Publikation teilen