Publication: A Two-Step Clustering for 3-D Gene Expression Data Reveals the Main Features of the Arabidopsis Stress Response
Abstract
We developed an integrative approach for discovering gene modules, i.e. sets of genes that are tightly correlated under several experimental conditions, and applied it to a three-dimensional Arabidopsis thaliana microarray dataset. The dataset consists of approximately 23,000 genes responding to 9 abiotic stress conditions at 6-9 different points in time. Our approach aims at finding relatively small and dense modules that lend themselves to a specific biological interpretation. To detect gene modules within this dataset, we employ a two-step clustering process. In the first step, a k-means clustering is performed on one condition; in the second step, its result is used as a seed for clustering the remaining conditions. To validate the significance of the obtained modules, we performed a permutation analysis and compared the module scores against the resulting null hypothesis, which provides a p-value for each module. Significant modules were mapped to the Gene Ontology (GO) in order to determine the participating biological processes.
As a result, we isolated modules showing high significance with respect to the p-values obtained by permutation analysis and GO mapping. In these modules we identified a number of genes that are either part of a general stress response with similar characteristics under different conditions (coherent modules), or part of a more specific stress response to a single stress condition (single response modules). We also found genes that cluster within several conditions but are not part of a coherent module. These genes have a distinct temporal response under each condition. We call the modules that contain them individual response (IR) modules.
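The two-step clustering and the permutation test described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify how the second step uses the seed clustering, how a module is scored, or how the null model is built. The sketch assumes that each seed cluster is refined by a further k-means in every remaining condition, that a module's score is its mean pairwise Pearson correlation averaged over conditions, and that the null model draws random gene sets of the same size. All function names, parameters (`k1`, `k2`, `n_perm`) and the toy data are hypothetical.

```python
import random
from itertools import combinations

def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5 if sxx > 0 and syy > 0 else 0.0

def kmeans(profiles, genes, k, rng, iters=20):
    """Plain Lloyd k-means on the time profiles of `genes`; returns non-empty gene clusters."""
    k = min(k, len(genes))
    centers = [list(profiles[g]) for g in rng.sample(genes, k)]
    clusters = []
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for g in genes:
            dists = [sum((a - b) ** 2 for a, b in zip(profiles[g], c)) for c in centers]
            clusters[dists.index(min(dists))].append(g)
        for i, members in enumerate(clusters):
            if members:  # keep the old center if a cluster ran empty
                cols = zip(*(profiles[g] for g in members))
                centers[i] = [sum(col) / len(members) for col in cols]
    return [c for c in clusters if c]

def two_step_modules(data, seed_cond, k1, k2, rng):
    """Step 1: k-means on the seed condition.
    Step 2 (assumed): refine every seed cluster by k-means in each other condition."""
    modules = kmeans(data[seed_cond], sorted(data[seed_cond]), k1, rng)
    for cond in data:
        if cond != seed_cond:
            modules = [sub for m in modules for sub in kmeans(data[cond], m, k2, rng)]
    return modules

def module_score(data, module):
    """Assumed score: mean pairwise Pearson correlation, averaged over conditions."""
    pairs = list(combinations(module, 2))
    if not pairs:
        return 0.0
    per_cond = [sum(pearson(p[a], p[b]) for a, b in pairs) / len(pairs)
                for p in data.values()]
    return sum(per_cond) / len(per_cond)

def permutation_p_value(data, module, all_genes, rng, n_perm=200):
    """Fraction of random same-size gene sets scoring at least as high as the module."""
    observed = module_score(data, module)
    hits = sum(module_score(data, rng.sample(all_genes, len(module))) >= observed
               for _ in range(n_perm))
    return (hits + 1) / (n_perm + 1)  # add-one correction avoids p = 0

# Toy data (made up): 2 conditions, 8 genes, 6 and 7 time points.
rng = random.Random(0)
toy = {
    "cold": {f"g{i}": [(t if i < 4 else -t) + rng.gauss(0, 0.3) for t in range(6)]
             for i in range(8)},
    "heat": {f"g{i}": [(t if i < 4 else -t) + rng.gauss(0, 0.3) for t in range(7)]
             for i in range(8)},
}
genes = sorted(toy["cold"])
for module in two_step_modules(toy, "cold", k1=2, k2=2, rng=rng):
    print(module, permutation_p_value(toy, module, genes, rng))
```

Because every refinement only splits existing clusters, the modules always form a partition of the genes in the seed condition. Single-gene modules receive a score of 0.0 under this assumed scoring and therefore a p-value of 1.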
Cite
ISO 690
STRAUCH, Martin, Jochen SUPPER, Christian SPIETH, Dierk WANKE, Joachim KILIAN, Klaus HARTER, Andreas ZELL, 2007. A Two-Step Clustering for 3-D Gene Expression Data Reveals the Main Features of the Arabidopsis Stress Response. In: Journal of Integrative Bioinformatics. 2007, 4(1), 54. Available under: doi: 10.2390/biecoll-jib-2007-54
BibTeX
@article{Strauch2007TwoSt-8734, year={2007}, doi={10.2390/biecoll-jib-2007-54}, title={A Two-Step Clustering for 3-D Gene Expression Data Reveals the Main Features of the Arabidopsis Stress Response}, number={1}, volume={4}, journal={Journal of Integrative Bioinformatics}, author={Strauch, Martin and Supper, Jochen and Spieth, Christian and Wanke, Dierk and Kilian, Joachim and Harter, Klaus and Zell, Andreas}, note={Article Number: 54} }
RDF
<rdf:RDF xmlns:dcterms="http://purl.org/dc/terms/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:bibo="http://purl.org/ontology/bibo/" xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#" xmlns:foaf="http://xmlns.com/foaf/0.1/" xmlns:void="http://rdfs.org/ns/void#" xmlns:xsd="http://www.w3.org/2001/XMLSchema#" > <rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/8734"> <dc:creator>Wanke, Dierk</dc:creator> <dc:contributor>Spieth, Christian</dc:contributor> <dspace:hasBitstream rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/8734/1/jib_54.pdf"/> <dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2011-03-24T17:46:04Z</dcterms:available> <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/28"/> <dc:contributor>Strauch, Martin</dc:contributor> <dc:contributor>Kilian, Joachim</dc:contributor> <dc:rights>terms-of-use</dc:rights> <dcterms:rights rdf:resource="https://rightsstatements.org/page/InC/1.0/"/> <dcterms:issued>2007</dcterms:issued> <dcterms:abstract xml:lang="eng">We developed an integrative approach for discovering gene modules, i.e. genes that are tightly correlated under several experimental conditions and applied it to a threedimensional Arabidopsis thaliana microarray dataset. The dataset consists of approximately 23000 genes responding to 9 abiotic stress conditions at 6-9 different points in time. Our approach aims at finding relatively small and dense modules lending themselves to a specific biological interpretation. In order to detect gene modules within this dataset, we employ a two-step clustering process. In the first step, a k-means clustering on one condition is performed, which is subsequently used in the second step as a seed for the clustering of the remaining conditions. 
To validate the significance of the obtained modules, we performed a permutation analysis and determined a null hypothesis to compare the module scores against, providing a p-value for each module. Significant modules were mapped to the Gene Ontology (GO) in order to determine the participating biological processes.<br />As a result, we isolated modules showing high significance with respect to the p-values obtained by permutation analysis and GO mapping. In these modules we identified a number of genes that are either part of a general stress response with similar characteristics under different conditions (coherent modules), or part of a more specific stress response to a single stress condition (single response modules). We also found genes clustering within several conditions, which are, however, not part of a coherent module. These genes have a distinct temporal response under each condition. We call the modules they are contained in individual response modules (IR).</dcterms:abstract> <dc:format>application/pdf</dc:format> <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/> <dc:creator>Strauch, Martin</dc:creator> <dc:creator>Harter, Klaus</dc:creator> <dc:creator>Supper, Jochen</dc:creator> <foaf:homepage rdf:resource="http://localhost:8080/"/> <dc:contributor>Harter, Klaus</dc:contributor> <dc:creator>Kilian, Joachim</dc:creator> <dc:creator>Zell, Andreas</dc:creator> <bibo:uri rdf:resource="http://kops.uni-konstanz.de/handle/123456789/8734"/> <dcterms:bibliographicCitation>First publ. 
in: Journal of Integrative Bioinformatics 4 (2007), 1, 54</dcterms:bibliographicCitation> <dcterms:hasPart rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/8734/1/jib_54.pdf"/> <dc:contributor>Supper, Jochen</dc:contributor> <dc:contributor>Zell, Andreas</dc:contributor> <dc:creator>Spieth, Christian</dc:creator> <dc:contributor>Wanke, Dierk</dc:contributor> <dcterms:title>A Two-Step Clustering for 3-D Gene Expression Data Reveals the Main Features of the Arabidopsis Stress Response</dcterms:title> <dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2011-03-24T17:46:04Z</dc:date> <dc:language>eng</dc:language> <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/28"/> </rdf:Description> </rdf:RDF>