Uncovering novel mutational signatures by de novo extraction with SigProfilerExtractor

dc.contributor.authorIslam, S.M. Ashiqul
dc.contributor.authorDíaz-Gay, Marcos
dc.contributor.authorWu, Yang
dc.contributor.authorBarnes, Mark
dc.contributor.authorVangara, Raviteja
dc.contributor.authorBergstrom, Erik N.
dc.contributor.authorHe, Yudou
dc.contributor.authorVella, Mike
dc.contributor.authorGruber, Andreas J.
dc.contributor.authorAlexandrov, Ludmil B.
dc.date.accessioned2023-03-14T08:02:38Z
dc.date.available2023-03-14T08:02:38Z
dc.date.issued2022
dc.description.abstractMutational signature analysis is commonly performed in cancer genomic studies. Here, we present SigProfilerExtractor, an automated tool for de novo extraction of mutational signatures, and benchmark it against another 13 bioinformatics tools by using 34 scenarios encompassing 2,500 simulated signatures found in 60,000 synthetic genomes and 20,000 synthetic exomes. For simulations with 5% noise, reflecting high-quality datasets, SigProfilerExtractor outperforms other approaches by elucidating between 20% and 50% more true-positive signatures while yielding 5-fold less false-positive signatures. Applying SigProfilerExtractor to 4,643 whole-genome- and 19,184 whole-exome-sequenced cancers reveals four novel signatures. Two of the signatures are confirmed in independent cohorts, and one of these signatures is associated with tobacco smoking. In summary, this report provides a reference tool for analysis of mutational signatures, a comprehensive benchmarking of bioinformatics tools for extracting signatures, and several novel mutational signatures, including one putatively attributed to direct tobacco smoking mutagenesis in bladder tissues.
dc.description.versionpublished
dc.identifier.doi10.1016/j.xgen.2022.100179
dc.identifier.ppn1839091703
dc.identifier.urihttps://kops.uni-konstanz.de/handle/123456789/66400
dc.language.isoeng
dc.rightsAttribution 4.0 International
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subjectmutagenesis
dc.subjectmutational signatures
dc.subjectcancer genomics
dc.subjectgenomics
dc.subject.ddc570
dc.titleUncovering novel mutational signatures by de novo extraction with SigProfilerExtractoreng
dc.typeJOURNAL_ARTICLE
dspace.entity.typePublication
kops.citation.bibtex
@article{Islam2022Uncov-66400,
  year={2022},
  doi={10.1016/j.xgen.2022.100179},
  title={Uncovering novel mutational signatures by de novo extraction with SigProfilerExtractor},
  number={11},
  volume={2},
  journal={Cell Genomics},
  author={Islam, S.M. Ashiqul and Díaz-Gay, Marcos and Wu, Yang and Barnes, Mark and Vangara, Raviteja and Bergstrom, Erik N. and He, Yudou and Vella, Mike and Gruber, Andreas J. and Alexandrov, Ludmil B.},
  note={Article Number: 100179}
}
kops.citation.iso690ISLAM, S.M. Ashiqul, Marcos DÍAZ-GAY, Yang WU, Mark BARNES, Raviteja VANGARA, Erik N. BERGSTROM, Yudou HE, Mike VELLA, Andreas J. GRUBER, Ludmil B. ALEXANDROV, 2022. Uncovering novel mutational signatures by de novo extraction with SigProfilerExtractor. In: Cell Genomics. Elsevier. 2022, 2(11), 100179. eISSN 2666-979X. Available under: doi: 10.1016/j.xgen.2022.100179deu
kops.citation.iso690ISLAM, S.M. Ashiqul, Marcos DÍAZ-GAY, Yang WU, Mark BARNES, Raviteja VANGARA, Erik N. BERGSTROM, Yudou HE, Mike VELLA, Andreas J. GRUBER, Ludmil B. ALEXANDROV, 2022. Uncovering novel mutational signatures by de novo extraction with SigProfilerExtractor. In: Cell Genomics. Elsevier. 2022, 2(11), 100179. eISSN 2666-979X. Available under: doi: 10.1016/j.xgen.2022.100179eng
kops.citation.rdf
<rdf:RDF
    xmlns:dcterms="http://purl.org/dc/terms/"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:bibo="http://purl.org/ontology/bibo/"
    xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#"
    xmlns:foaf="http://xmlns.com/foaf/0.1/"
    xmlns:void="http://rdfs.org/ns/void#"
    xmlns:xsd="http://www.w3.org/2001/XMLSchema#" > 
  <rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/66400">
    <dc:creator>Islam, S.M. Ashiqul</dc:creator>
    <dc:contributor>Barnes, Mark</dc:contributor>
    <dc:creator>Vella, Mike</dc:creator>
    <dc:creator>Díaz-Gay, Marcos</dc:creator>
    <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/>
    <dspace:hasBitstream rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/66400/4/Islam_2-1dbw5dkpf7ikh0.pdf"/>
    <dc:contributor>He, Yudou</dc:contributor>
    <dc:creator>Alexandrov, Ludmil B.</dc:creator>
    <dc:language>eng</dc:language>
    <dc:contributor>Alexandrov, Ludmil B.</dc:contributor>
    <dc:contributor>Vangara, Raviteja</dc:contributor>
    <dc:contributor>Vella, Mike</dc:contributor>
    <dc:creator>Gruber, Andreas J.</dc:creator>
    <dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2023-03-14T08:02:38Z</dcterms:available>
    <dc:contributor>Díaz-Gay, Marcos</dc:contributor>
    <dcterms:issued>2022</dcterms:issued>
    <dcterms:abstract>Mutational signature analysis is commonly performed in cancer genomic studies. Here, we present SigProfilerExtractor, an automated tool for de novo extraction of mutational signatures, and benchmark it against another 13 bioinformatics tools by using 34 scenarios encompassing 2,500 simulated signatures found in 60,000 synthetic genomes and 20,000 synthetic exomes. For simulations with 5% noise, reflecting high-quality datasets, SigProfilerExtractor outperforms other approaches by elucidating between 20% and 50% more true-positive signatures while yielding 5-fold less false-positive signatures. Applying SigProfilerExtractor to 4,643 whole-genome- and 19,184 whole-exome-sequenced cancers reveals four novel signatures. Two of the signatures are confirmed in independent cohorts, and one of these signatures is associated with tobacco smoking. In summary, this report provides a reference tool for analysis of mutational signatures, a comprehensive benchmarking of bioinformatics tools for extracting signatures, and several novel mutational signatures, including one putatively attributed to direct tobacco smoking mutagenesis in bladder tissues.</dcterms:abstract>
    <dc:creator>He, Yudou</dc:creator>
    <dc:contributor>Bergstrom, Erik N.</dc:contributor>
    <dc:creator>Wu, Yang</dc:creator>
    <dc:creator>Barnes, Mark</dc:creator>
    <dcterms:rights rdf:resource="http://creativecommons.org/licenses/by/4.0/"/>
    <dc:contributor>Wu, Yang</dc:contributor>
    <bibo:uri rdf:resource="https://kops.uni-konstanz.de/handle/123456789/66400"/>
    <dcterms:title>Uncovering novel mutational signatures by de novo extraction with SigProfilerExtractor</dcterms:title>
    <dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2023-03-14T08:02:38Z</dc:date>
    <foaf:homepage rdf:resource="http://localhost:8080/"/>
    <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/28"/>
    <dc:contributor>Islam, S.M. Ashiqul</dc:contributor>
    <dc:creator>Bergstrom, Erik N.</dc:creator>
    <dcterms:hasPart rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/66400/4/Islam_2-1dbw5dkpf7ikh0.pdf"/>
    <dc:creator>Vangara, Raviteja</dc:creator>
    <dc:contributor>Gruber, Andreas J.</dc:contributor>
    <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/28"/>
    <dc:rights>Attribution 4.0 International</dc:rights>
  </rdf:Description>
</rdf:RDF>
kops.description.openAccessopenaccessgold
kops.flag.etalAuthortrue
kops.flag.isPeerReviewedtrue
kops.identifier.nbnurn:nbn:de:bsz:352-2-1dbw5dkpf7ikh0
kops.sourcefieldCell Genomics. Elsevier. 2022, <b>2</b>(11), 100179. eISSN 2666-979X. Available under: doi: 10.1016/j.xgen.2022.100179deu
kops.sourcefield.plainCell Genomics. Elsevier. 2022, 2(11), 100179. eISSN 2666-979X. Available under: doi: 10.1016/j.xgen.2022.100179deu
kops.sourcefield.plainCell Genomics. Elsevier. 2022, 2(11), 100179. eISSN 2666-979X. Available under: doi: 10.1016/j.xgen.2022.100179eng
relation.isAuthorOfPublication13fa94f3-e646-4858-90fd-6ce765933f76
relation.isAuthorOfPublication.latestForDiscovery13fa94f3-e646-4858-90fd-6ce765933f76
source.bibliographicInfo.articleNumber100179
source.bibliographicInfo.issue11
source.bibliographicInfo.volume2
source.identifier.eissn2666-979X
source.periodicalTitleCell Genomics
source.publisherElsevier

Dateien

Originalbündel

Gerade angezeigt 1 - 1 von 1
Vorschaubild nicht verfügbar
Name:
Islam_2-1dbw5dkpf7ikh0.pdf
Größe:
4.38 MB
Format:
Adobe Portable Document Format
Islam_2-1dbw5dkpf7ikh0.pdf
Islam_2-1dbw5dkpf7ikh0.pdfGröße: 4.38 MBDownloads: 427

Lizenzbündel

Gerade angezeigt 1 - 1 von 1
Vorschaubild nicht verfügbar
Name:
license.txt
Größe:
3.96 KB
Format:
Item-specific license agreed upon to submission
Beschreibung:
license.txt
license.txtGröße: 3.96 KBDownloads: 0