Polishing Structural Bulk Updates in a Native XML Database

dc.contributor.authorKircher, Lukas
dc.date.accessioned2014-11-05T07:52:05Z
dc.date.available2014-11-05T07:52:05Z
dc.date.issued2013eng
dc.description.abstractBaseX is a native XML database on the foundation of a fixed-length, sequential document encoding. Built with a strong focus on reading performance, this thesis shows that the Pre/Dist/Size encoding is yet perfectly capable of handling massive bulk update transactions. Despite its theoretical limitations regarding structural changes of the tree, we show that performance is indeed restricted by the document order and disk access patterns. During tests with the XQuery Update Facility (XQUF), we delete 1.8 million nodes in 22.4 seconds, evenly distributed over a 1.1GB XMark document instance. Compared to the prior approach, this equals a reduction of processing time by 99.99%. To achieve this, we extend the obligatory implementation of the XQUF pending update list with an additional low-level layer, that pre-calculates tree structure adjustments in-memory. This layer adds little overhead and further enables us to merge update operations and curb fragmentation that finds its origin at the user level. To not violate the document order, the XQUF is only arduously brought together with the concept of efficient bulk updates. A method is introduced that imposes an order on update primitives to finally get a ready-to-apply sequence of atomic updates. The reviewed implementation is fully consistent with the XQUF specification and has already proven rock-solid efficiency in production use. A few theoretical paragraphs on alternative approaches, disk access patterns and memory consumption highlight sleeping potential and prepare further progression.eng
dc.description.versionpublished
dc.identifier.ppn416516483
dc.identifier.urihttp://kops.uni-konstanz.de/handle/123456789/29218
dc.language.isoengeng
dc.rightsterms-of-use
dc.rights.urihttps://rightsstatements.org/page/InC/1.0/
dc.subjectXML, XQuery, XQuery Update Facility, XQUF, query, querying, updating, update, fixed-length encoding, native XML database, atomic update, pending update list, document order, bulk update, structural update, document tree, XMark, update primitive, sequential encoding, performance, insert, delete, replace, XPath, axes, ancestor relationship, delayed, execution order, order of updates, bulk querieseng
dc.subject.ddc004eng
dc.titlePolishing Structural Bulk Updates in a Native XML Databaseeng
dc.typeMSC_THESISeng
dspace.entity.typePublication
kops.citation.bibtex
@mastersthesis{Kircher2013Polis-29218,
  year={2013},
  title={Polishing Structural Bulk Updates in a Native XML Database},
  address={Konstanz},
  school={Universität Konstanz},
  author={Kircher, Lukas}
}
kops.citation.iso690KIRCHER, Lukas, 2013. Polishing Structural Bulk Updates in a Native XML Database [Master thesis]. Konstanz: Universität Konstanzdeu
kops.citation.iso690KIRCHER, Lukas, 2013. Polishing Structural Bulk Updates in a Native XML Database [Master thesis]. Konstanz: Universität Konstanzeng
kops.citation.rdf
<rdf:RDF
    xmlns:dcterms="http://purl.org/dc/terms/"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:bibo="http://purl.org/ontology/bibo/"
    xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#"
    xmlns:foaf="http://xmlns.com/foaf/0.1/"
    xmlns:void="http://rdfs.org/ns/void#"
    xmlns:xsd="http://www.w3.org/2001/XMLSchema#" > 
  <rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/29218">
    <dcterms:hasPart rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/29218/3/Kircher_0-258273.pdf"/>
    <dc:language>eng</dc:language>
    <dc:contributor>Kircher, Lukas</dc:contributor>
    <dc:creator>Kircher, Lukas</dc:creator>
    <dcterms:issued>2013</dcterms:issued>
    <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/>
    <foaf:homepage rdf:resource="http://localhost:8080/"/>
    <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/>
    <dc:rights>terms-of-use</dc:rights>
    <dcterms:rights rdf:resource="https://rightsstatements.org/page/InC/1.0/"/>
    <bibo:uri rdf:resource="http://kops.uni-konstanz.de/handle/123456789/29218"/>
    <dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2014-11-05T07:52:05Z</dcterms:available>
    <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/>
    <dcterms:abstract xml:lang="eng">BaseX is a native XML database on the foundation of a fixed-length, sequential document encoding. Built with a strong focus on reading performance, this thesis shows that the Pre/Dist/Size encoding is yet perfectly capable of handling massive bulk update transactions. Despite its theoretical limitations regarding structural changes of the tree, we show that performance is indeed restricted by the document order and disk access patterns. During tests with the XQuery Update Facility (XQUF), we delete 1.8 million nodes in 22.4 seconds, evenly distributed over a 1.1GB XMark document instance. Compared to the prior approach, this equals a reduction of processing time by 99.99%. To achieve this, we extend the obligatory implementation of the XQUF pending update list with an additional low-level layer, that pre-calculates tree structure adjustments in-memory. This layer adds little overhead and further enables us to merge update operations and curb fragmentation that finds its origin at the user level. To not violate the document order, the XQUF is only arduously brought together with the concept of efficient bulk updates. A method is introduced that imposes an order on update primitives to finally get a ready-to-apply sequence of atomic updates. The reviewed implementation is fully consistent with the XQUF specification and has already proven rock-solid efficiency in production use. A few theoretical paragraphs on alternative approaches, disk access patterns and memory consumption highlight sleeping potential and prepare further progression.</dcterms:abstract>
    <dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2014-11-05T07:52:05Z</dc:date>
    <dcterms:title>Polishing Structural Bulk Updates in a Native XML Database</dcterms:title>
    <dspace:hasBitstream rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/29218/3/Kircher_0-258273.pdf"/>
  </rdf:Description>
</rdf:RDF>
kops.description.openAccessopenaccessgreen
kops.identifier.nbnurn:nbn:de:bsz:352-0-258273
kops.location.thesisKonstanzeng
kops.relation.grantingInstitutionUniversität Konstanzeng
relation.isAuthorOfPublication958729f3-29ba-49f1-af2b-f5f02466246f
relation.isAuthorOfPublication.latestForDiscovery958729f3-29ba-49f1-af2b-f5f02466246f
temp.internal.duplicates<p>Keine Dubletten gefunden. Letzte Überprüfung: 04.11.2014 10:30:03</p>deu

Dateien

Originalbündel

Gerade angezeigt 1 - 1 von 1
Vorschaubild nicht verfügbar
Name:
Kircher_0-258273.pdf
Größe:
3 MB
Format:
Adobe Portable Document Format
Kircher_0-258273.pdf
Kircher_0-258273.pdfGröße: 3 MBDownloads: 347

Lizenzbündel

Gerade angezeigt 1 - 1 von 1
Vorschaubild nicht verfügbar
Name:
license.txt
Größe:
3.88 KB
Format:
Item-specific license agreed upon to submission
Beschreibung:
license.txt
license.txtGröße: 3.88 KBDownloads: 0