Polishing Structural Bulk Updates in a Native XML Database
| dc.contributor.author | Kircher, Lukas | |
| dc.date.accessioned | 2014-11-05T07:52:05Z | |
| dc.date.available | 2014-11-05T07:52:05Z | |
| dc.date.issued | 2013 | eng |
| dc.description.abstract | BaseX is a native XML database built on a fixed-length, sequential document encoding. Although the Pre/Dist/Size encoding was designed with a strong focus on read performance, this thesis shows that it is nevertheless capable of handling massive bulk update transactions. Despite its theoretical limitations regarding structural changes of the tree, we show that performance is actually restricted by the document order and disk access patterns. During tests with the XQuery Update Facility (XQUF), we delete 1.8 million nodes, evenly distributed over a 1.1 GB XMark document instance, in 22.4 seconds. Compared to the prior approach, this equals a reduction of processing time by 99.99%. To achieve this, we extend the obligatory implementation of the XQUF pending update list with an additional low-level layer that pre-calculates tree structure adjustments in memory. This layer adds little overhead and further enables us to merge update operations and curb fragmentation that originates at the user level. Because the document order must not be violated, the XQUF is difficult to reconcile with the concept of efficient bulk updates. A method is introduced that imposes an order on update primitives to obtain a ready-to-apply sequence of atomic updates. The reviewed implementation is fully consistent with the XQUF specification and has already proven its efficiency in production use. A few theoretical paragraphs on alternative approaches, disk access patterns and memory consumption highlight untapped potential and prepare further work. | eng |
| dc.description.version | published | |
| dc.identifier.ppn | 416516483 | |
| dc.identifier.uri | http://kops.uni-konstanz.de/handle/123456789/29218 | |
| dc.language.iso | eng | eng |
| dc.rights | terms-of-use | |
| dc.rights.uri | https://rightsstatements.org/page/InC/1.0/ | |
| dc.subject | XML, XQuery, XQuery Update Facility, XQUF, query, querying, updating, update, fixed-length encoding, native XML database, atomic update, pending update list, document order, bulk update, structural update, document tree, XMark, update primitive, sequential encoding, performance, insert, delete, replace, XPath, axes, ancestor relationship, delayed, execution order, order of updates, bulk queries | eng |
| dc.subject.ddc | 004 | eng |
| dc.title | Polishing Structural Bulk Updates in a Native XML Database | eng |
| dc.type | MSC_THESIS | eng |
| dspace.entity.type | Publication | |
| kops.citation.bibtex | @mastersthesis{Kircher2013Polis-29218,
year={2013},
title={Polishing Structural Bulk Updates in a Native XML Database},
address={Konstanz},
school={Universität Konstanz},
author={Kircher, Lukas}
} | |
| kops.citation.iso690 | KIRCHER, Lukas, 2013. Polishing Structural Bulk Updates in a Native XML Database [Master thesis]. Konstanz: Universität Konstanz | deu |
| kops.citation.iso690 | KIRCHER, Lukas, 2013. Polishing Structural Bulk Updates in a Native XML Database [Master thesis]. Konstanz: Universität Konstanz | eng |
| kops.citation.rdf | <rdf:RDF
xmlns:dcterms="http://purl.org/dc/terms/"
xmlns:dc="http://purl.org/dc/elements/1.1/"
xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
xmlns:bibo="http://purl.org/ontology/bibo/"
xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#"
xmlns:foaf="http://xmlns.com/foaf/0.1/"
xmlns:void="http://rdfs.org/ns/void#"
xmlns:xsd="http://www.w3.org/2001/XMLSchema#" >
<rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/29218">
<dcterms:hasPart rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/29218/3/Kircher_0-258273.pdf"/>
<dc:language>eng</dc:language>
<dc:contributor>Kircher, Lukas</dc:contributor>
<dc:creator>Kircher, Lukas</dc:creator>
<dcterms:issued>2013</dcterms:issued>
<void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/>
<foaf:homepage rdf:resource="http://localhost:8080/"/>
<dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/>
<dc:rights>terms-of-use</dc:rights>
<dcterms:rights rdf:resource="https://rightsstatements.org/page/InC/1.0/"/>
<bibo:uri rdf:resource="http://kops.uni-konstanz.de/handle/123456789/29218"/>
<dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2014-11-05T07:52:05Z</dcterms:available>
<dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/>
<dcterms:abstract xml:lang="eng">BaseX is a native XML database built on a fixed-length, sequential document encoding. Although the Pre/Dist/Size encoding was designed with a strong focus on read performance, this thesis shows that it is nevertheless capable of handling massive bulk update transactions. Despite its theoretical limitations regarding structural changes of the tree, we show that performance is actually restricted by the document order and disk access patterns. During tests with the XQuery Update Facility (XQUF), we delete 1.8 million nodes, evenly distributed over a 1.1 GB XMark document instance, in 22.4 seconds. Compared to the prior approach, this equals a reduction of processing time by 99.99%. To achieve this, we extend the obligatory implementation of the XQUF pending update list with an additional low-level layer that pre-calculates tree structure adjustments in memory. This layer adds little overhead and further enables us to merge update operations and curb fragmentation that originates at the user level. Because the document order must not be violated, the XQUF is difficult to reconcile with the concept of efficient bulk updates. A method is introduced that imposes an order on update primitives to obtain a ready-to-apply sequence of atomic updates. The reviewed implementation is fully consistent with the XQUF specification and has already proven its efficiency in production use. A few theoretical paragraphs on alternative approaches, disk access patterns and memory consumption highlight untapped potential and prepare further work.</dcterms:abstract>
<dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2014-11-05T07:52:05Z</dc:date>
<dcterms:title>Polishing Structural Bulk Updates in a Native XML Database</dcterms:title>
<dspace:hasBitstream rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/29218/3/Kircher_0-258273.pdf"/>
</rdf:Description>
</rdf:RDF> | |
| kops.description.openAccess | openaccessgreen | |
| kops.identifier.nbn | urn:nbn:de:bsz:352-0-258273 | |
| kops.location.thesis | Konstanz | eng |
| kops.relation.grantingInstitution | Universität Konstanz | eng |
| relation.isAuthorOfPublication | 958729f3-29ba-49f1-af2b-f5f02466246f | |
| relation.isAuthorOfPublication.latestForDiscovery | 958729f3-29ba-49f1-af2b-f5f02466246f | |
| temp.internal.duplicates | <p>No duplicates found. Last check: 04.11.2014 10:30:03</p> | deu |
Files
Original bundle
1 - 1 of 1
- Name:
- Kircher_0-258273.pdf
- Size:
- 3 MB
- Format:
- Adobe Portable Document Format
License bundle
1 - 1 of 1
- Name:
- license.txt
- Size:
- 3.88 KB
- Format:
- Item-specific license agreed upon to submission

