PAXQuery: Parallel Analytical XML Processing
Camacho-Rodríguez, Jesús; Colazzo, Dario; Manolescu, Ioana; Naranjo, Juan A. M. (2015), PAXQuery: Parallel Analytical XML Processing, SIGMOD/PODS'15: International Conference on Management of Data, 2015-05, Melbourne, Australia
TypeCommunication / Conférence
Conference titleSIGMOD/PODS'15: International Conference on Management of Data
Book authorSellis, Timos
MetadataShow full item record
Laboratoire d'analyse et modélisation de systèmes pour l'aide à la décision [LAMSADE]
Naranjo, Juan A. M.
Abstract (EN)XQuery is a general-purpose programming language for processing semi-structured data, and as such, it is very expressive. As a consequence , optimizing and parallelizing complex analytics XQuery queries is still an open, challenging problem. We demonstrate PAXQuery, a novel system that parallelizes the execution of XQuery queries over large collections of XML documents. PAXQuery compiles a rich subset of XQuery into plans expressed in the PArallelization ConTracts (PACT) programming model. Thanks to this translation, the resulting plans are optimized and executed in a massively parallel fashion by the Apache Flink system. The result is a scalable system capable of querying massive amounts of XML data very efficiently, as proved by the experimental results we outline.
Subjects / KeywordsComputing methodologies; Database query processing
Showing items related by title and author.