
Fast nGram-Based String Search Over Data Encoded Using Algebraic Signatures
Litwin, Witold; Mokadem, Riad; Rigaux, Philippe; Schwartz, Thomas (2007), Fast nGram-Based String Search Over Data Encoded Using Algebraic Signatures, dans Koch, Christoph; Gehrke, Johannes; Garofalakis, Minos; Srivastava, Divesh; Aberer, Karl; Deshpande, Anand; Forescu, Daniela; Chan, Chee Yong; Ganti, Venkatesh; Kanne, Karl-Christian; Klas, Wolfgang; Neuhold, Erich J., Proceedings of the 33rd International Conference on Very Large Data Bases, University of Vienna, Austria, September 23-27, 2007, AMC, p. 207-218
Voir/Ouvrir
Type
Communication / ConférenceDate
2007Titre du colloque
33rd International Conference on Very Large Data Bases VLDB 2007Date du colloque
2007-09Ville du colloque
ViennePays du colloque
AutricheTitre de l'ouvrage
Proceedings of the 33rd International Conference on Very Large Data Bases, University of Vienna, Austria, September 23-27, 2007Auteurs de l’ouvrage
Koch, Christoph; Gehrke, Johannes; Garofalakis, Minos; Srivastava, Divesh; Aberer, Karl; Deshpande, Anand; Forescu, Daniela; Chan, Chee Yong; Ganti, Venkatesh; Kanne, Karl-Christian; Klas, Wolfgang; Neuhold, Erich J.Éditeur
AMC
Isbn
978-1-59593-649-3
Pages
207-218
Métadonnées
Afficher la notice complèteRésumé (EN)
We propose a novel string search algorithm for data stored once and read many times. Our search method combines the sublinear traversal of the record (as in Boyer Moore or Knuth-Morris-Pratt) with the agglomeration of parts of the record and search pattern into a single character – the algebraic signature – in the manner of Karp-Rabin. Our experiments show that our algorithm is up to seventy times faster for DNA data, up to eleven times faster for ASCII, and up to a six times faster for XML documents compared with an im- plementation of Boyer-Moore. To obtain this speed-up, we store records in encoded form, where each original character is replaced with an algebraic signature. Our method applies to records stored in databases in general and to distributed implementations of a Database As Service (DAS) in particular. Clients send records for insertion and search patterns already in encoded form and servers never operate on records in clear text. No one at a node can involuntarily discover the content of the stored data.Mots-clés
databases; algebraic signature; Search algorithmPublications associées
Affichage des éléments liés par titre et auteur.
-
Litwin, Witold; Mokadem, Riad; Schwarz, Thomas (2007) Communication / Conférence
-
Constantin, Camelia; du Mouza, Cedric; Litwin, Witold; Rigaux, Philippe; Schwarz, Thomas (2016) Article accepté pour publication ou publié
-
du Mouza, Cedric; Litwin, Witold; Rigaux, Philippe; Schwarz, Thomas (2009) Communication / Conférence
-
Rigaux, Philippe; Litwin, Witold; du Mouza, Cédric; Schwarz, Thomas (2009) Communication / Conférence
-
Rigaux, Philippe; Litwin, Witold; du Mouza, Cédric (2009) Article accepté pour publication ou publié