Show simple item record

hal.structure.identifier
dc.contributor.authorCamacho-Rodríguez, Jesús*
hal.structure.identifierLaboratoire d'analyse et modélisation de systèmes pour l'aide à la décision [LAMSADE]
dc.contributor.authorColazzo, Dario*
hal.structure.identifierInstitut für Parallele und Verteilte Systeme [IPVS]
dc.contributor.authorHerschel, Melanie*
hal.structure.identifierLaboratoire d'informatique de l'École polytechnique [Palaiseau] [LIX]
hal.structure.identifier
dc.contributor.authorManolescu, Ioana
HAL ID: 742652
ORCID: 0000-0002-0425-2462
*
hal.structure.identifier
dc.contributor.authorChowdhury, Soudip Roy*
dc.date.accessioned2017-04-07T14:24:08Z
dc.date.available2017-04-07T14:24:08Z
dc.date.issued2016
dc.identifier.urihttps://basepub.dauphine.fr/handle/123456789/16496
dc.language.isoenen
dc.subjectMapReduceen
dc.subjectBig Dataen
dc.subjectPig Latinen
dc.subjectReuse-based Optimizationen
dc.subjectLinear Programmingen
dc.subject.ddc005en
dc.titleReuse-based Optimization for Pig Latinen
dc.typeCommunication / Conférence
dc.description.abstractenPig Latin is a popular language which is widely used for parallel processing of massive data sets. Currently, subexpressions occurring repeatedly in Pig Latin scripts are executed as many times as they appear, and the current Pig Latin optimizer does not identify reuse opportunities. We present a novel optimization approach aiming at identifying and reusing repeated subexpressions in Pig Latin scripts. Our optimization algorithm, named PigReuse, identifies subexpression merging opportunities, selects the best ones to execute based on a cost function, and reuses their results as needed in order to compute exactly the same output as the original scripts. Our experiments demonstrate the effectiveness of our approach.en
dc.identifier.citationpages2215-2220en
dc.relation.ispartoftitleProceedings of the 25th ACM International on Conference on Information and Knowledge Management (CIKM'16)en
dc.relation.ispartofeditorMukhopadhyay, Snehasis
dc.relation.ispartofeditorZhai, ChengXiang
dc.relation.ispartofpublnameACM Pressen
dc.relation.ispartofpublcityNew Yorken
dc.relation.ispartofdate2016
dc.relation.ispartofpages2512en
dc.subject.ddclabelProgrammation, logiciels, organisation des donnéesen
dc.relation.ispartofisbn978-1-4503-4073-1en
dc.relation.conftitle25th ACM International on Conference on Information and Knowledge Management (CIKM'16)en
dc.relation.confdate2016-10
dc.relation.confcityIndianapolisen
dc.relation.confcountryUnited Statesen
dc.relation.forthcomingnonen
dc.identifier.doi10.1145/2983323.2983669en
dc.description.ssrncandidatenonen
dc.description.halcandidateouien
dc.description.readershiprechercheen
dc.description.audienceInternationalen
dc.relation.Isversionofjnlpeerreviewednonen
dc.relation.Isversionofjnlpeerreviewednonen
dc.date.updated2017-04-07T14:05:30Z
hal.faultCode{"duplicate-entry":{"hal-01425321":{"doi":"1.0"}}}
hal.author.functionaut
hal.author.functionaut
hal.author.functionaut
hal.author.functionaut
hal.author.functionaut


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record