SHARP: Harmonizing and Bridging Cross-Workflow Provenance
Gaignard, Alban; Belhajjame, Khalid; Skaf-Molli, Hala (2017), SHARP: Harmonizing and Bridging Cross-Workflow Provenance, The Semantic Web: ESWC 2017 Satellite Events, 2017-05, Portorož, Slovenia
Type
Communication / ConférenceExternal document link
https://hal.archives-ouvertes.fr/hal-01768385Date
2017Conference title
The Semantic Web: ESWC 2017 Satellite EventsConference date
2017-05Conference city
PortorožConference country
SloveniaBook author
Blomqvist, Eva; Hose, Katja; Paulheim, HeikoPublisher
Springer
Published in
Cham
ISBN
978-3-319-70407-4
Number of pages
387Pages
219-234
Publication identifier
Metadata
Show full item recordAuthor(s)
Gaignard, Alban
Belhajjame, Khalid
Laboratoire d'analyse et modélisation de systèmes pour l'aide à la décision [LAMSADE]
Skaf-Molli, Hala

Abstract (EN)
PROV has been adopted by a number of workflow systems for encoding the traces of workflow executions. Exploiting these provenance traces is hampered by two main impediments. Firstly, workflow systems extend PROV differently to cater for system-specific constructs. The difference between the adopted PROV extensions yields heterogeneity in the generated provenance traces. This heterogeneity diminishes the value of such traces, e.g. when combining and querying provenance traces of different workflow systems. Secondly, the provenance recorded by workflow systems tends to be large, and as such difficult to browse and understand by a human user. In this paper (extending, initially published at SeWeBMeDA’17), we propose SHARP, a Linked Data approach for harmonizing cross-workflow provenance. The harmonization is performed by chasing tuple-generating and equality-generating dependencies defined for workflow provenance. This results in a provenance graph that can be summarized using domain-specific vocabularies. We experimentally evaluate SHARP (i) on publicly available provenance documents and (ii) using a real-world omic experiment involving workflow traces generated by the Taverna and Galaxy systems.Subjects / Keywords
Reproducibility; Scientific Workflows; Provenance; Prov ConstraintsRelated items
Showing items related by title and author.
-
Gaignard, Alban; Skaf-Molli, Hala; Belhajjame, Khalid (2020) Article accepté pour publication ou publié
-
Découvrabilité et réutilisation de données produites par des workflows : un cas d’usage en génomique Gaignard, Alban; Skaf-Molli, Hala; Belhajjame, Khalid Communication / Conférence
-
Découvrabilité et réutilisation de données produites par des workflows : un cas d’usage en génomique Gaignard, Alban; Skaf-Molli, Hala; Belhajjame, Khalid (2021) Communication / Conférence
-
Cohen-Boulakia, Sarah; Belhajjame, Khalid; Collin, Olivier; Chopard, Jérôme; Froidevaux, Christine; Gaignard, Alban (2017) Article accepté pour publication ou publié
-
Djaffardjy, Marine; Marchment, George; Sebe, Clémence; Blanchet, Raphaël; Belhajjame, Khalid; Gaignard, Alban; Lemoine, Frédéric; Cohen-Boulakia, Sarah (2023) Article accepté pour publication ou publié