Annotating the Behavior of Scientific Modules Using Data Examples: A Practical Approach
Belhajjame, Khalid (2014), Annotating the Behavior of Scientific Modules Using Data Examples: A Practical Approach, Advances in Database Technology - EDBT 2014, 17th International Conference on Extending Database Technology, Athens, Greece, March 24-28, Proceedings, OpenProceedings.org : Konstanz, p. 726-737
TypeCommunication / Conférence
Conference title17th International Conference on Extending Database Technology, EDBT 2014
Book titleAdvances in Database Technology - EDBT 2014, 17th International Conference on Extending Database Technology, Athens, Greece, March 24-28, Proceedings
MetadataShow full item record
Laboratoire d'analyse et modélisation de systèmes pour l'aide à la décision [LAMSADE]
Abstract (EN)A major issue that arises when designing scientific experiments(i.e., workflows) is that of identifying the modules (which are of-ten “black boxes”), that are suitable for performing the steps of theexperiment. To assist scientists in the task of identifying suitablemodules, semantic annotations have been proposed and used to de-scribe scientific modules. Different facets of the module can be de-scribed using semantic annotations. Our experience with scientistsfrom modern sciences such as bioinformatics, biodiversity and as-tronomy, however, suggests that most of semantic annotations thatare available are confined to the description of the domain of inputand output parameters of modules. Annotations specifying the be-havior of the modules, as to the tasks they play, are rarely specified.To address this issue, we argue in this paper that data examples arean intuitive and effective means for understanding the behavior ofscientific modules. We present a heuristic for automatically gener-ating data examples that annotate scientific modules without rely-ing on the existence of the module specifications, and show throughan empirical evaluation that uses real-world scientific modules theeffectiveness of the heuristic proposed.The data examples generated can be utilized in a range of scientificmodule management operations. To demonstrate this, we presentthe results of two real-world exercises that show that: (i) Data ex-amples are an intuitive means for human users to understand thebehavior of scientific modules, and that (ii) data examples are aneffective ingredient for matching scientific modules.
Subjects / KeywordsData example; scientific module; module annotation; module comparison; scientific workflow; workflow decay
Showing items related by title and author.
Djaffardjy, Marine; Marchment, George; Sebe, Clémence; Blanchet, Raphaël; Belhajjame, Khalid; Gaignard, Alban; Lemoine, Frédéric; Cohen-Boulakia, Sarah (2023) Article accepté pour publication ou publié
A Model-Driven Framework for the Modeling and the Description of Data-as-a-Service to Assist Service Selection and Composition Alili, Hiba; Drira, Rim; Belhajjame, Khalid; Ben Ghezala, Henda Hajjami; Grigori, Daniela (2019) Communication / Conférence
Scientific workflows for computational reproducibility in the life sciences: Status, challenges and opportunities Cohen-Boulakia, Sarah; Belhajjame, Khalid; Collin, Olivier; Chopard, Jérôme; Froidevaux, Christine; Gaignard, Alban (2017) Article accepté pour publication ou publié