Show simple item record

dc.contributor.authorGaignard, Alban
HAL ID: 1448
ORCID: 0000-0002-3597-8557
dc.contributor.authorSkaf-Molli, Hala
HAL ID: 12153
ORCID: 0000-0003-1062-6659
dc.contributor.authorBelhajjame, Khalid
dc.date.accessioned2020-11-06T15:09:15Z
dc.date.available2020-11-06T15:09:15Z
dc.date.issued2020
dc.identifier.urihttps://basepub.dauphine.fr/handle/123456789/21195
dc.language.isoenen
dc.subjectFAIR
dc.subjectLinked Data
dc.subjectscientific workflows
dc.subjectprovenance
dc.subjectbioinformatics
dc.subjectdata summaries
dc.subject.ddc005.7en
dc.titleFindable and reusable workflow data products: A genomic workflow case study
dc.typeArticle accepté pour publication ou publié
dc.description.abstractenWhile workflow systems have improved the repeatability of scientific experiments, the value of the processed (intermediate) data have been overlooked so far. In this paper, we argue that the intermediate data products of workflow executions should be seen as first-class objects that need to be curated and published. Not only will this be exploited to save time and resources needed when re-executing workflows, but more importantly, it will improve the reuse of data products by the same or peer scientists in the context of new hypotheses and experiments. To assist curator in annotating (intermediate) workflow data, we exploit in this work multiple sources of information, namely: (i) the provenance information captured by the workflow system, and (ii) domain annotations that are provided by tools registries, such as Bio.Tools. Furthermore, we show, on a concrete bioinformatics scenario, how summarising techniques can be used to reduce the machine-generated provenance information of such data products into concise human- and machine-readable annotations.
dc.relation.isversionofjnlnameSemantic Web Journal
dc.relation.isversionofjnlvol11
dc.relation.isversionofjnlissue5
dc.relation.isversionofjnldate2020
dc.relation.isversionofjnlpages751-763
dc.relation.isversionofdoi10.3233/SW-200374
dc.subject.ddclabelOrganisation des donnéesen
dc.relation.forthcomingnonen
dc.relation.forthcomingprintnonen
dc.description.ssrncandidatenon
dc.description.halcandidatenon
dc.description.readershiprecherche
dc.description.audienceInternational
dc.relation.Isversionofjnlpeerreviewednon
dc.date.updated2020-12-17T09:29:29Z


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record