2024 Evaluating and Refining Cross-Domain Metadata Exchange Frameworks
Topics
Data Access
This group will look at producing examples of how access can be described using ODRL, and the kind of controlled vocabularies which are needed to support interoperability among communities of users and in cross-domain scenarios. Existing vocabularies (such as the Data Privacy Vocabulary) should be considered and gaps identified, along with any new vocabularies that are needed.
Reference (from Herve/Darren): Machine Actionable Rights - Objects, Collections, Organisations
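As an illustration of the kind of example this group could produce, the following is a minimal sketch of an ODRL policy expressed as JSON-LD and built in Python. It is not a CDIF recommendation. The dataset and organisation IRIs are hypothetical placeholders, the helper name `build_access_policy` is invented for this sketch, and the choice of a Data Privacy Vocabulary purpose IRI as the constraint value is an assumption about how the two vocabularies might be combined.

```python
import json


def build_access_policy(dataset_iri, assigner_iri, purpose_iri):
    """Return an ODRL Offer (JSON-LD as a dict) permitting use for one purpose.

    All IRIs passed in are supplied by the caller; nothing here is
    prescribed by CDIF or by the workshop.
    """
    return {
        "@context": "http://www.w3.org/ns/odrl.jsonld",
        "@type": "Offer",
        "uid": dataset_iri + "/policy",  # hypothetical policy identifier
        "permission": [
            {
                "target": dataset_iri,
                "assigner": assigner_iri,
                "action": "use",
                "constraint": [
                    {
                        "leftOperand": "purpose",
                        "operator": "eq",
                        # Assumption: the purpose is taken from a controlled
                        # vocabulary such as the Data Privacy Vocabulary.
                        "rightOperand": {"@id": purpose_iri},
                    }
                ],
            }
        ],
    }


policy = build_access_policy(
    "https://example.org/dataset/1",          # hypothetical dataset
    "https://example.org/org/archive",        # hypothetical assigner
    "https://w3id.org/dpv#ResearchAndDevelopment",
)
print(json.dumps(policy, indent=2))
```

Keeping the purpose as an IRI rather than free text is what would let a consuming system compare policies from different communities, which is where gaps in the shared vocabularies tend to become visible.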
Data Discovery
This topic will focus on how the existing Discovery Core profile of CDIF can be used in combination with other profiles, including Data Description and Access, and potential overlap with Provenance information. The mapping between PROV and Schema.org should be considered. Outputs should include documented examples.
Semantic and Syntactic Mapping
The expression of semantic mappings and their use in performing transformations of data for integration purposes is an important topic for the use of FAIR data. This group will explore the way in which mappings can be described for reuse, with an emphasis on machine-actionability. The goal here is to make recommendations aligned with those coming from the RDA group on FAIR mappings and other related work, and to align with other recommendations around data description and integration in CDIF.
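To make the idea of a reusable, machine-actionable mapping concrete, here is a minimal sketch in Python. It assumes mappings are recorded as subject/predicate/object triples using SKOS mapping predicates, loosely in the style of SSSOM. All field identifiers (`src:...`, `tgt:...`) and the `transform` helper are hypothetical and are not taken from CDIF or from the RDA recommendations.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Mapping:
    subject_id: str    # field or concept in the source schema
    predicate_id: str  # SKOS mapping relation
    object_id: str     # field or concept in the target schema


# Hypothetical mappings between two invented schemas.
MAPPINGS = [
    Mapping("src:obs_year", "skos:exactMatch", "tgt:referencePeriod"),
    Mapping("src:val", "skos:exactMatch", "tgt:observationValue"),
    Mapping("src:region", "skos:broadMatch", "tgt:geography"),
]


def transform(record, mappings, allowed=("skos:exactMatch",)):
    """Rename record keys using only mappings whose predicate is allowed.

    Returns (mapped, unmapped) so that fields without a sufficiently
    precise mapping are reported instead of being silently dropped.
    """
    lookup = {
        m.subject_id: m.object_id
        for m in mappings
        if m.predicate_id in allowed
    }
    mapped, unmapped = {}, {}
    for key, value in record.items():
        if key in lookup:
            mapped[lookup[key]] = value
        else:
            unmapped[key] = value
    return mapped, unmapped


source_record = {"src:obs_year": 2023, "src:val": 4.2, "src:region": "Nordic"}
mapped, unmapped = transform(source_record, MAPPINGS)
print(mapped)
print(unmapped)
```

Separating the mapping records from the transformation logic is the design choice that allows the same mappings to be published, reviewed, and reused by other tools. Restricting the transformation to exact matches is only one possible policy.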
Data Description
The basic core profile presented in the current CDIF draft is limited in scope. This group will further explore how recommendations can be made to extend the metadata profile to better support the description of data integration. There is a strong overlap here with mapping, discovery, and provenance metadata, and these connections should be explored through worked examples (based on the SDG Indicators, etc.).
Provenance: Initial Steps and Beyond
The description of provenance is critical for supporting the reproducibility of findings. This group will consider what practical steps could be taken to improve provenance information as initial recommendations, and will develop a draft CDIF profile. Can Schema.org, DCAT, and DC be mapped to PROV? The group will also consider how recommendations could be aimed at more sophisticated implementers. A focus of this activity will be alignment with the work in the Context group.
Provenance: Context
In order to understand the context of data across domain boundaries, there are a number of contextual factors which require description. A shared understanding of how to describe observed events, the collection or generation of data, samples, processing, and archival and dissemination practices is needed to fully describe data for the purposes of reuse. This group will build on work done at the 2023 Dagstuhl workshop to draft a common model for these aspects of provenance and the context of data. (See the discussion draft of the provenance framework: https://docs.google.com/document/d/1WLkXrcVmd_yNTWcs_OZYxAMqBssWoM6vEtb4MOuP8zA/edit?usp=drive_link )
Agenda, Monday October 14, 2024
09:00–10:30 | Welcome and Background; Participant Introductions |
10:30–10:45 | Coffee Break |
10:45–12:15 | Overview of WorldFAIR/WorldFAIR+ (Simon); Overview of Current CDIF Draft (Arofan); Overview of Topics and Suggested Deliverables (Arofan); Discovery (Steve R); Access (Darren) |
12:15–13:45 | Lunch |
13:45–15:15 | Data Description (Flavio/Luis); Semantic Mapping (Yann); Context/Provenance |
15:15–15:30 | Coffee Break |
15:30–17:00 | Summary of DDI-CDI Workshop Outputs (Arofan); UNEP Global Environmental Data Strategy (GEDS) (Sally Radwan - Remote Presentation); Reminder: Topics & Deliverables; Selection of Groups (May move to Tuesday AM) |
Rest of Week – Regular Schedule
07:30–08:45 | Breakfast |
09:00–09:15 | Morning Plenary |
09:15–10:30 | Breakout Groups |
10:30–10:45 | Coffee Break |
10:45–12:15 | Breakout Groups |
12:15–13:45 | Lunch and Walk |
13:45–15:15 | Plenary/Breakout Groups |
15:15–15:30 | Coffee Break |
15:30–16:15 | Breakout Groups |
16:15–17:00 | Closing Plenary/Discussion |
18:00–19:00 | Dinner |
19:00–20:00 | Possible Evening Session (Informal Discussion with Drinks at Own Expense) |
Workshop Summary
Date and Location
The workshop takes place at Schloss Dagstuhl – Leibniz Center for Informatics from October 13 to October 18, 2024. See also the corresponding Dagstuhl web page and its information on COVID-19.
See the separate page for practical information.
Workshop Schedule
See the separate page for practical information.
Organizers and Participants
Organizers
Michelle Edwards, University of Guelph - Canada
Arofan Gregory, CODATA and DDI Alliance - USA
Simon Hodson, CODATA - Committee on Data of the International Science Council (ISC) - France
Steven McEachern, UK Data Service, University of Essex and DDI Alliance
Hilde Orten, Sikt – Norwegian Agency for Shared Services in Education and Research and DDI Alliance
Joachim Wackerow, Independent Expert - Germany
Participants
Darren Bell, UK Data Archive
Tathagata Bhattacharjee, LSHTM (The London School of Hygiene & Tropical Medicine)
Ian Bruno, CCDC (Cambridge Crystallographic Data Centre)
Simon Cox, Independent expert
Mark Dietrich, EGI
Doug Fils, Independent expert
Heike Görzig, HZB (Helmholtz-Zentrum Berlin)
Alexandra Kokkinaki, NOC (UK National Oceanography Centre)
Polina Koroleva, UNEP (UN Environment Programme)
Yann Le Franc, eScience Factory
Kerstin Lehnert, Columbia University
Iseult Lynch, University of Birmingham
Lauren Maxwell, University of Heidelberg
Luis Gonzalez Morales, UN Statistics Division
Michael Ochola, APHRC (African Population and Health Research Center)
Steve Richard, Independent expert
Flavio Rizzolo, Statistics Canada