Topics

Data Access

This group will look at producing examples of how access can be described using ODRL, and the kind of controlled vocabularies which are needed to support interoperability among communities of users and in cross-domain scenarios. Existing vocabularies (such as the Data Privacy Vocabulary) should be considered, and gaps identified. New vocabularies needed should be identified.

Reference (from Herve/Darren): Machine Actionable Rights - Objects, Collections, Organisations

Data Discovery

This topic will focus on how the existing Discovery Core profile of CDIF can be used in combination with other profiles, including Data Description and Access, and potential overlap with Provenance information. The mapping between PROV and http://Schema.org should be considered. Outputs should include documented examples.

Semantic and Syntactic Mapping

The expression of semantic mappings and their use in performing transformations of data for integration purposes is an important topic for the use of FAIR data. This group will explore the way in which mappings can be described for reuse, with an emphasis on machine-actionability. The goal here is to make recommendations aligned with those coming from the RDA group on FAIR mappings and other related work, and to align with other recommendations around data description and integration in CDIF.

Data Description

The basic core profile presented in the current CDIF draft is limited in scope. This group will further explore how recommendations can be made to extend the metadata profile to better support the description of data integration. There is a strong overlap here with mapping, discovery, and provenance metadata, and these connections should be explored through worked examples (based on the SDG Indicators, etc.)

Provenance: Initial Steps and Beyond

The description of provenance is critical for supporting the reproducibility of findings. Consider how practical steps could be taken to improve provenance information as initial recommendations. Develop CDIF profile draft. Schema.org/DCAT/DC mapped to PROV?Consider how recommendations could be aimed at more sophisticated implementers. A focus of this activity will be alignment with the work in the Context group

Provenance: Context

In order to understand the context of data across domain boundaries, there are a number of contextual factors which require description. A shared understanding of how observed events, collection or generation of data, samples, processing, and archival and dissemination practices is needed to fully describe data for the purposes of reuse. This group will build on work done at the 2023 Dagstuhl workshop to draft a common model for these aspects of provenance and the context of data. (See discussion draft of provenance framework: https://docs.google.com/document/d/1WLkXrcVmd_yNTWcs_OZYxAMqBssWoM6vEtb4MOuP8zA/edit?usp=drive_link )

Agenda, Monday October 14, 2024
09:00–10:30	Welcome and Background Participant Introductions
10:30–10:45	Coffee Break
10:45–12:15	Overview of WorldFAIR/WorldFAIR+ (Simon) Overview of Current CDIF Draft (Arofan) Overview of Topics and Suggested Deliverables (Arofan) Discovery (Steve R) Access (Darren)
12:15–13:45	Lunch
13:45–15:15	Data Description (Flavio/Luis) Semantic Mapping (Yann) Context/Provenance Provenance: Initial Approaches/Schema.org-to-PROV (Pier-Luigi) Observations, Samples, Events (Arofan + Iseult)
15:15–15:30	Coffee Break
15:30–17:00	Summary of DDI-CDI Workshop Outputs (Arofan) UNEP Global Environmental Data Strategy (GEDS) (Sally Radwan - Remote Presentation) Reminder: Topics & Deliverables Selection of Groups (May move to Tuesday AM)
Presentations Day 1, 2

Rest of Week – Regular Schedule
07:30–08:45	Breakfast
09:00–09:15	Morning Plenary
09:15–10:30	Breakout Groups
10:30–10:45	Coffee Break
10:45–12:15	Breakout Groups
12:15–13:45	Lunch and Walk
13:45–15:15	Plenary/Breakout Groups
15:15–15:30	Coffee Break
15:30–16:15	Breakout Groups
16:15–17:00	Closing Plenary/Discussion
18:00–19:00	Dinner
19:00–20:00	Possible Evening Session(Informal Discussion with Drinks at Own Expense)

Workshop Summary

Date and Location

The workshop takes place at Schloss Dagstuhl – Leibniz Center for Informatics on October 13 to October 18, 2024. See also the corresponding Dagstuhl web page and its information on COVID-19.

See the separate page for practical information.

Workshop Schedule

See the separate page for practical information.

Organizers and Participants

Organizers

Michelle Edwards, University of Guelph - Canada
Arofan Gregory, CODATA and DDI Alliance - USA
Simon Hodson, CODATA - Committee on Data of the International Science Council (ISC) - France
Steven McEachern, UK Data Service, University of Essex and DDI Alliance
Hilde Orten, Sikt – Norwegian Agency for Shared Services in Education and Research and DDI Alliance
Joachim Wackerow, Independent Expert - Germany

Participants

Darren Bell, UK Data Archive
Tathagata Bhattacharjee, LSHTM (The London School of Hygiene & Tropical Medicine)
Ian Bruno, CCDC (Cambridge Crystallographic Data Centre)
Simon Cox, Independent expert
Mark Dietrich, EGI
Doug Fils, Independent expert
Heike Görzig, HZB (Helmholtz-Zentrum Berlin)
Alexandra Kokkinaki, NOC (UK National Oceanography Centre)
Polina Koroleva, UNEP (UN Environment Programme)
Yann Le Franc, eScience Factory
Kerstin Lehnert, Columbia University
Iseult Lynch, University of Birmingham
Lauren Maxwell, University of Heidelberg
Luis Gonzalez Morales, UN Statistics Division
Michael Ochola, APHRC (African Population and Health Research Center)
Steve Richard, Independent expert
Flavio Rizzolo, Statistics Canada

Google folder for plenary

DDI

2024 Evaluating and Refining Cross-Domain Metadata Exchange Frameworks