2024 Evaluating and Refining Cross-Domain Metadata Exchange Frameworks

 

Topics

Data Access

This group will look at producing examples of how access can be described using ODRL, and the kind of controlled vocabularies which are needed to support interoperability among communities of users and in cross-domain scenarios. Existing vocabularies (such as the Data Privacy Vocabulary) should be considered, and gaps identified. New vocabularies needed should be identified.

Reference (from Herve/Darren): Machine Actionable Rights - Objects, Collections, Organisations

Data Discovery

This topic will focus on how the existing Discovery Core profile of CDIF can be used in combination with other profiles, including Data Description and Access, and potential overlap with Provenance information. The mapping between PROV and http://Schema.org should be considered. Outputs should include documented examples.

Semantic and Syntactic Mapping

The expression of semantic mappings and their use in performing transformations of data for integration purposes is an important topic for the use of FAIR data. This group will explore the way in which mappings can be described for reuse, with an emphasis on machine-actionability. The goal here is to make recommendations aligned with those coming from the RDA group on FAIR mappings and other related work, and to align with other recommendations around data description and integration in CDIF.

Data Description

The basic core profile presented in the current CDIF draft is limited in scope. This group will further explore how recommendations can be made to extend the metadata profile to better support the description of data integration. There is a strong overlap here with mapping, discovery, and provenance metadata, and these connections should be explored through worked examples (based on the SDG Indicators, etc.)

Provenance: Initial Steps and Beyond

The description of provenance is critical for supporting the reproducibility of findings. Consider how practical steps could be taken to improve provenance information as initial recommendations. Develop CDIF profile draft. Schema.org/DCAT/DC mapped to PROV?Consider how recommendations could be aimed at more sophisticated implementers. A focus of this activity will be alignment with the work in the Context group

Provenance: Context

In order to understand the context of data across domain boundaries, there are a number of contextual factors which require description. A shared understanding of how observed events, collection or generation of data, samples, processing, and archival and dissemination practices is needed to fully describe data for the purposes of reuse. This group will build on work done at the 2023 Dagstuhl workshop to draft a common model for these aspects of provenance and the context of data. (See discussion draft of provenance framework: https://docs.google.com/document/d/1WLkXrcVmd_yNTWcs_OZYxAMqBssWoM6vEtb4MOuP8zA/edit?usp=drive_link )

Agenda, Monday October 14, 2024

 

 

09:00–10:30

Welcome and Background

Participant Introductions

 

 

10:30–10:45

Coffee Break

 

 

10:45–12:15

Overview of WorldFAIR/WorldFAIR+ (Simon)

Overview of Current CDIF Draft (Arofan)

Overview of Topics and Suggested Deliverables (Arofan)

Discovery (Steve R)

Access (Darren)

 

 

12:15–13:45

Lunch

 

 

13:45–15:15

Data Description (Flavio/Luis)

Semantic Mapping (Yann)

Context/Provenance

  • Provenance: Initial Approaches/Schema.org-to-PROV (Pier-Luigi)

  • Observations, Samples, Events (Arofan + Iseult)

 

 

15:15–15:30

Coffee Break

 

 

15:30–17:00

Summary of DDI-CDI Workshop Outputs (Arofan)

UNEP Global Environmental Data Strategy (GEDS) (Sally Radwan - Remote Presentation)

Reminder: Topics & Deliverables

Selection of Groups (May move to Tuesday AM)

 

 

Presentations Day 1, 2

 

Rest of Week – Regular Schedule

07:30–08:45

Breakfast

09:00–09:15

Morning Plenary

09:15–10:30

Breakout Groups

10:30–10:45

Coffee Break

10:45–12:15

Breakout Groups

12:15–13:45

Lunch and Walk

13:45–15:15

Plenary/Breakout Groups

15:15–15:30

Coffee Break

15:30–16:15

Breakout Groups

16:15–17:00

Closing Plenary/Discussion

18:00–19:00

Dinner

19:00–20:00

Possible Evening Session(Informal Discussion with Drinks at Own Expense)

Search workshop pages


Workshop Summary

 


Date and Location

The workshop takes place at Schloss Dagstuhl – Leibniz Center for Informatics on October 13 to October 18, 2024. See also the corresponding Dagstuhl web page and its information on COVID-19.

See the separate page for practical information.


Workshop Schedule

See the separate page for practical information.


Organizers and Participants

Organizers

Participants

  • Darren Bell, UK Data Archive

  • Tathagata Bhattacharjee, LSHTM (The London School of Hygiene & Tropical Medicine)

  • Ian Bruno, CCDC (Cambridge Crystallographic Data Centre)

  • Simon Cox, Independent expert

  • Mark Dietrich, EGI

  • Doug Fils, Independent expert

  • Heike Görzig, HZB (Helmholtz-Zentrum Berlin)

  • Alexandra Kokkinaki, NOC (UK National Oceanography Centre)

  • Polina Koroleva, UNEP (UN Environment Programme)

  • Yann Le Franc, eScience Factory

  • Kerstin Lehnert, Columbia University

  • Iseult Lynch, University of Birmingham

  • Lauren Maxwell, University of Heidelberg

  • Luis Gonzalez Morales, UN Statistics Division

  • Michael Ochola, APHRC (African Population and Health Research Center)

  • Steve Richard, Independent expert

  • Flavio Rizzolo, Statistics Canada

 

Google folder for plenary