Dagstuhl Sprint, October 2017 (Week One)


Purpose

Introduction

The DDI (Data Documentation Initiative) metadata standard, originally created in 1995 to document social science research data, has in recent years become relevant to new user groups, including the official statistics and medical research communities. In order to respond to these new users, DDI is developing a model-based specification (DDI Version 4) that can be expressed in XML Schema, RDF-S/OWL, relational database schema, and program languages. Such a data model will make it easier to interact with other disciplines and other standards, to understand the specification, to develop and maintain it in a consistent and structured way, and to enable software development that is less dependent on specific DDI versions.

Goals

The overall goal of both DDI workshops in Dagstuhl 2017 is to combine the good and mature parts of the work of the last years in order to have three working functional views, a working class library, the resulting bindings and a working production framework. This will be an important step for the publication of a pilot of DDI 4 which is planned for publication in the middle of 2018. The pilot should prove that the new approaches of DDI 4 work and make sense. This includes especially the model-driven approach, the functional views, and the datum-based data description.

The three planned functional views are Data Description, Data Capture, and Codebook. The goal is to have working functional views with core sets of items (in contrast of functional views which cover everything in the specific area). Additional features can be added to the views in future.

Existing work should be used, refined, and integrated. Start of new work should be minimized and only focus on missing parts which could block the finalization of working components.

Issues in this work – especially missing parts – should be described with underlying reasoning and plan for resolution (including priority level)

All working areas can be understood as a building block for the goal described above. Additionally, work will be done on strategic documents including the plan for a DDI-based infrastructure, the future strategic plan, and mission and guiding principles.

Topics Overview
 Topics Overview - Click here to expand...

The workshop will focus on five main areas of work:

Re-usable Structured Documentation

  • Improvement of structure. Definition of required and optional items for multiple documentation types.
  • Modular examples with the perspective of a specific audiences or purposes
    • Software developer, archivist, data producer, training tutorial
    • The examples should be realized in the used technical format (restructured text)
    • Examples in different target formats should be realized, like for single documents, linked documents, slides.
  • This work will continue similar work in the 2016 workshop in Dagstuhl.

Production Framework

Test Cases

  • Technical test cases in binding syntax (XML and RDF) for core items of Functional Views (Data Description and Data Capture). The test cases of W3C specifications are the role model for this. A good example is the suite of test cases for CSV on the Web.

RDF-S/OWL

Work plan and partial resolution of tasks described in document from Cologne meeting (2016-12). Details see document “RDF Work Specification”.

Program Libraries

  • Java binding
    • Conceptual approach: documentation, rules, prototype, review of existing approaches.
    • Rules for round trip of metadata between bindings, especially XML, RDF, and Java
      • Each representation has its own limitations. These limitations shouldn’t have an influence on the metadata round trip. A common set of structural features of these representations could be the solution.
    • JSON-LD. Conceptual approach: generation on basis of RDF-S/OWL or on basis of UML model.
Draft Agenda

Link to Dagstuhl Daily Schedule (meal times, etc.)

Time Period

Group 1

Group 2

Group 3

Monday am

Test Cases

Dan, Darren, Deirdre, Eric, Jay, Larry*, Oliver, Olof, Benjamin

DDI4 Goals and Design Rules

Achim, Arofan, Dan, Guillaume, Johan, Jon, Wendy*

 

Monday pm

Production Framework

Dan, Johan, Jon, Oliver, Olof, Wendy*

OWL/RDF

Achim*, Arofan, Dan, Darren, Deirdre, Eric, Guillaume, Benjamin

 

Tuesday am

Production Framework

Achim, Dan, Johan, Jon, Oliver, Olof, Wendy*

OWL/RDF

Achim*, Arofan, Dan, Darren, Deirdre, Eric, Guillaume, Benjamin

 

Tuesday pm

Structured Documentation

Jay, Johan, Jon*, Larry, Wendy

OWL/RDF

Achim*, Arofan, Dan, Darren, Deirdre, Eric, Guillaume, Oliver, Olof, Benjamin

 

Tuesday EveningCOGS Discussion

Wednesday

Plenary

Packaging & Views : Production Framework

RDF Discussion to date - What to propose


Wednesday am

Test Cases

Achim, Dan, Darren, Deirdre, Eric, Larry*, Oliver, Olof, Benjamin


Data Description Capture Integration Issues

Dan, Guillaume, Jay, Johan, Jon*, Larry, Wendy

 

Wednesday pm

Test Cases

Achim, Dan, Darren, Deirdre, Eric, Johan, Larry*, Oliver, Benjamin

Data Description Capture Integration Issues

Dan, Guillaume, Jay, Johan, Jon*, Larry, Wendy

 RDF

Achim, Darren, Guillaume, Eric, Jay, Deirdre, Benjamin....

Thursday Plenary

EDDI Sprint

Thursday am

Canonical XMI & production process

Achim*, Arofan, Guillaume, Wendy, Benjamin, Dan, Johan, Olof


Breakouts:

Larry, Eric, Benjamin, Oliver

Darren, Dierdre

Thursday pm

Roundtrip btw Bindings

Achim, Dan, Darren, Deirdre, Eric, Jay, Larry*, Oliver, Benjamin

Prototype Description

Arofan, Guillaume, Johan, Jon, Olof, Wendy*

Programming Libraries

Dan, Darren, Deirdre, Oliver*, Olof


Friday PlenaryPrototype Description

Friday am

Prototype Description

Achim, Arofan, Dan, Johan, Jon, Olof, Wendy*

RDF/Production Framework 

Darren, Deirdre, Eric, Guillaume, Jay, Larry, Oliver, Benjamin

Validation Discussion

Who?

Friday pm




  File Modified

ZIP Archive Build-2017-07-10.zip

Jul 14, 2017 by Jon Johnson

ZIP Archive Build-2017-07-03.zip

Jul 14, 2017 by Jon Johnson

ZIP Archive Build-2017-06-28.zip

Jul 14, 2017 by Jon Johnson

Microsoft Word Document lawrence_sprint_report.docx

Jul 14, 2017 by Jon Johnson

ZIP Archive Build-2017-06-02.zip

Jul 14, 2017 by Jon Johnson

ZIP Archive Build-2017-06-01.zip

Jul 14, 2017 by Jon Johnson

ZIP Archive Build-2017-05-30.zip only difference to the last build is the existence of xhtml schema files

Jul 14, 2017 by Jon Johnson

File AES.yml

Jul 14, 2017 by Jon Johnson

File 2017_02_24SimpleCodebookView.yml

Jul 14, 2017 by Jon Johnson

Microsoft Powerpoint Presentation SampleUseCasesForTheSimpleCodebookView.pptx

Jul 14, 2017 by Jon Johnson

Microsoft Word Document DDI Data Description Example of Use_Unit_CSV.DOCX

Jul 14, 2017 by Jon Johnson

Microsoft Excel Spreadsheet CodebookFunctionalView.xlsx

Jul 14, 2017 by Jon Johnson

ZIP Archive Build-2017-05-26.zip Build contains XSD, RDF/OWL and reStructured

Jul 14, 2017 by Jon Johnson

Microsoft Powerpoint Presentation Collections Relativity.pptx

Jul 14, 2017 by Jon Johnson

Microsoft Powerpoint Presentation Collections-116-114.pptx

Jul 14, 2017 by Jon Johnson

Microsoft Powerpoint Presentation collectionrevisions.pptx

Jul 14, 2017 by Jon Johnson

Microsoft Powerpoint Presentation mood and mode v5.pptx

Jul 14, 2017 by Jon Johnson

Microsoft Powerpoint Presentation DDI 4 goal.pptx

Jul 14, 2017 by Jon Johnson

Microsoft Word Document Design Principles.docx

Oct 15, 2017 by Wendy Thomas

Microsoft Word Document Technical Committee Review Coverage.docx

Oct 15, 2017 by Wendy Thomas

Microsoft Word Document DDI Roadmap v1.0.docx

Oct 15, 2017 by Wendy Thomas

Microsoft Word Document Where DDI4 is today.docx

Oct 15, 2017 by Wendy Thomas

PDF File Dagstuhl2017Week1Topics.pdf

Oct 15, 2017 by Wendy Thomas

Microsoft Word Document RDF Work Specification.docx

Oct 16, 2017 by Wendy Thomas

ZIP Archive ddi4_2017-09-28.zip

Oct 16, 2017 by Oliver Hopt

Microsoft Word Document WhyanewversionofDDI.docx

Oct 16, 2017 by Wendy Thomas

ZIP Archive ddi4-2017-10-16.zip

Oct 16, 2017 by Oliver Hopt

ZIP Archive ddi4_2017-10-16_new.zip

Oct 16, 2017 by Oliver Hopt

ZIP Archive ddi4_2017-10-17.zip

Oct 17, 2017 by Oliver Hopt

ZIP Archive xsd-20171018T065828Z-001B.zip

Oct 18, 2017 by Larry Hoyle

Microsoft Powerpoint Presentation forWednesdayNight.pptx

Oct 18, 2017 by Wendy Thomas

Microsoft Word Document RoundTripNotes2017_10_19.docx

Oct 19, 2017 by Larry Hoyle

Microsoft Word Document NotesDagstuhlProgrammingLanguages2017_10_19_4pm (1).docx

Oct 19, 2017 by Larry Hoyle

Microsoft Powerpoint Presentation GoalsWeek1.pptx

Oct 22, 2017 by Joachim Wackerow

Microsoft Word Document Validation in the DDI_jj.docx

Nov 15, 2017 by Kelly A Chatain

PDF File Report from Week One of Dagstuhl Sprint.pdf

Jan 19, 2018 by Kelly A Chatain

Materials for Use in Sprint

Please see the individual topic pages for these materials.

Local Information

Dates: October 16 -  20, 2017

The workshop takes place at Schloss Dagstuhl. It has the Dagstuhl event number 17423 and a related web page.

See the separate page for practical information.


Attendees

First Name

Last Name

Organization

Darren

Bell

UK Data Service

Kelly

Chatain

Survey Research Center, University of Michigan

Guillaume

Duffes

INSEE - French National Institute of Statistics and Economic Studies

Johan

Fihn

Swedish National Data Service (SND)

Jay

Greenfield


Arofan

Gregory


Oliver

Hopt

GESIS - Leibniz Institute for the Social Sciences

Larry

Hoyle

Institute for Policy & Social Research, University of Kansas

Jon

Johnson

UK Data Service

Deirdre

Lungley

UK Data Service

Olof

Olsson

Swedish National Data Service (SND)

Eric

Prud'hommeaux

World Wide Web Consortium (W3C)

Dan

Smith

Colectica

Wendy

Thomas

Minnesota Population Center, University of Minnesota

Joachim

Wackerow

GESIS - Leibniz Institute for the Social Sciences

Benjamin

Zapilko

GESIS - Leibniz Institute for the Social Sciences