The DDI (Data Documentation Initiative) metadata standard, originally created in 1995 to document social science research data, has in recent years become relevant to new user groups, including the official statistics and medical research communities. In order to respond to these new users, DDI is developing a model-based specification (DDI Version 4) that can be expressed in XML Schema, RDF-S/OWL, relational database schema, and program languages. Such a data model will make it easier to interact with other disciplines and other standards, to understand the specification, to develop and maintain it in a consistent and structured way, and to enable software development that is less dependent on specific DDI versions.

Throughout the past year, content modeling teams have been working virtually to model DDI 4 to ensure that it can document a broad spectrum of data. This year’s Dagstuhl “sprint” will focus creating re-usable multi-purpose documentation, controlled vocabularies, complex data capture and description, and funding proposals.

Goals

This workshop will extend and build upon the progress made during the past 3 years of development. The sprint will focus on four main areas of work:

Re-usable Structured Documentation: As DDI releases a first version of the Codebook Functional View to the community, documentation and documentation structure becomes essential for its successful implementation. The work during this week, will concentrate on developing both high-level and field-level documentation to assist archives, libraries, and statistical agencies, migrating from an older version of DDI to DDI 4.

Data Capture: The integration of data capture into the full DDI Views model; real world examples which will be documented.

Data Description: Validation and quality assessment of the data description model using a range of real world examples.

Controlled Vocabularies: A closer working relationship with the Controlled Vocabularies Group since Controlled Vocabularies are used throughout the model.

Funding Opportunities: Explore opportunities and create re-usable documentation for potential funding proposals, scope is local, national and international.

Strategic Infrastructure: A long term plan for how the DDI Alliance fits into the larger social science research community.

Outcomes

  1. Develop a documentation structure which can be re-used for multiple purposes. Initial base content will be written that will support the needs of different user groups (i.e. technical perspective), applied usage in substantive content areas, etc. The modular form of the documentation will also provide the basis for training materials to be used by all DDI user communities.
  2. Profile of the DDI requirements for a tool to support the creation and maintenance of Controlled Vocabularies. A defined process for more closely integration of the Controlled Vocabularies into the prodution activities of DDI
  3. The existing model on data capture and data description will be expanded for complex scenarios.Building blocks for funding proposals will be written.
  4. Building blocks for funding proposals will be written.

See also the Dagstuhl webpage for this event.

In preparation, please review the following documents:

Read TheseLink

Goals of the workshop

Page
Introduction to the DDI4 Logical Data Description Package

Document v.3

Document v.4

The Dagstuhl Sprint takes place 24-28 October 2016 at Schloss Dagstuhl.

See the separate page for practical information.

NameAffiliationTopic
George Alter

University of Michigan
Population Studies Center


Ingo BarkowHTW Chur, University of Applied Sciences
Bill BlockCISER - Cornell Institute for Social and Economic Research
Kerrin BorschewskiGESIS - Leibniz Institute for the Social Sciences
Kelly Chatain

University of Michigan
Survey Research Center


Michelle EdwardsCISER - Cornell Institute for Social and Economic Research
Anne EtheridgeUKDS - UK Data Service
Dan GillmanBLS - U.S. Bureau of Labor Statistics
Arofan GregoryAeon Technology
Oliver HoptGESIS - Leibniz Institute for the Social Sciences
Larry Hoyle

University of Kansas
Institute for Policy & Social Research


Sanda Ionescu

University of Michigan
ICPSR - Interuniversity Consortium for Political and Social Research


Taina JääskeläinenFSD - Finnish Social Science Data Archive
Jon Johnson

UCL Institute of Education
Centre for Longitudinal Studies


Amber Leahey

University of Toronto
Libraries


Jared Lyle

University of Michigan
ICPSR - Interuniversity Consortium for Political and Social Research


Katy McNeillUKDS - UK Data Service
Katja MoilanenFSD - Finnish Social Science Data Archive
Barry Radler

University of Wisconsin-Madison
Institute on Aging


Dan SmithColectica
Wendy Thomas

University of Minnesota
Minnesota Population Center


Joachim WackerowGESIS - Leibniz Institute for the Social Sciences
Knut Wenzig

DIW Berlin - German Institute for Economic Research
SOEP - German Socio-Economic Panel


Wolfgang Zenk-MöltgenGESIS - Leibniz Institute for the Social Sciences