Agenda
Monday, October 17, 2016
Morning
Session 1 (plenary): Introductions, goals of the workshop, and overall introduction to DDI
Introduction of participants
Introduction to DDI
DDI-Codebook
DDI-Lifecycle
DDI Moving Forward Project
What do we work with and the challenges that we’ve encountered, especially regarding interoperability with other specifications
Session 2 (plenary): Presentations from external experts on specific specifications or usage of specifications.
External presentations:
Klas Blomqvist – GSIM
Flavio Rizzolo – LIM
Arofan Gregory – SDMX
Eric Prud’hommeaux – FHIR
Barrie Nelson - CDISC
Afternoon
Session 3 (plenary): Presentations from external experts on specific specifications or usage of specifications.
External presentations:
Nick Car – PROV
Gregg Kellogg – CSVW
Gary Berg-Cross – RDA & Ontologies
Daniella Meeker – standards in biomedical research
Darren Bell – UKDA’s use of DDI in Big Data
Session 4 (plenary): Discussion of interoperability
Interoperability
Presentation by Wendy Thomas
Begin discussion towards an agreed definition of interoperability
Evening:
Possible evening session - Topic to be determined
Tuesday, October 18, 2016
Morning
Session 1 (plenary): Discussion of interoperability
Ensure we have an agreed upon definition of Interoperability
Begin discussion of business scenarios
Short presentations of requested business scenarios (Jay, Eric, Jon, Barry)
Collection of business scenarios
Session 2 (working groups):
Session 2A: Business scenario: Situations where we would use different standards together
Outcome: Document outlining these business scenarios
Description of the Business Scenario in a consistent way
Agree on a set of functionalities across the specifications
List of requirements
Session 2B: Common Aspects of all specifications present during Data Description
Outcome: 2 tables:
High level table demonstrating what specs are used for specific aspects of … see Arofan’s table
Detailed table mapping the important properties and relationships within classes, terminology differences, identify significant differences
Afternoon
Session 3 (working groups):
Session 3A: Provenance of data, metadata
Common understanding of what provenance is
What are the requirements of provenance
Overview of real life provenance applications
Outcome: A common definition / understanding of provenance for the purpose of this workshop
Discussion of provenance
Real life examples
Shared and agreed definition of provenance
Session 3B: Design patterns
Review of practical use in DDI 4
Generic principles, what is the right balance between clean structure on the model level and understandable use on the binding level
Pros/cons on model level, binding level
Outcome: document outlining:
Evaluation of current design patterns in DDI4
What can we learn from the other specifications
Reason for pattern use and style of pattern
Session 4 (working groups):
Session 3A continued: Provenance of data, metadata
Session 3B continued: Design patterns
Evening:
Possible evening session - Topic to be determined
Wednesday, October 19, 2016
Morning
Session 1 (working groups):
Session 1A: Bindings of DDI, CDISC, HL7/FHIR, SDMX, GSIM, CSV on the Web
Approaches of modeling and/or generation
Usage of other vocabularies
The process you use to create your specifications
Outcome: document outlining why other specs have chosen their particular bindings, the process used to create them, their rationale, and approaches to new releases and versioning; how to deal with issues of identification and round-tripping of data and metadata
Session 1B: Operations (Protocols) with specifications, i.e. REST with DDI, CDISC, HL7/FHIR, SDMX, GSIM, CSV on the Web (Part 1)
Goal is to find a generic way of operations to support combined usage of specifications
Requirements from a user’s perspective
Outcome: document contrasting the approaches used to interchange protocols (REST, SPARQL, SOAP, etc..) by the different specifications; Proc and cons of having standard interfaces as part of the specification and the rationale
Session 2 (working groups):
Session 1A continued: Bindings of DDI, CDISC, HL7/FHIR, SDMX, GSIM, CSV on the Web
Session 1B continued: Operations (Protocols) with specifications, i.e. REST with DDI, CDISC, HL7/FHIR, SDMX, GSIM, CSV on the Web
Session 2 (plenary):
Discussion – to determine if there is a consensus view of which bindings/protocols should be used; what does this picture look like? (Current and future perspective - possible developments)
Afternoon
Session 3 (working groups):
Session 3A: Provenance of data/metadata (Data production)
Relationship of stages of the data life cycle
Relationship of tasks in GSBPM/GLBPM, other process models
Relationship of aggregate data to underlying microdata – description of processing
How to record provenance; Can PROV-O describe other process models such as GSBPM/GLBPM? How can the ideas of PROV-O be used in non-RDF bindings?
Outcome:
Quick presentation to the larger group on discussion and findings
Document from presentation to be used in joint doc
Session 3B: Provenance of data/metadata – RDF (Data publication)
Usage of W3C PROV-O in DDI 4 RDF and other specs with RDF bindings
How do we capture sufficient information around provenance and processing?
Goal: best practice for using PROV-O with other Specs
Outcome:
Quick presentation to the larger group on discussion and findings
Document from presentation to be used in joint doc
Session 4 (plenary):
Plenary session for groups to report back to exchange findings and open discussion
Outcome: Extension of provenance definition document to include:
Data production discussion
Data publication discussion
Evening:
Possible evening session - Topic to be determined
Thursday, October 20, 2016
Morning
Session 1 (working groups):
Session 1A: RDF bindings for statistical data and metadata
Discuss requirements for RDF applications that deal specifically with statistical data
What are the best RDF vocabularies for data and metadata description?
Specific best practice for using multiple vocabularies for this purpose
Outcome: document describing the requirements and best practices for use of RDF vocabularies in statistical applications
Session 1B: Data description, general mapping table of multiple specifications with equal and similar items
Commonalities across DDI, CDISC, HL7/FHIR, SDMX, GSIM, CSVW - data description aspect
More detailed discussion as follow-up to Tuesday’s discussion
Opportunity for folks not involved on Tuesday to participate
Outcome: document outlining the relationship of different standards for data description
Session 2 (working groups):
Session 1A continued: RDF bindings for statistical data and metadata
Session 1B continued: Data description, general mapping table of multiple specifications with equal and similar items
Afternoon
3rd Session (plenary):
Session 3A: DD-R package
discuss and document program for moving forward on a Data Description package in R
Session 3B: RESTful
Discuss and document information for moving forward on a collaborative RESTful
4th Session (working groups):
Session 4A: RDF Bindings and Production framework with Elisa Kendall - via remote
Session 3A continued: DD-R Package
Session 3B continued: RESTful
Friday, October 21, 2016
Morning
1st Session (working groups):
Session 1A: Start the conversation for future collaboration – create a small working group to work out the details
Multiple specs use case – work in the future?
Think about active proposals for future collaboration
Session 1B: Statement of design principles to be agreed by specs represented (Data Manifesto)
How do we make available?
How/where do we publish?
Outcome:
The outline and draft of agreed design principles (“Data Manifesto”)
Documentation of work
2nd Session:
Afternoon
3rd Session (plenary):
Communication and sustainability