Simple Codebook Meeting March 2, 2015Present: Michelle Edwards, Dan Gillman, Oliver Hopt, Larry Hoyle, Steve McEachern, Mary Vardigan This The group welcomed Michelle Edwards of CISER. The chair noted that this group is in a sense waiting for other groups (Discovery, Data Description, Instrument) to complete what they are doing so that we can finish our work. We recognize a need to try to incorporate the idea that we should be able to combine to incorporate both Codebook and Lifecycle into one spec (DDI 4), so we have been exploring that in our group a bit. DDI Lite was reviewed and compared with the element sets that ICPSR, GESIS, and IHSN use and they are a fairly good match. We won't be able to exactly duplicate Codebook and Lifecycle as views of DDI 4 but we can get close. Organizations that have invested in 3.2 do not want to lose that investment. Can we map 3.2 to 4 by automatically importing what's in 3.2? We may need a conversation with Guillaume about this. This should probably be at the Advisory Group level. DDI Codebook and Lifecycle have different names for the same element. We will need mappings for people. What we write out is also important. Interoperability can be defined in terms of reading and writing out of a system. If we can read 2.5 into 4, we are able to ingest anything that occurs anywhere under 2.5. We want to be able to write an instance that contains all the semantic content of Codebook. If we know that there is an equivalence we should have a 2.5 writer to write it out in that name. It is the structure and the mappings that matter. There were changes between Codebook and Lifecycle that were not necessarily clean because of the use of things by reference in 3 (categories and codes). Upward compatibility may be tougher than downward compatibility. We should probably not worry about 3 here but concern ourselves with mapping 2.5 to into 4. Is Codebook still an aggregation of Discovery, Description, and Instrument? Right now Discovery is a stripped down element set. We could start with 2.5 as a starting point and we need to be able to account for this. Then we could look at 4 and ask whether the following things are everything is covered. Can we restrict this to 2.5 Lite? Generally, yes. A Codebook view would be intended for an audience that is creating or managing codebooks and it doesn't matter what things are in other views or packages. Views can overlap as much as you want. DDI Lite is a view. DDI 2.5 is a view. We are leveraging the experience of all these repositories repositories (ICPSR, GESIS, IHSN) in serving up data, so that makes a good codebook. Use It makes sense to rely on DDI Lite, which we know is used. The group reviewed the elements in DDI Lite. ADA uses a few other elements like deposit date, alternative title, collection situation, etc.. ADA uses the default Nesstar template which is close to DDI Lite. We should look at Nesstar also. The CESSDA Profile would be the best thing to use. We need to identify where things are already defined in 4 and where things still need to be defined in 4. We need to know what is missing from 4 in order to have a sense of where we stand. Our group could then go to the AG to say what needs to be addressed in sprints. If we have something in 4 that maps to Nesstar/CESSDA profile, that allows a big chunk of DDI users to adopt 4. There is another migration path we can look at: we have 2.5 codebook - is there a more modern one? Migrate 2.5 to something different? This may be out of scope for our group but we should discuss it. |