DDI-CDI (DDI Cross-Domain Integration)
DDI-CDI is a new standard which is designed to be used with research data from any domain. While it minimally describes metadata for cataloguing and citation, its fundamental purpose is to describe data and process. The specification is domain-neutral and covers the majority of data structures in common use today: Wide, Long, Multi-Dimensional and Key-Value. It offers, for the first time, a mechanism to interoperate disparate data from multiple disciplines and domains at the lowest level of granularity i.e. the datum itself. While it is designed to complement its siblings in the DDI Alliance Product Suite - DDI-Codebook and DDI-Lifecycle, which operate in the Social, Behavioral and Economic domain - it is also intended to work with a wide variety of other domain-specific and generic metadata specifications. Integration is a first-order consideration in DDI-CDI and so it is designed from the ground up to work well with controlled vocabularies from any domain as well as with other standards.
DDI-CDI is a new kind of specification, aimed at both supplementing existing metadata models, and serving a unique purpose in its own right. Its key features include:
-
Model-driven
-
Domain-independence
-
Datum-oriented data description
-
Provenance-focused: process description down to a datapoint level if required
DDI-CDI goes back to first principles and abstracts the foundational characteristics of different data structures. On this basis, it uses a “model-based” approach using UML classes. For non-modelers, this simply means that DDI-CDI can be used in the format of your choice, whether you prefer XML, JSON, or other implementation syntaxes.
The UML Model
The core of DDI-CDI is a model described using the Unified Modeling Language (UML). It is expressed in Canonical XMI, an exchange format for UML models which has been tested to work with many different UML tools. The subset of UML features conforms to the UML Class Model Interoperable Subset (UCMIS) guidelines, which further constrain the features to guarantee greater interoperability
ddi-cdi_canonical-unique-names.xmi
The model as Canonical XMI file with UML version 2.5 and XMI 2.5.1, and with unique association names
Supports Activities
- The first one supplies a rich set of foundational metadata for variables, classifications, and other concepts and representations.
- The second one describes data in rectangular (wide), long (event), multi-dimensional (cube), and no-SQL (big data) data formats.
- The third one describes process as the primary aspect of data provenance.
Best Practices
The DDI-CDI specification has an extensive overview document, and browsable field-level documentation which also provides information about the XMI description and the syntax representations for XML and RDF encodings.
Examples
-
SPSS data -
-
STATA data -
-
Process -
-
Resources
- DDI-CDI Brochure (PDF)
- DDI Cross-Domain Integration (DDI-CDI):
- Complementing the DDI Product Suite. [slide deck, video]
- DDI-CDI and other Standards
- The current encodings and syntax representations provided for the version 1.0 are:
- Tools used to create, display, and access DDI instances (filtered by product)
- Training Materials
- DDI Glossary
License
DDI CDI is free software: you can redistribute it and/or modify it under the terms of the Creative Commons Attribution 4.0 International license. Other DDI documents are similarly distributed under the same Creative Commons license
Development Work
Development of DDI-CDI is managed by the Cross Domain Integration (CDI) Working Group. The work of the CDI can be found at Cross Domain Integration (CDI) Working Group on the DDI Confluence site.
Development Repository
Issues related to the development of DDI-CDI are managed at DDI-CDI Issues - GitHub. Please use this issue tracker for questions, suggestions, or if you need assistance.