Includes information about the physical instance of a data product (an actual data file). It completes the documentation contained in the Physical Data Product module that is specific to the individual file and serves as a descriptive record of the external data file. Physical Instance provides a citation for the data file, a link to the RecordLayout(s) used by the files records, a description of it coverage (as a constraint if different from the study), check figures for quality control (e.g. digital fingerprint, record count, etc.), and a statistical summary of the data in the file at both the variable and category level.
Includes information about the physical instance of a data product (an actual data file). It completes the documentation contained in the Physical Data Product module that is specific to the individual file and serves as a descriptive record of the external data file. Physical Instance provides a citation for the data file, a link to the RecordLayout(s) used by the files records, a description of it coverage (as a constraint if different from the study), check figures for quality control (e.g. digital fingerprint, record count, etc.), and a statistical summary of the data in the file at both the variable and category level.
A citation for the physical instance of a data set. Note that a DOI or similar unique identifier for the data file should be placed in InternationalIdentifier. It is strongly recommended that use of a Citation in this location includes the use of the optional sub-element Title.
Allows for assigning a hash value (digital fingerprint) to the data or data file. Set the attribute flag to "data" when the hash value provides a digital fingerprint to the data contained in the file regardless of the storage format (ASCII, SAS, binary, etc.). One approach to compute a data fingerprint is the Universal Numerical Fingerprint (UNF). Set the attribute flag to "dataFile" if the digital fingerprint is only for the data file in its current storage format.
Includes information about the topical, spatial, and temporal coverage of the physical instance. May be expressed as a restriction of the parent study coverage.
Provides information about other resources related to the physical instance.
Reference to the DataRelationship containing the LogicalRecord to which the RecordLayout refers. Repeat for cases where LogicalRecords are described in multiple DataRelationship structures. Note that this does not imply that all of the LogicalRecords described in the DataRelationship are contained, wholly or in part in the PhysicalInstance. This reference allows for a direct path between the PhysicalInstance and the related content found in a LogicalProduct.
References the record layout of the data documented in the physical instance.
References the content of the default missing values used in the file. The content of this file overrides default missing value information provided in the LogicalRecord. Allows for the specification that is a Systems Missing Value.
Identifies the data file documented in the physical instance and provides information about its location.
Provides the version information for the data file related to this physical instance. Note that while Physical Instance allows for multiple copies of the same data file (such as backup copies) the assumption is that they are identical in terms of content, layout, format and version.
A reference to a Quality Statement pertaining to the quality of the study methodology, metadata, or data to which it is associated. Quality statements may be related to external quality standards.
Includes information about the file structure, as well as other characteristics that are specific to the physical instance.
Contains information proprietary to the software package which produced the data file. This is expressed as a set of name-value pairs. The value may be taken from a controlled vocabulary.
Includes variable and category statistics data documented in the physical instance, or a reference to a physical instance where the statistics are described or located in line.
Contains a term from a controlled vocabulary indicating the byte ordering.
Provides the version information for the data file related to this physical instance. Note that while Physical Instance allows for multiple copies of the same data file (such as backup copies) the assumption is that they are identical in terms of content, layout, format and version.
Provides the version information for the data file related to this physical instance. Note that while Physical Instance allows for multiple copies of the same data file (such as backup copies) the assumption is that they are identical in terms of content, layout, format and version. The minimum information required is the versionNumber. Additional information on the versionDate, the type of version number when multiple types are supported by an agency, as well as information on VersionResponsibility (inline or by reference) and VersionRationale are available to provide additional information for process tracking and/or informing users of the differences between this and the previous version of the file.
This is the name of the versioning scheme as defined by the user's system, in cases where the user's system employs more than one versioning scheme.
Allows for identification of the person or organization within the maintenance agency responsible for the new version. This is primarily intended for internal use.
Person or organization within the MaintenanceAgency responsible for the version change. If it is important to retain the affiliation between and individual responsible for the version and his/her agency, it may be included in this notation. This is primarily intended for internal use.
Reference person or organization within the MaintenanceAgency responsible for the version change, as described in an OrganizationScheme. If it is important to retain the affiliation between and individual responsible for the version and his/her agency, a Relation should be created between the individual referenced here and his/her organization. This is primarily intended for internal use.
Textual description of the rationale/purpose for a version change.
The version number of the data file identified by this physical instance.
Date of version using the union set BaseDateType. Duration should not be used in this field, even though allowed by the ISO format enforced by the parser.
Contains a term indicating the byte ordering. Supports the use of a controlled vocabulary. DDI strongly recommends the use of a controlled vocabulary.
Allows for assigning a hash value (digital fingerprint) to the data or data file.
Allows for assigning a hash value (digital fingerprint) to the data or data file. Set the attribute flag to "data" when the hash value provides a digital fingerprint to the data contained in the file regardless of the storage format (ASCII, SAS, binary, etc.). One approach to compute a data fingerprint is the Universal Numerical Fingerprint (UNF). Set the attribute flag to "dataFile" if the digital fingerprint is only for the data file in its current storage format.
Contains the value of the specified digital fingerprint.
Specifies the type of the fingerprint (what algorithm or scheme).
Contains the version of the algorithm.
Brief identification of the type of data fingerprint used. The data fingerprint may be for the data file (storage format specific) or data (format neutral).
Declares if the DataFingerprint is for a data file or for the data content.
The data fingerprint applies to the data content(format neutral).
The data fingerprint applies to a specific file format of the data.
Includes information about the file structure, as well as other characteristics that are specific to the physical instance. Information includes place of production, processing checks to validate the content, processing status, the software used to create the data file, and check sums for the number of cases and overall record count.
Indicates the place where the physical instance was produced expressed as a simple string.
Documents processing checks and other operations performed on the data file.
Processing status of the data file. Supports the use of an external controlled vocabulary.
Indicates the software that was used to create the data file.
Number of cases or observations in the data file. Caution in using optional checksums is recommended. Conflict between checksums and the items being counted can cause problems with warning flags during processing. If using checksum to capture information for internal processing purposes, the use of automatically generated check sums is strongly urged.
Overall record count in the data file. Caution in using optional checksums is recommended. Conflict between checksums and the items being counted can cause problems with warning flags during processing. If using checksum to capture information for internal processing purposes, the use of automatically generated check sums is strongly urged.
Identifies the data file documented in the physical instance and provides information about its location.
Documents the location of the data file as a description. Supports the multi-language content.
A URN or URL for the data file with a flag to indicate if it is a public copy.
Set to "true" when this file is the master file (in the case that there are multiple, i.e. backup, copies).
A URN or URL for the data file with a flag to indicate if it is a public copy.
A URN or URL for a file with a flag to indicate if it is a public copy.
Set to "true" (default value) if this file is publicly available. This does not imply that there are not restrictions to access. Set to "false" if this is not publicly available, such as a backup copy, an internal processing data file, etc.
Provides a statistical summary of the data in the related file as a set of variable level and category level statistics. May refer to a set of statistics provided in another physical instance (for example if the same data is held in multiple storage formats) or if the summary statistics are held as a separate data set.
Reference to a PhysicalInstance that describes a data file containing the summary and/or category statistics OR contains the statistics in-line as in the case of the same data stored as an ASCII file and as an ORACLE file where the summary and category statistics are listed only in one of the physical instance files.
One or more statistical measures that describe the responses to a particular variable. Include both variable and category level statistics.
Inclusion of a VariableStatistics by reference.
References a PhysicalInstance module that describes a data file containing the summary and/or category statistics OR contains the statistics in-line. For example, when the same data are stored as an ASCII file and as an ORACLE file, the summary and category statistics would only be listed in one of the physical instance files, and referenced in the other(s).
References a physical instance containing the statistics inline or that identifies the data file which contains the statistics.
Set to "true" if the summary/category statistics are found inline in the referenced physical instance. Set to "false" if they are in the data file associated with the physical instance.
Contains summary and category level statistics for the referenced variable. Includes information on the total number of responses, the weights in calculating the statistics, variable level summary statistics, and category statistics. The category statistics may be provided as unfiltered values or filtered through a single variable. For example the category statistics for Sex filtered by the variable Country for a multi-national data file. Note that if no weighting factor is identified, all of the statistics provided are unweighted.
Reference to the variable to which the statistics apply.
The total number of responses to this variable. This element is especially useful if the number of responses does not match added case counts. It may also be used to sum the frequencies for variable categories.
Reference to the StandardWeight value provided in Weighting.
Reference to a variable to use for weight in calculating the statistic.
Indicates the missing values that were excluded from the statistic by referencing the ManagedMissingValuesRepresentation used by the Variable.
A summary statistic for the referenced variable.
The unfiltered values of any number of statistics by category value representing the full response distribution of the variable.
Includes category-level statistic for the referenced variable using another variable to filter the categories through. For example, the Eurobarometer may filter its category statistics by country as represented in a variable "CountryCode".
The unfiltered values of any number of statistics by category value representing the full response distribution of the variable.
A structure that is repeated for each category value for which one or more statistics are recorded. Each VariableCategory has one category value and any number of associated statistics.
A category value for which one or more statistics are recorded. Each VariableCategory has one category value and any number of associated statistics.
The value of the category.
The value of a statistic associated with the category value indicated in the sibling CategoryValue element.
References a specific Code within the variable if using a CodeRepresentation. May alternately provide the Value of the Category.
Provides the specific value of the variable being used as a filter. References a specific Code within the variable if using a CodeRepresentation. May alternately provide the Value of the Category.
A category value for which one or more statistics are recorded. Each VariableCategory has one category value and any number of associated statistics.
A reference to the coded value of the category as used by a CodeRepresentation.
The value of the category.
The value of a statistic associated with the category value.
Type of category statistic. Supports the use of an external controlled vocabulary. DDI strongly recommends the use of a controlled vocabulary.
The value of the statistics and whether it is weighted and/or includes missing values.
A classification of the type of category statistic provided. Supports the use of an external controlled vocabulary. DDI strongly recommends the use of a widely shared controlled vocabulary to support interoperability.
Category statistics filtered by the value of a second variable. Essentially a cross tabulation of one variable by another. For example variable may be crossed with country as is done in the Eurobarometer when reporting category statistics. For example, the Eurobarometer may filter its category statistics by country as represented in a variable "CountryCode".
Reference to the variable used to filter the category level statistics.
Provides filtered category statistics for the specified filter variable category.
Category statistics for the variable when the filter variable contains the specified value.
Provides the specific value of the variable being used as a filter. References a specific Code within the variable if using a CodeRepresentation. May alternately provide the Value of the Category.
Category statistics for the specified value of the variable (when the filter variable contains the specified value). Repeat for each value of the variable.
Describes a summary statistic for a variable.
Type of summary statistic, such as count, mean, mode, median, etc. Supports the use of an external controlled vocabulary. DDI strongly recommends the use of a controlled vocabulary.
The value of the statistics and whether it is weighted and/or includes missing values.
The value of the statistics and whether it is weighted and/or includes missing values.
Set to "true" if the statistic is weighted using the weight designated in VariableStatistics.
Defines the cases included in determining the statistic. The options are total=all cases, valid and missing (invalid); validOnly=Only valid values, missing (invalid) are not included in the calculation; missingOnly=Only missing (invalid) cases included in the calculation.
A classification of the type of summary statistic provided. Supports the use of an external controlled vocabulary. DDI strongly recommends the use of a widely shared controlled vocabulary to support interoperability.
The value of the statistics and whether it is weighted and/or includes missing values.
Provides a statistical summary of the data in the related file as a set of variable level and category level statistics.
Describes a summary statistic for a variable.
The unfiltered values of any number of statistics by category value representing the full response distribution of the variable.
Category statistics filtered by the value of a second variable.
The value of a statistic associated with the category value indicated in the sibling CategoryValue element.
Reference to an existing StandardWeight using the Reference structure. TypeOfObject should be set to StandardWeight.
Reference to an existing MissingValuesRepresentation using the Reference structure. TypeOfObject should be set to ManagedMissingValuesRepresentation.
Identifies the data file documented in the physical instance and provides information about its location.
Includes information about the file structure, as well as other characteristics that are specific to the physical instance.
Describes a specific software package.
Contains summary and category level statistics for the referenced variable.
Reference to an existing Variable using the Reference structure. TypeOfObject should be set to Variable.
Processing Check provided using a StructuredString to support multiple language versions of the same content as well as optional formatting of the content.
The DigitalFingerprintValue expressed as an xs:string.
Category statistics for the variable when the filter variable contains the specified value.
A category value for which one or more statistics are recorded.
References a PhysicalInstance module that describes a data file containing the summary and/or category statistics OR contains the statistics in-line.
Identifies the default missing value parameter for the this physical instance by referencing a ManagedMissingValuesRepresentation plus flag to identify it as a system missing value. TypeOfObject should be set to ManagedMissingValuesRepresentation.
Inclusion of a VariableStatistics by reference TypeOfObject should be set to VariableStatistics.
Identifies the default missing value parameter for the this physical instance by referencing a ManagedMissingValuesRepresentation. Note that this MissingValues declaration overrides the value found in the LogicalRecord if it conflicts. The assumption is that this is a System Missing Value declaration, specific to the storage format of the file. If not, change the value of isSystemMissingValue to "false". TypeOfObject should be set to ManagedMissingValuesRepresentation.
The assumption is that this is a System Missing Value declaration, specific to the storage format of the file (default value of "true"). If not, change the value to "false".
Defines the cases included in determining the statistic.
All cases, both valid and missing (invalid).
Only valid values, missing (invalid) are not included in the calculation.
Only missing (invalid) cases included in the calculation.