Standard set of characters upon which many character encodings are based (Wikipedia). This vocabulary lists some popular character sets but is not exhaustive. If the character set is not mentioned in the list, use 'Other' for this element and enter the specific name into the OtherValue element. You can search for the name of the character set in the list published by IANA (Internet Assigned Character Numbers). Use the "preferred MIME-name" if the IANA list specifies one, otherwise use the format in the Name field - http://www.iana.org/assignments/character-sets. DDI 3.1 The modules physicaldataproduct_ncube_normal, physicaldataproduct_ncube_tabular, and physicaldataproduct_ncube_proprietary are derived from the module physicaldataproduct. Therefore the definition of CharacterSet is the same in all modules. If using "Other", specify the value in the OtherValue attribute of the appropriate element. Module Name Element Name physicaldataproduct CharacterSet physicaldataproduct_ncube_normal CharacterSet physicaldataproduct_ncube_tabular CharacterSet physicaldataproduct_proprietary CharacterSet DDI 2.1 The element fileType with its attribute "charset" is used in the element fileTxt (as 3.1.5). Element Number Element/Attribute Name 3.1.5 fileType@charset Creative Commons Attribution-ShareAlike 3 http://creativecommons.org/licenses/by-sa/3.0/ http://i.creativecommons.org/l/by-sa/3.0/80x15.png Copyright © DDI Alliance http://www.ddialliance.org/ 2011 CharacterSet Character Set 1.0 urn:ddi-cv:CharacterSet urn:ddi-cv:CharacterSet:1.0 http://www.ddialliance.org/Specification/DDI-CV/CharacterSet_1.0_Genericode1.0_DDI-CVProfile1.0.xml http://www.ddialliance.org/Specification/DDI-CV/CharacterSet_1.0.html http://www.ddialliance.org/Specification/DDI-CV/CharacterSet_1.0_InputSheet_Excel2003.xls DDI Alliance The Alliance for the Data Documentation Initiative DDI Code Value of the Code Term Descriptive Term of the Code Definition Definition of the Code CodeKey The unique identification of each item in a code list. ASCII ASCII The official name is US-ASCII but use the format: ASCII. The ISO code for ASCII is ISO 14962. ISO88591 ISO-8859-1 For ISO standards, use format: ISO-n-n in Caption and ISOnn in Code. ISO standards are also known as Latin, for example, ISO-8859-1 as Latin1. ISO88592 ISO-8859-2 ISO88593 ISO-8859-3 ISO88594 ISO-8859-4 ISO88595 ISO-8859-5 ISO88596 ISO-8859-6 ISO88597 ISO-8859-7 ISO88598 ISO-8859-8 ISO88599 ISO-8859-9 ISO885910 ISO-8859-10 ISO885911 ISO-8859-11 ISO885913 ISO-8859-13 ISO885914 ISO-8859-14 ISO885915 ISO-8859-15 ISO885916 ISO-8859-16 MacOSRoman Mac OS Roman UTF8 UTF-8 For Unicode Transformation Formats, use format: UTF-n in Caption and UTFn in Code. UTF16 UTF-16 UTF32 UTF-32 Windows1251 Windows-1251 For MS-Windows character sets, use format: Windows-n in Caption and Windowsn in Code. Windows1252 Windows-1252 Windows1253 Windows-1253 Windows1254 Windows-1254 Windows1255 Windows-1255 Windows1256 Windows-1256 Windows1257 Windows-1257 Windows1258 Windows-1258 Unspecified Unspecified Use if the character set is not known, for example for some proprietary data files. Other Other Use if the character set is known, but not found in the list.