The .lex format

Files of type .lex (Lexicon), found in /data/lexicon/xml on the annotation DVD, contain the CGN lexicon in XML text format. For full documentation of this format, see /../../lexicon/lexicon.htm, as well as lex.dtd and mlex.dtd, both of which are on the annotation DVD.

All characters in the transcriptions that belong to the ISO 8859-1 character set but fall outside the 7-bit range have been converted to the corresponding character entity references for ISO 8859-1 characters. The subset of special characters actually used is listed in the DTDs mentioned above. An overview of the different standards for this character (sub)set is given in entities.htm.
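As an illustration of this conversion, the following minimal Python sketch replaces every character outside the 7-bit range with a named entity reference and converts it back again. It assumes that the entity names used in the .lex files coincide with the standard HTML names for ISO 8859-1 characters as provided by Python's html.entities module; the authoritative list remains the one in lex.dtd and mlex.dtd. The function names encode_latin1_entities and decode_entities are hypothetical and not part of any CGN tool, and the numeric fallback for characters without a named entity is likewise an assumption.

```python
import html
from html.entities import codepoint2name


def encode_latin1_entities(text: str) -> str:
    """Replace characters outside the 7-bit range with entity references.

    Assumption: entity names follow the standard HTML set for
    ISO 8859-1 characters; check lex.dtd / mlex.dtd for the subset
    that actually occurs in the lexicon.
    """
    pieces = []
    for ch in text:
        cp = ord(ch)
        if cp < 128:
            # 7-bit characters are kept unchanged
            pieces.append(ch)
        elif cp <= 0xFF and cp in codepoint2name:
            # ISO 8859-1 character with a named entity, e.g. 233 -> &eacute;
            pieces.append(f"&{codepoint2name[cp]};")
        else:
            # Assumed fallback: numeric character reference
            pieces.append(f"&#{cp};")
    return "".join(pieces)


def decode_entities(text: str) -> str:
    """Turn entity references back into the characters they stand for."""
    return html.unescape(text)


if __name__ == "__main__":
    encoded = encode_latin1_entities("coördinatie")
    print(encoded)
    print(decode_entities(encoded))
```

Note that html.unescape also resolves the predefined XML entities such as &amp;amp; and &amp;lt;, so it is better suited to extracted text content than to raw XML markup.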