<subcorpus> | sample with a syntactic annotation |
<s> | sentence with a syntactic annotation |
<graph> | graphic representation of the syntactic annotation |
<terminals> | list of terminal nodes, end nodes <t>. |
<nonterminals> | list of non-terminal nodes <nt>. |
<edge> | syntactic function |
<secedge> | syntactic function |
<nt> | non-terminal node |
<t> | terminal node |
root | ID of the mother node of sentence <s>. |
id | unique node identification, with <sample number>.<sentence rank number>.<node number>, where <node number> relates to terminal as well as to non-terminal nodes |
word | word form as it occurs in the orthographic transcription (cf. data in the .ort files) |
pos | part-of-speech tag of the terminal node. This POS tag is a simplified/derived version of the POS tag in morph (see below). See corpus.header (XML) on the annotation DVD or negra.header (text) also on the annotation DVD for an overview of the tagset used. |
morhp | part-of-speech tag corresponding to the POS tag from the attributepos. See corpus.header (XML) on the annotation DVD or negra.header (text) also on the annotation DVD for a mapping of the abbreviated label notation and the full POS tags (cf. data in the .plk files) |
cat | node label, the syntactic category of a non-terminal node. |
label | syntactic function. See corpus.header (XML) on the annotation DVD or negra.header (text) also on the annotation DVD for an explanation of the labels used. |
idref | reference to the id of the daughter node |
All characters used from the ISO-8859.1 character set that fall outside the 7-bit range have been translated according to the Character entity references for ISO 8859-1 characters. The subset of special characters used can be found in stext.dtd on the annotation DVD. In entities.htm an overview is presented of the various standards for this character (sub)set.