Below an overview is presented of the symbol set used in the Spoken
Dutch Corpus:
Class | Example | CGN symbol |
---|---|---|
Plosives | put | p |
bad | b | |
tak | t | |
dak | d | |
kat | k | |
goal | g | |
Fricatives | fiets | f |
vat | v | |
sap | s | |
zat | z | |
sjaal | S | |
ravage | Z | |
licht | x | |
regen | G | |
geheel | h | |
Sonorants | lang | N |
mat | m | |
nat | n | |
oranje | J | |
lat | l | |
rat | r | |
wat | w | |
jas | j | |
Short vowels | lip | I |
leg | E | |
lat | A | |
bom | O | |
put | Y | |
Long vowels | liep | i |
buur | y | |
leeg | e | |
deuk | 2 | |
laat | a | |
boom | o | |
boek | u | |
Schwa | gelijk | @ |
Diphthongs | wijs | E+ |
huis | Y+ | |
koud | A+ | |
Marginal vowels | scène | E: |
freule | Y: | |
zone | O: | |
Nasal vowels | vaccin | E~ |
croissant | A~ | |
congé | O~ | |
parfum | Y~ |
Apart from the above symbols there are four further symbols/codes that
are used in the phonetic transcriptions:
_ | (underscore) used with shared sounds | /om_mer/ |
- | (hyphen) used for inserted sounds | /du-w-an/ |
[] | (angled brackets) for an inaudible word or part of a word. | /hE+ lest [] blAtsE+d@/ |
/hE+ lest tin blAt[] vor/ | ||
# | for non-speech speaker sounds, analogous to the code ggg to the orthographic transcription. | /hE+ lest [] blAts#/ |
For a detailed account of the .fon format we refer to the Protocol voor Brede Fonetische Transcriptie (Gillis 2001); here available in .ps- en .pdf format (Dutch only).