ELRA's SLR Validation Centre at SPEX


Validation of spoken language resources

Speech

After an open call,  the Speech Processing EXpertise centre (SPEX) in Nijmegen, the Netherlands, was selected in 1999 as ELRA's first VC for spoken language resources (SLR). SLR validation, as carried out by SPEX, concerns the quality evaluation of a database against a checklist of relevant criteria. These criteria are typically the specifications of the databases, together with some tolerance margins for deviations. General information on the topic of SLR validation can be found the article The Art of Validation in the ELRA Newsletter, Vol.5.4.

In order to avoid the duplication of efforts, ELRA uses the outcome of the SpeechDat project and its successors as the baseline for its own validation efforts. One of the objectives of the SpeechDat projects is to validate the speech databases that are collected. Specific information about the validation criteria for the various projects can be found in the Validation Standards section at ELRA's webpage (under Services around LR's).

The databases collected in varous SLR producing projects are distributed via ELRA. For each of these and later validated SLR, ELRA makes available in its catalogue the main documentation file, and the validation report as written by SPEX.

On the basis of the experiences gained in various projects, SPEX has written a validation manual for SLR. This document describes validation principles that should be taken into account by producers of new SLR, specifically those who aim to distribute SLR through ELRA.

 

In the year 2000, SPEX carried out the following tasks:

 

In the year 2001, SPEX carried out the following tasks:

  • Updating ELRA's web pages regarding SLR validation
  • Implementation for a SLR bug report procedure through ELRA's URL
  • Proposal for carrying out a quality description of selected SLR

 

In the year 2002, SPEX carried out the following tasks:

  • Monitoring the SLR bug report service
  • Implementation of bug fix actions
  • Quality description of selected SLR

 

In the year 2003, SPEX carried out the following tasks:

  • Monitoring the SLR bug report service
  • Bug fix actions; patch creation
  • Quick Quality Checks of selected SLR
  • Description of a Quick Quality Check for pronunciation lexica
  • Update of standards in SLR production and validation
  • Presentation and paper about the activities in ELRA’s VCOM at EUROSPEECH 2003 Geneva

 

In the year 2004, SPEX carried out the following tasks:

  • Monitoring the SLR bug report service
  • Bug fix actions
  • Quick Quality Checks of selected SLR
  • Description of a Quick Quality Check for speech synthesis databases
  • Update of standards in SLR validation

 

In the year 2005, SPEX carried out the following tasks:

  • Monitoring the SLR bug report service & bug fix actions
  • Quick Quality Checks of selected SLR

 

For the year 2006 and 2007, the following tasks are planned:

  • Monitoring the SLR bug report service & bug fix actions
  • Quick Quality Checks of selected SLR
  • Description of a Quick Quality Check for Multimodal Language Resources