The goodness of pronunciation algorithm applied to disordered speech

  • Additional information
    • Contributors:
      (OATAO), Open Archive Toulouse Archive Ouverte; Équipe Structuration, Analyse et MOdélisation de documents Vidéo et Audio (IRIT-SAMoVA); Institut de recherche en informatique de Toulouse (IRIT); Université Toulouse Capitole (UT Capitole); Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse - Jean Jaurès (UT2J); Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3); Université de Toulouse (UT)-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP); Université de Toulouse (UT)-Toulouse Mind & Brain Institut (TMBI); Université Toulouse - Jean Jaurès (UT2J); Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3); Université de Toulouse (UT)-Université Toulouse Capitole (UT Capitole); Université de Toulouse (UT); Archean Labs; Université Paris Descartes - Paris 5 (UPD5); Université Paris Nanterre (UPN); Centre National de la Recherche Scientifique - CNRS (FRANCE); Institut National Polytechnique de Toulouse - Toulouse INP (FRANCE); Université Toulouse III - Paul Sabatier - UT3 (FRANCE); Université Toulouse - Jean Jaurès - UT2J (FRANCE); Université Toulouse 1 Capitole - UT1 (FRANCE); Université Paris Descartes - Paris V (FRANCE); Université Paris Ouest Nanterre La Défense (FRANCE)
    • Publication information:
      ISCA, 2014.
    • Subject:
      2014
    • Abstract:
      In this paper, we report on a study that aimed to automatically detect phoneme-level mispronunciations in 32 French speakers suffering from unilateral facial palsy at four different clinical severity grades. We sought to determine whether the Goodness of Pronunciation (GOP) algorithm, which is commonly used in Computer-Assisted Language Learning systems to detect learners' individual errors, could also detect segmental deviances in disordered speech. For this purpose, speech read by the 32 speakers was aligned and GOP scores were computed for each phone realization. The highest scores, which indicate large dissimilarities with standard phone realizations, were obtained for the most severely impaired speakers. The corresponding speech subset was manually transcribed at the phone level; 8.3% of the phones differed from the standard pronunciations extracted from our lexicon. The GOP technique detected 70.2% of the mispronunciations, with equal rates of about 30% for false rejections and false acceptances. The phone substitutions detected by the algorithm confirmed that some of the speakers have difficulty producing bilabial plosives, and showed that other sounds, such as sibilants, are prone to mispronunciation. Another interesting finding was that speakers diagnosed with the same pathology grade do not necessarily share the same pronunciation issues.
    • File Description:
      application/pdf
    • Identifier:
      10.21437/interspeech.2014-357
    • Identifier:
      edsair.doi.dedup.....3df7eb5eeffa5dc3e0d0cd6e03a5453c
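
The abstract describes computing a GOP score for each aligned phone and treating high scores as likely mispronunciations. The record does not give the paper's exact formulation, acoustic models, or decision thresholds, so the following is only a minimal sketch of the commonly cited definition of GOP: the absolute difference between the log-likelihood of the expected phone and that of the best-scoring phone over the same segment, normalized by the segment's frame count. The function names, the input dictionary, and the numeric values are all hypothetical and chosen purely for illustration.

```python
def gop_score(segment_loglik, expected_phone, num_frames):
    """Return a GOP score for one aligned phone segment.

    segment_loglik: dict mapping each candidate phone to the log-likelihood
        of the segment's acoustic frames under that phone's model
        (assumed to be precomputed by some acoustic model).
    expected_phone: the phone the lexicon says should have been produced.
    num_frames: number of acoustic frames in the segment.
    """
    best = max(segment_loglik.values())
    # 0.0 means the expected phone is also the best-matching phone;
    # larger values mean a larger mismatch with the expected phone.
    return abs(segment_loglik[expected_phone] - best) / num_frames


def flag_mispronunciations(scores, threshold):
    """Flag each phone whose GOP score exceeds a (hypothetical) threshold."""
    return [score > threshold for score in scores]


# Illustrative values only: a segment expected to be /p/ that matches /b/ better.
loglik = {"p": -42.0, "b": -30.0, "m": -36.0}
scores = [
    gop_score(loglik, "p", num_frames=6),  # expected phone is a poor match
    gop_score(loglik, "b", num_frames=6),  # expected phone is the best match
]
flags = flag_mispronunciations(scores, threshold=1.0)
```

With a single global threshold, raising it lowers the false rejection rate at the cost of more false acceptances. The roughly equal 30% rates reported in the abstract are consistent with an operating point where the two error types are balanced, although the record does not say how the threshold was chosen.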