The goodness of pronunciation algorithm applied to disordered speech

Authors: Pellegrini, Thomas; Fontan, Lionel; Mauclair, Julie; Farinas, Jérôme; Robert, Marina
Source:
Proceedings of The 15th Annual Conference of the International Speech Communication Association
15th Annual Conference of the International Speech Communication Association (INTERSPEECH 2014), Sep 2014, Singapore, Singapore, pp. 1463-1467, ⟨10.21437/Interspeech.2014-357⟩
Subjects:
[INFO.INFO-AI] Computer Science [cs]/Artificial Intelligence [cs.AI]; [INFO.INFO-TS] Computer Science [cs]/Signal and Image Processing; Goodness of Pronunciation; Disordered speech; [INFO.INFO-GR] Computer Science [cs]/Graphics [cs.GR]; [INFO.INFO-CV] Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]; Computer vision and pattern recognition; Artificial intelligence; 01 natural sciences; Image processing; Pronunciation automatic assessment; [INFO.INFO-TI] Computer Science [cs]/Image Processing [eess.IV]; 0103 physical sciences; Signal and image processing; Image synthesis and virtual reality
Record type:
Article
Conference object
Online access:
https://hal.science/hal-04080790v1/document
https://hal.science/hal-04080790v1
https://doi.org/10.21437/interspeech.2014-357
http://www.isca-speech.org/archive/interspeech_2014/i14_1463.html
https://dblp.uni-trier.de/db/conf/interspeech/interspeech2014.html#PellegriniFMFR14
https://oatao.univ-toulouse.fr/13139/
https://hal.science/hal-04080790
https://hal.science/hal-04080790/file/Pelligrini_13139.pdf
https://hal.science/hal-04080790/document

Additional information
- Contributors:
  Open Archive Toulouse Archive Ouverte (OATAO); Équipe Structuration, Analyse et MOdélisation de documents Vidéo et Audio (IRIT-SAMoVA); Institut de recherche en informatique de Toulouse (IRIT); Université Toulouse Capitole (UT Capitole); Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse - Jean Jaurès (UT2J); Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3); Université de Toulouse (UT)-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP); Université de Toulouse (UT)-Toulouse Mind & Brain Institut (TMBI); Université Toulouse - Jean Jaurès (UT2J); Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3); Université de Toulouse (UT)-Université Toulouse Capitole (UT Capitole); Université de Toulouse (UT); Archean Labs; Université Paris Descartes - Paris 5 (UPD5); Université Paris Nanterre (UPN); Centre National de la Recherche Scientifique - CNRS (FRANCE); Institut National Polytechnique de Toulouse - Toulouse INP (FRANCE); Université Toulouse III - Paul Sabatier - UT3 (FRANCE); Université Toulouse - Jean Jaurès - UT2J (FRANCE); Université Toulouse 1 Capitole - UT1 (FRANCE); Université Paris Descartes - Paris V (FRANCE); Université Paris Ouest Nanterre La Défense (FRANCE)
- Publication information:
  ISCA, 2014.
- Publication year:
  2014
- Abstract:
  In this paper, we report on a study aimed at automatically detecting phoneme-level mispronunciations in 32 French speakers suffering from unilateral facial palsy at four different clinical severity grades. We sought to determine whether the Goodness of Pronunciation (GOP) algorithm, which is commonly used in Computer-Assisted Language Learning systems to detect learners' individual errors, could also detect segmental deviances in disordered speech. For this purpose, speech read by the 32 speakers was aligned and GOP scores were computed for each phone realization. The highest scores, which indicate large dissimilarities with standard phone realizations, were obtained for the most severely impaired speakers. The corresponding speech subset was manually transcribed at the phone level, and 8.3% of the phones differed from the standard pronunciations extracted from our lexicon. The GOP technique detected 70.2% of the mispronunciations, with equal false rejection and false acceptance rates of about 30%. The phone substitutions detected by the algorithm confirmed that some of the speakers have difficulty producing bilabial plosives, and showed that other sounds such as sibilants are prone to mispronunciation. Another interesting finding was that speakers diagnosed with the same pathology grade do not necessarily share the same pronunciation issues.
- File Description:
  application/pdf
- Identifier:
  10.21437/interspeech.2014-357
- Identifier:
  edsair.doi.dedup.....3df7eb5eeffa5dc3e0d0cd6e03a5453c
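
The abstract describes scoring each phone realization with GOP and flagging high scores as mispronunciations, but this record does not give the formula or thresholds used in the paper. The following is only a minimal sketch, assuming the commonly cited formulation (absolute difference between the forced-alignment log-likelihood and the best free phone-loop log-likelihood, divided by the segment length in frames). The function names, the single global threshold, and the definitions of the two error rates are illustrative assumptions, not the authors' implementation.

```python
from typing import List, Sequence, Tuple


def gop_score(forced_loglik: float, free_loglik: float, num_frames: int) -> float:
    """GOP for one phone segment (assumed formulation, not taken from the record).

    forced_loglik: log-likelihood of the segment under the expected phone,
                   as obtained from forced alignment.
    free_loglik:   log-likelihood of the same frames under the best-scoring
                   phone of an unconstrained phone loop.
    num_frames:    segment duration in frames, used for normalization.
    """
    if num_frames <= 0:
        raise ValueError("num_frames must be positive")
    return abs(forced_loglik - free_loglik) / num_frames


def flag_phones(scores: Sequence[float], threshold: float) -> List[bool]:
    """Flag a phone as mispronounced (True) when its score exceeds the threshold."""
    return [score > threshold for score in scores]


def error_rates(flags: Sequence[bool], truth: Sequence[bool]) -> Tuple[float, float]:
    """Return (false_rejection_rate, false_acceptance_rate).

    truth[i] is True when a manual transcription marks phone i as mispronounced.
    Assumed definitions: a false rejection is a correct phone that was flagged,
    a false acceptance is a mispronounced phone that was not flagged.
    """
    flags_for_correct = [f for f, t in zip(flags, truth) if not t]
    flags_for_wrong = [f for f, t in zip(flags, truth) if t]
    fr = sum(flags_for_correct) / len(flags_for_correct) if flags_for_correct else 0.0
    fa = sum(not f for f in flags_for_wrong) / len(flags_for_wrong) if flags_for_wrong else 0.0
    return fr, fa
```

Under these assumptions, sweeping `threshold` and keeping the value at which the two rates returned by `error_rates` are closest would give an equal-error operating point of the kind the abstract reports.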
