Improving Accented Speech Recognition with Multi-Domain Training

Item request has been placed!

Item request cannot be made.

Processing Request

اقرأ أكثر حفظ في قائمتي

المؤلفون: Maison, Lucas; Estève, Yannick
المصدر:
ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ; https://hal.science/hal-04163554 ; ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, Jun 2023, Rhodes Island, Greece. pp.1-5, ⟨10.1109/ICASSP49357.2023.10096268⟩
الموضوع:
automatic speech recognition; multidomain training; accented speech; self-supervised learning; domain shift; [STAT.ML]Statistics [stat]/Machine Learning [stat.ML]; [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO.INFO-SD]Computer Science [cs]/Sound [cs.SD]
نوع التسجيلة:
conference object
اللغة:
English

معلومة اضافية
- Contributors:
  Laboratoire Informatique d'Avignon (LIA); Avignon Université (AU)-Centre d'Enseignement et de Recherche en Informatique - CERI; Thales SIX GTS France; Thales SIX; IEEE
- بيانات النشر:
  HAL CCSD
  IEEE
- الموضوع:
  2023
- Collection:
  Université d'Avignon et des Pays de Vaucluse: HAL
- الموضوع:
  Rhodes Island; Greece
- نبذة مختصرة :
  5 pages, 2 figures. Accepted to ICASSP 2023 ; International audience ; Thanks to the rise of self-supervised learning, automatic speech recognition (ASR) systems now achieve near-human performance on a wide variety of datasets. However, they still lack generalization capability and are not robust to domain shifts like accent variations. In this work, we use speech audio representing four different French accents to create fine-tuning datasets that improve the robustness of pre-trained ASR models. By incorporating various accents in the training set, we obtain both in-domain and out-of-domain improvements. Our numerical experiments show that we can reduce error rates by up to 25% (relative) on African and Belgian accents compared to single-domain training while keeping a good performance on standard French.
- ISBN:
  978-1-72816-327-7
  1-72816-327-7
- Relation:
  info:eu-repo/semantics/altIdentifier/arxiv/2303.07924; hal-04163554; https://hal.science/hal-04163554; https://hal.science/hal-04163554/document; https://hal.science/hal-04163554/file/2023056083.pdf; ARXIV: 2303.07924
- الرقم المعرف:
  10.1109/ICASSP49357.2023.10096268
- Rights:
  http://creativecommons.org/licenses/by-nc-nd/ ; info:eu-repo/semantics/OpenAccess
- الرقم المعرف:
  edsbas.EA12F2CE

تعليقات

No Comments.

Improving Accented Speech Recognition with Multi-Domain Training

اتصل بنا

اتبع