Full multicondition training for robust i-vector based speaker recognition


Authors: Ribas, Dayana; Vincent, Emmanuel; Calvo, José Ramon
Source:
Interspeech 2015 ; https://inria.hal.science/hal-01158774 ; Interspeech 2015, Sep 2015, Dresden, Germany
Subject:
robustness; multicondition training; UBM; speech enhancement; speaker recognition; [INFO.INFO-TS] Computer Science [cs]/Signal and Image Processing
Record type:
conference object
Language:
English

Additional information
- Contributors:
  Centro de Aplicaciones de Tecnologías de Avanzada La Havane (CENATAV); Speech Modeling for Facilitating Oral-Based Communication (MULTISPEECH); Inria Nancy - Grand Est; Institut National de Recherche en Informatique et en Automatique (Inria); Department of Natural Language Processing & Knowledge Discovery (LORIA - NLPKD); Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA); Université de Lorraine (UL); Centre National de la Recherche Scientifique (CNRS); Grid'5000
- Publication information:
  HAL CCSD
- Subject:
  2015
- Collection:
  Université de Lorraine: HAL
- Subject:
  Dresden; Germany
- Abstract:
  International audience; Multicondition training (MCT) is an established technique for handling noisy and reverberant conditions. Previous work on i-vector based speaker recognition has applied MCT to linear discriminant analysis (LDA) and probabilistic LDA (PLDA), but not to the universal background model (UBM) or the total variability (T) matrix, arguing that this would be too time-consuming because the training set size increases with the number of noise and reverberation conditions. In this paper, we propose a full MCT approach that applies MCT at all stages of training, including the UBM and the T matrix, while keeping the size of the training set fixed. Experiments in highly nonstationary noise conditions show a decrease of the equal error rate (EER) to 14.16%, compared to 17.90% for clean training and 18.08% for MCT of LDA and PLDA only. We also evaluate the impact of state-of-the-art multichannel speech enhancement and show a further reduction of the EER to 10.47%.
- Relation:
  hal-01158774; https://inria.hal.science/hal-01158774; https://inria.hal.science/hal-01158774/document; https://inria.hal.science/hal-01158774/file/multicondition_2015.pdf
- Rights:
  info:eu-repo/semantics/OpenAccess
- Accession number:
  edsbas.3EDB78A5
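
The EER figures quoted in the abstract are easier to compare as relative reductions. The following is a minimal sketch of that arithmetic, not code from the paper: the function name `relative_reduction` and the constant names are illustrative assumptions, and the only data used are the four EER values stated in the abstract. It also assumes the 10.47% figure was obtained on top of full MCT, which the abstract's wording ("further reduction") suggests but does not state outright.

```python
def relative_reduction(baseline_eer: float, new_eer: float) -> float:
    """Return the relative EER reduction, in percent, from baseline to new."""
    if baseline_eer <= 0:
        raise ValueError("baseline EER must be positive")
    return 100.0 * (baseline_eer - new_eer) / baseline_eer


# EER values (in %) exactly as quoted in the abstract.
EER_CLEAN = 17.90            # clean training
EER_MCT_LDA_PLDA = 18.08     # MCT applied to LDA and PLDA only
EER_FULL_MCT = 14.16         # full MCT (UBM, T matrix, LDA, PLDA)
EER_WITH_ENHANCEMENT = 10.47 # with multichannel speech enhancement (assumed on top of full MCT)

comparisons = [
    ("full MCT vs. clean training", EER_CLEAN, EER_FULL_MCT),
    ("full MCT vs. MCT of LDA/PLDA only", EER_MCT_LDA_PLDA, EER_FULL_MCT),
    ("enhancement vs. clean training", EER_CLEAN, EER_WITH_ENHANCEMENT),
]

for label, baseline, new in comparisons:
    print(f"{label}: {relative_reduction(baseline, new):.1f}% relative reduction")
```

Under these figures, full MCT gives a relative EER reduction of roughly 21% over clean training, and adding speech enhancement brings the reduction to roughly 41%.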
