Item request has been placed! ×
Item request cannot be made. ×
loading  Processing Request

Simulating articulatory trajectories with phonological feature interpolation

Item request has been placed! ×
Item request cannot be made. ×
loading   Processing Request
  • معلومة اضافية
    • Contributors:
      Laboratoire de sciences cognitives et psycholinguistique (LSCP); Département d'Etudes Cognitives - ENS Paris (DEC); École normale supérieure - Paris (ENS-PSL); Université Paris Sciences et Lettres (PSL)-Université Paris Sciences et Lettres (PSL)-École normale supérieure - Paris (ENS-PSL); Université Paris Sciences et Lettres (PSL)-Université Paris Sciences et Lettres (PSL)-École des hautes études en sciences sociales (EHESS)-Centre National de la Recherche Scientifique (CNRS); GIPSA - Cognitive Robotics, Interactive Systems, & Speech Processing (GIPSA-CRISSP); GIPSA Pôle Parole et Cognition (GIPSA-PPC); Grenoble Images Parole Signal Automatique (GIPSA-lab); Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP); Université Grenoble Alpes (UGA)-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP); Université Grenoble Alpes (UGA)-Grenoble Images Parole Signal Automatique (GIPSA-lab); Université Grenoble Alpes (UGA); Laboratoire d'Informatique et des Systèmes (LIS) (Marseille, Toulon) (LIS); Aix Marseille Université (AMU)-Université de Toulon (UTLN)-Centre National de la Recherche Scientifique (CNRS); Meta AI Research Paris; Meta AI; ISCA
    • بيانات النشر:
      HAL CCSD
      ISCA
    • الموضوع:
      2024
    • Collection:
      Aix-Marseille Université: HAL
    • الموضوع:
    • نبذة مختصرة :
      International audience ; As a first step towards a complete computational model of speech learning involving perception-production loops, we investigate the forward mapping between pseudo-motor commands and articulatory trajectories. Two phonological feature sets, based respectively on generative and articulatory phonology, are used to encode a phonetic target sequence. Different interpolation techniques are compared to generate smooth trajectories in these feature spaces, with a potential optimisation of the target value and timing to capture co-articulation effects. We report the Pearson correlation between a linear projection of the generated trajectories and articulatory data derived from a multi-speaker dataset of electromagnetic articulography (EMA) recordings. A correlation of 0.67 is obtained with an extended feature set based on generative phonology and a linear interpolation technique. We discuss the implications of our results for our understanding of the dynamics of biological motion.
    • Relation:
      info:eu-repo/semantics/altIdentifier/arxiv/2408.04363; ARXIV: 2408.04363
    • الرقم المعرف:
      10.21437/interspeech.2024-2192
    • الدخول الالكتروني :
      https://hal.science/hal-04699949
      https://hal.science/hal-04699949v1/document
      https://hal.science/hal-04699949v1/file/ortiztandazo24_interspeech.pdf
      https://doi.org/10.21437/interspeech.2024-2192
    • Rights:
      http://hal.archives-ouvertes.fr/licences/copyright/ ; info:eu-repo/semantics/OpenAccess
    • الرقم المعرف:
      edsbas.E219FF04