Preparing an endangered language for the digital age: the case of Judeo-Spanish

  • Additional Information
    • Publisher Information:
      European Language Resources Association (ELRA) 2022-06-20
    • Abstract:
      We develop machine translation and speech synthesis systems to complement the efforts of revitalizing Judeo-Spanish, the exiled language of Sephardic Jews, which survived for centuries, but now faces the threat of extinction in the digital age. Building on resources created by the Sephardic community of Turkey and elsewhere, we create corpora and tools that would help preserve this language for future generations. For machine translation, we first develop a Spanish to Judeo-Spanish rule-based machine translation system, in order to generate large volumes of synthetic parallel data in the relevant language pairs: Turkish, English and Spanish. Then, we train baseline neural machine translation engines using this synthetic data and authentic parallel data created from translations by the Sephardic community. For text-to-speech synthesis, we present a 3.5 hour single speaker speech corpus for building a neural speech synthesis engine. Resources, model weights and online inference engines are shared publicly.
    • Subject:
    • Availability:
      Open access content.
      cc_by_nc_nd_4
    • Note:
      application/pdf
      English
    • Other Numbers:
      IEDUB oai:doras.dcu.ie:28325
      https://doras.dcu.ie/28325/1/f8a30620-cc4a-47b0-baa0-dd510dfa6c74.tmp
      Öktem, Alp (ORCID: 0000-0002-0700-1159), Zevallos, Rodolfo, Moslem, Yasmin (ORCID: 0000-0003-4595-6877), Öztürk, Güneş and Şarhon, Karen Gerson (2022) Preparing an endangered language for the digital age: the case of Judeo-Spanish. In: Workshop on Resources and Technologies for Indigenous, Endangered and Lesser-resourced Languages in Eurasia within the 13th Language Resources and Evaluation Conference, 20 June 2022, Marseille, France.
      1402731013
    • Contributing Source:
      DUBLIN CITY UNIV
      From OAIster®, provided by the OCLC Cooperative.
    • Accession Number:
      edsoai.on1402731013