Item request has been placed! ×
Item request cannot be made. ×
loading  Processing Request

Automatic Standardization of Arabic Dialects for Machine Translation ; Standardisation automatique des dialectes arabes pour la traduction automatique

Item request has been placed! ×
Item request cannot be made. ×
loading   Processing Request
  • معلومة اضافية
    • Contributors:
      Centre d'Études Linguistiques - Corpus, Discours et Sociétés (CEL); Université Jean Moulin - Lyon 3 (UJML); Université de Lyon-Université de Lyon; Institut des Langues et Cultures d'Europe, Amérique, Afrique, Asie et Australie (ILCEA4); Université Grenoble Alpes (UGA); Université Jean Moulin Lyon 3 - Faculté des langues (UJML3 Langues)
    • بيانات النشر:
      HAL CCSD
      Universiti Malaysia Kelantan (UMK)
    • الموضوع:
      2022
    • Collection:
      Université Grenoble Alpes: HAL
    • نبذة مختصرة :
      International audience ; Based on an annotated multimedia corpus, television series Marāyā 2013, we dig into the question of "automatic standardization" of Arabic dialects for machine translation. Here we distinguish between rule-based machine translation and statistical machine translation. Machine translation from Arabic most of the time takes standard or modern Arabic as the source language and produces quite satisfactory translations thanks to the availability of the translation memories necessary for training the models. The case is different for the translation of Arabic dialects. The productions are much less efficient. In our research we try to apply machine translation methods to a dialect/standard (or modern) Arabic pair to automatically produce a standard Arabic text from a dialect input, a process we call "automatic standardization". we opt here for the application of "statistical models" because "automatic standardization" based on rules is more hard with the lack of "diglossic" dictionaries on the one hand and the difficulty of creating linguistic rules for each dialect on the other. Carrying out this research could then lead to combining "automatic standardization" software and automatic translation software so that we take the output of the first software and introduce it as input into the second one to obtain at the end a quality machine translation. This approach may also have educational applications such as the development of applications to help understand different Arabic dialects by transforming dialectal texts into standard Arabic. ; À partir d'un corpus multimédia annoté, la série télévisée Marāyā 2013, nous creusons la question de la « standardisation automatique » des dialectes arabes pour la traduction automatique. Ici, nous distinguons la traduction automatique basée sur des règles et la traduction automatique statistique. La traduction automatique depuis l'arabe prend la plupart du temps comme langue source l'arabe standard ou moderne et produit des traductions tout à fait ...
    • ISBN:
      978-967-00-2148-5
      967-00-2148-0
    • Relation:
      info:eu-repo/semantics/altIdentifier/arxiv/2301.03447; hal-03928243; https://hal.science/hal-03928243; https://hal.science/hal-03928243/document; https://hal.science/hal-03928243/file/Alnassan_2022_Automatic%20Standardization%20of%20Arabic%20Dialects%20for%20Machine%20Translation.pdf; ARXIV: 2301.03447
    • Rights:
      http://hal.archives-ouvertes.fr/licences/copyright/ ; info:eu-repo/semantics/OpenAccess
    • الرقم المعرف:
      edsbas.3D11DA5E