
NCHLT isiZulu fastText-CBoW embeddings

  • Authors: Roald Eiselen
  • Source:
    Web ; Government Documents
  • Record type:
    other/unknown material
  • Language:
    Zulu
  • Additional information
    • Contributors:
      Rico Koen; Albertus Kruger; Jacques van Heerden
    • Publication information:
      North-West University; Centre for Text Technology (CTexT)
    • Subject:
      2023
    • Abstract:
      Static word and subword embeddings for the continuous bag of words (CBoW) flavour of the fastText architecture (Bojanowski et al., 2017). The embedding provides real-valued vector representations for isiZulu text.
    • File Description:
      Training data: Paragraphs: 816,776; Token count: 15,801,081; Vocab size: 182,964; Embedding dimensions: 600; 4.02GB (Zipped); application/octet-stream
    • Relation:
      https://hdl.handle.net/20.500.12185/595
    • Online access:
      https://hdl.handle.net/20.500.12185/595
    • Rights:
      Creative Commons Attribution 4.0 International (CC-BY 4.0)
    • Accession number:
      edsbas.7AE4AEFD
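
The abstract describes these as word and subword embeddings. As a rough illustration of what that means, the sketch below follows the general scheme in Bojanowski et al. (2017): a word is wrapped in `<` and `>` boundary markers, split into character n-grams, and its vector is composed from the whole-word vector and the n-gram vectors. The function names, the toy two-dimensional vectors, the dictionary lookup (in place of hashed n-gram buckets), and the n-gram range of 3 to 6 are assumptions made for illustration; the record does not state the settings used to train this resource, whose vectors have 600 dimensions.

```python
def char_ngrams(word, minn=3, maxn=6):
    """Return the character n-grams of a word wrapped in '<' and '>' markers.

    minn and maxn are assumed values, not settings documented in this record.
    """
    wrapped = f"<{word}>"
    grams = []
    for n in range(minn, maxn + 1):
        for start in range(len(wrapped) - n + 1):
            grams.append(wrapped[start:start + n])
    return grams


def compose_vector(word, word_vectors, ngram_vectors, dim):
    """Average the whole-word vector (if known) with its known n-gram vectors.

    word_vectors and ngram_vectors are hypothetical dicts mapping strings to
    lists of floats; a real model stores hashed n-gram buckets instead.
    """
    parts = []
    if word in word_vectors:
        parts.append(word_vectors[word])
    parts.extend(ngram_vectors[g] for g in char_ngrams(word) if g in ngram_vectors)
    if not parts:
        # Nothing known about this word: fall back to a zero vector.
        return [0.0] * dim
    return [sum(column) / len(parts) for column in zip(*parts)]


# Toy lookup tables with 2-dimensional vectors, purely for illustration.
toy_word_vectors = {"zulu": [1.0, 3.0]}
toy_ngram_vectors = {"<zu": [3.0, 5.0]}

trigrams = char_ngrams("zulu", minn=3, maxn=3)
vector = compose_vector("zulu", toy_word_vectors, toy_ngram_vectors, dim=2)
```

Because the n-gram vectors contribute to every word, a word absent from the 182,964-entry vocabulary can still receive a vector from whichever of its n-grams are known, which is the practical appeal of subword embeddings for a morphologically rich language such as isiZulu.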