NCHLT isiZulu fastText-CBoW embeddings
- Authors: Roald Eiselen
- Source:
Web; Government Documents
- Record type:
other/unknown material
- Language:
Zulu
- Additional information
- Contributors:
Rico Koen; Albertus Kruger; Jacques van Heerden
- Publication information:
North-West University; Centre for Text Technology (CTexT)
- Subject:
2023
- Abstract:
Static word and subword embeddings for the continuous bag of words (CBoW) flavour of the fastText architecture (Bojanowski et al., 2017). The embedding provides real-valued vector representations for isiZulu text.
- File Description:
Training data: Paragraphs: 816,776; Token count: 15,801,081; Vocab size: 182,964; Embedding dimensions: 600; 4.02GB (Zipped); application/octet-stream
- Relation:
https://hdl.handle.net/20.500.12185/595
- Online access:
https://hdl.handle.net/20.500.12185/595
- Rights:
Creative Commons Attribution 4.0 International (CC-BY 4.0)
- Accession number:
edsbas.7AE4AEFD
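
The record describes the resource as word and subword embeddings in the fastText architecture (Bojanowski et al., 2017). As a rough illustration of what "subword" means there, the sketch below lists the character n-grams that a fastText-style model would associate with a word; the word's vector is then built by combining the vectors of these n-grams. The function name `char_ngrams` is hypothetical, and the n-gram lengths used here (3 to 6, the fastText library defaults) are an assumption, since the record does not state the settings used to train this resource.

```python
def char_ngrams(word, minn=3, maxn=6):
    """List the character n-grams of a word, fastText-style.

    Assumption: minn/maxn follow the fastText defaults (3 and 6);
    the catalog record does not give the values used for this resource.
    """
    # The word is wrapped in boundary markers so that prefixes and
    # suffixes get n-grams distinct from word-internal sequences.
    token = f"<{word}>"
    grams = []
    for n in range(minn, maxn + 1):
        for start in range(len(token) - n + 1):
            grams.append(token[start:start + n])
    return grams


# Small demonstration with short n-gram lengths to keep the output readable.
print(char_ngrams("zulu", minn=3, maxn=4))
```

Because vectors are tied to n-grams rather than only to whole words, a model of this kind can produce a representation for a word outside the 182,964-word vocabulary, which is useful for a morphologically rich language such as isiZulu.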