C4Corpus (CC BY-NC-ND part)
- Authors: Gurevych, Iryna; Habernal, Ivan; Zayed, Omnia
- Source:
https://dkpro.github.io/dkpro-c4corpus/
- Subject:
- Document Type:
other/unknown material
- Language:
Afrikaans
Arabic
Bengali
Bulgarian
Czech
Danish
German
Greek, Modern (1453-)
English
Estonian
Persian
Finnish
French
Gujarati
Hebrew
Hindi
Croatian
Hungarian
Indonesian
Italian
Japanese
Kannada
Korean
Latvian
Lithuanian
Malayalam
Marathi
Macedonian
Nepali
Dutch; Flemish
Norwegian
Polish
Portuguese
Romanian; Moldavian; Moldovan
Russian
Slovak
Slovenian
Somali
Spanish; Castilian
Albanian
Swahili
Swedish
Tamil
Telugu
Tagalog
Thai
Turkish
Ukrainian
unknown
Urdu
Vietnamese
Chinese
- Additional Information
- Publisher Information:
Technische Universität Darmstadt
- Publication Year:
2016
- Collection:
LINDAT-Clarin: Repository (Centre for Language Research Infrastructure in the Czech Republic)
- Abstract:
A large web corpus (over 10 billion tokens) in 50+ languages, licensed under the Creative Commons license family. It was extracted from CommonCrawl, the largest publicly available general Web crawl to date, with about 2 billion crawled URLs.
- File Description:
text/plain; application/x-gzip; downloadable_files_count: 56
- Relation:
http://www.lrec-conf.org/proceedings/lrec2016/pdf/388_Paper.pdf; http://hdl.handle.net/11372/LRT-2205
- Online Access:
http://hdl.handle.net/11372/LRT-2205
- Rights:
Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0); http://creativecommons.org/licenses/by-nc-nd/4.0/; PUB
- Accession Number:
edsbas.823DC30B