A comparable Wikipedia corpus: from wiki syntax to POS tagged XML

  • Additional Information
    • Subject:
      2011
    • Collection:
      LeibnizOpen (The Leibniz Association)
    • Abstract:
      To build a comparable Wikipedia corpus of German, French, Italian, Norwegian, Polish and Hungarian for contrastive grammar research, we used a set of XSLT stylesheets to transform the MediaWiki annotations to XML. Furthermore, the data has been annotated with word class information using different taggers. The outcome is a corpus with rich metadata and linguistic annotation that can be used for multilingual research on various linguistic topics.
    • File Description:
      application/pdf
    • Online Access:
      https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/5189
      https://nbn-resolving.org/urn:nbn:de:bsz:mh39-51897
      https://ids-pub.bsz-bw.de/files/5189/Bubenhofer_Schwinn_Haupt-A_comparable_corpus-2011.pdf
    • Rights:
      Protected by copyright
    • Accession Number:
      edsbas.95942026