Item request has been placed! ×
Item request cannot be made. ×
loading  Processing Request

Modeling fine-grained sociolinguistic variation: The promises and pitfalls of Twitter corpora and neural word embeddings

Item request has been placed! ×
Item request cannot be made. ×
loading   Processing Request
  • معلومة اضافية
    • Contributors:
      IMS Stuttgart; University of Stuttgart = Universität Stuttgart; Cognition, langues, langage, ergonomie (CLLE); École Pratique des Hautes Études (EPHE); Université Paris Sciences et Lettres (PSL)-Université Paris Sciences et Lettres (PSL)-Université Toulouse - Jean Jaurès (UT2J); Université de Toulouse (UT)-Université de Toulouse (UT)-Université Bordeaux Montaigne (UBM)-Centre National de la Recherche Scientifique (CNRS)-Toulouse Mind & Brain Institut (TMBI); Université Toulouse - Jean Jaurès (UT2J); Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3); Université de Toulouse (UT)-Université Toulouse - Jean Jaurès (UT2J); Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3); Université de Toulouse (UT); Cognition, Langues, Langage, Ergonomie (CLLE-ERSS); Université de Toulouse (UT)-Université de Toulouse (UT)-Université Bordeaux Montaigne (UBM)-Centre National de la Recherche Scientifique (CNRS); Mark Kaunisto; Marco Schilk; Ute Römer
    • بيانات النشر:
      CCSD
      John Benjamins
    • الموضوع:
      2024
    • نبذة مختصرة :
      International audience ; This chapter examines the use of recent data sources and computational methods to study fine-grained sociolinguistic phenomena. We deploy a custom-built corpus of tweets (Miletic et al. 2020) and neural word embeddings to investigate the use of contact-induced semantic shifts in Quebec English. Drawing on an analysis of 40 lexical items, we show that our approach is beneficial in facilitating manual inspection of vast amounts of data and establishing fine-grained patterns of language variation. While it is affected by a range of noise-related issues, which we describe in detail, coarse-grained annotation provides an efficient way of circumventing them. We use the results filtered in this way to conduct a quantitative analysis of sociolinguistic constraints on contact-induced semantic shifts, further confirming the relevance of our approach.
    • ISBN:
      978-90-272-1588-8
      90-272-1588-X
    • الدخول الالكتروني :
      https://hal.science/hal-04806795
      https://hal.science/hal-04806795v1/document
      https://hal.science/hal-04806795v1/file/PREPRINT-rev2.pdf
    • Rights:
      info:eu-repo/semantics/OpenAccess
    • الرقم المعرف:
      edsbas.24F71013