Multi Word Term Queries for Focused Information Retrieval.

  • Additional Information
    • Contributors:
      Laboratoire Informatique d'Avignon (LIA); Avignon Université (AU)-Centre d'Enseignement et de Recherche en Informatique - CERI; Equipe de recherche de Lyon en sciences de l'information et de la communication (ELICO); Université Lumière - Lyon 2 (UL2)-École nationale supérieure des sciences de l'information et des bibliothèques (ENSSIB); Université de Lyon-Université de Lyon-Sciences Po Lyon - Institut d'études politiques de Lyon (IEP Lyon); Université de Lyon-Université Jean Moulin - Lyon 3 (UJML); Université de Lyon-Université Claude Bernard Lyon 1 (UCBL); Université de Lyon; Gelbukh, Alexander
    • Publication Information:
      HAL CCSD
      Springer
    • Publication Year:
      2010
    • Collection:
      Portail HAL de l'Université Lumière Lyon 2
    • Abstract:
      In this paper, we address both standard and focused retrieval tasks based on comprehensible language models and interactive query expansion (IQE). Query topics are expanded using an initial set of Multi Word Terms (MWTs) selected from the top-n ranked documents. MWTs are special text units that represent domain concepts and objects. As such, they can represent query topics better than ordinary phrases or n-grams. We tested different query representations: bag-of-words, phrases, a flat list of MWTs, and subsets of MWTs. We also combined the initial set of MWTs obtained in an IQE process with automatic query expansion (AQE) using language models and a smoothing mechanism. As a baseline, we chose the Indri IR engine, which is based on a language model with Dirichlet smoothing. The experiments are carried out on two benchmarks: the TREC Enterprise track (TRECent) 2007 and 2008 collections, and the INEX 2008 Ad-hoc track using the Wikipedia collection.
    • Relation:
      hal-00635283; https://hal.science/hal-00635283; https://hal.science/hal-00635283/document; https://hal.science/hal-00635283/file/cicling2010_sanjuan.pdf
    • DOI:
      10.1007/978-3-642-12116-6_50
    • Online Access:
      https://hal.science/hal-00635283
      https://hal.science/hal-00635283/document
      https://hal.science/hal-00635283/file/cicling2010_sanjuan.pdf
      https://doi.org/10.1007/978-3-642-12116-6_50
    • Rights:
      info:eu-repo/semantics/OpenAccess
    • Accession Number:
      edsbas.31AC76C8