Item request has been placed! ×
Item request cannot be made. ×
loading  Processing Request

OpenFashionCLIP: Vision-and-Language Contrastive Learning with Open-Source Fashion Data

Item request has been placed! ×
Item request cannot be made. ×
loading   Processing Request
  • معلومة اضافية
    • Contributors:
      Foresti G.L., Fusiello A., Hancock E.; Cartella, Giuseppe; Baldrati, Alberto; Morelli, Davide; Cornia, Marcella; Bertini, Marco; Cucchiara, Rita
    • بيانات النشر:
      Springer Science and Business Media Deutschland GmbH
    • الموضوع:
      2023
    • Collection:
      Archivio della ricerca dell'Università di Modena e Reggio Emilia (Unimore: IRIS)
    • نبذة مختصرة :
      The inexorable growth of online shopping and e-commerce demands scalable and robust machine learning-based solutions to accommodate customer requirements. In the context of automatic tagging classification and multimodal retrieval, prior works either defined a low generalizable supervised learning approach or more reusable CLIP-based techniques while, however, training on closed source data. In this work, we propose OpenFashionCLIP, a vision-and-language contrastive learning method that only adopts open-source fashion data stemming from diverse domains, and characterized by varying degrees of specificity. Our approach is extensively validated across several tasks and benchmarks, and experimental results highlight a significant out-of-domain generalization capability and consistent improvements over state-of-the-art methods both in terms of accuracy and recall. Source code and trained models are publicly available at: https://github.com/aimagelab/open-fashion-clip.
    • Relation:
      info:eu-repo/semantics/altIdentifier/isbn/9783031431470; ispartofbook:Proceedings of the 22nd International Conference on Image Analysis and Processing; 22nd International Conference on Image Analysis and Processing, ICIAP 2023; volume:14233; firstpage:245; lastpage:256; serie:LECTURE NOTES IN COMPUTER SCIENCE; alleditors:Foresti G.L., Fusiello A., Hancock E.; https://hdl.handle.net/11380/1309486; info:eu-repo/semantics/altIdentifier/scopus/2-s2.0-85172249158
    • الرقم المعرف:
      10.1007/978-3-031-43148-7_21
    • الدخول الالكتروني :
      https://hdl.handle.net/11380/1309486
      https://doi.org/10.1007/978-3-031-43148-7_21
    • Rights:
      info:eu-repo/semantics/openAccess
    • الرقم المعرف:
      edsbas.4C1059DD