SEER: Backdoor Detection for Vision-Language Models Through Searching Target Text and Image Trigger Jointly

Authors: Zhu, Liuwan; Ning, Rui; Li, Jiang; Xin, Chunsheng; Wu, Hongyi
Source:
Computer Science Faculty Publications
Subject:
Algorithm; Computer science; Electrical and computer engineering; CV language and vision; CV adversarial attacks and robustness; Computer Sciences; Theory and Algorithms
Record type:
article in journal/newspaper
Language:
unknown

Additional information
- Publication information:
  ODU Digital Commons
- Publication year:
  2024
- Collection:
  Old Dominion University: ODU Digital Commons
- Abstract:
  This paper proposes SEER, a novel backdoor detection algorithm for vision-language models, addressing the gap in the literature on multi-modal backdoor detection. While backdoor detection in single-modal models has been well studied, the investigation of such defenses in multi-modal models remains limited. Existing backdoor defense mechanisms cannot be directly applied to multi-modal settings due to their increased complexity and search space explosion. In this paper, we propose to detect backdoors in vision-language models by jointly searching image triggers and malicious target texts in feature space shared by vision and language modalities. Our extensive experiments demonstrate that SEER can achieve over 92% detection rate on backdoor detection in vision-language models in various settings without accessing training data or knowledge of downstream tasks.
- File Description:
  application/pdf
- Relation:
  https://digitalcommons.odu.edu/computerscience_fac_pubs/327; https://doi.org/10.1609/aaai.v38i7.28611
- Identifier:
  10.1609/aaai.v38i7.28611
- Online access:
  https://digitalcommons.odu.edu/computerscience_fac_pubs/327
  https://doi.org/10.1609/aaai.v38i7.28611
- Rights:
  Copyright © 2024, Association for the Advancement of Artificial Intelligence. All rights reserved. "In the returned rights section of the AAAI copyright form, authors are specifically granted back the right to use their own papers for noncommercial uses, such as inclusion in their dissertations or the right to deposit their papers in their institutional repositories, provided there is proper attribution. The published version is not available for posting outside the AAAI Digital Library." Included in accordance with publisher policy.
- Identifier:
  edsbas.BBD24C4A
