On the effectiveness of hybrid pooling in mixup-based graph learning for language processing

Authors: DONG, Zeming; HU, Qiang; Zhang, Zhenya; GUO, Yuejun; CORDY, Maxime; PAPADAKIS, Mike; Traon, Yves Le; Zhao, Jianjun
Source:
Journal of Systems and Software, 216, 112139 (2024-10)
Subject:
Data augmentation; Manifold-mixup; Graph neural networks; Engineering; computing & technology; Computer science; Ingénierie; informatique & technologie; Sciences informatiques
Record type:
article in journal/newspaper
Language:
English

Additional information
- Publication data:
  Elsevier Inc.
- Publication year:
  2024
- Collection:
  University of Luxembourg: ORBilu - Open Repository and Bibliography
- Abstract:
  Peer reviewed. Graph neural network (GNN)-based graph learning has been popular in natural language and programming language processing, particularly in text and source code classification. Typically, GNNs are constructed by alternating layers that learn transformations of graph node features with graph pooling layers that use graph pooling operators (e.g., Max-pooling) to reduce the number of nodes while preserving the semantic information of the graph. Recently, to enhance GNNs in graph learning tasks, Manifold-Mixup, a data augmentation technique that produces synthetic graph data by linearly mixing a pair of graph data and their labels, has been widely adopted. However, the performance of Manifold-Mixup can be strongly affected by graph pooling operators, and few studies have been dedicated to uncovering this effect. To bridge this gap, we take an early step to explore how graph pooling operators affect the performance of Mixup-based graph learning. To that end, we conduct a comprehensive empirical study by applying Manifold-Mixup to a formal characterization of graph pooling based on 11 graph pooling operations (9 hybrid pooling operators, 2 non-hybrid pooling operators). The experimental results on both natural language datasets (Gossipcop, Politifact) and programming language datasets (JAVA250, Python800) demonstrate that hybrid pooling operators are more effective for Manifold-Mixup than the standard Max-pooling and the state-of-the-art graph multiset transformer (GMT) pooling, in terms of producing more accurate and robust GNN models. Editor's note: Open Science material was validated by the Journal of Systems and Software Open Science Board.
- ISSN:
  0164-1212
  1873-1228
- Relation:
  https://api.elsevier.com/content/article/PII:S0164121224001845?httpAccept=text/xml; urn:issn:0164-1212; urn:issn:1873-1228; https://orbilu.uni.lu/handle/10993/62154; info:hdl:10993/62154; https://orbilu.uni.lu/bitstream/10993/62154/1/JSS__On_the_Effectiveness_of_Hybrid_Pooling_in_Mixup_Based_Graph_Learning_for_Language_Processing.pdf; wos:001260967800001
- DOI:
  10.1016/j.jss.2024.112139
- Online access:
  https://doi.org/10.1016/j.jss.2024.112139
  https://orbilu.uni.lu/handle/10993/62154
  https://orbilu.uni.lu/bitstream/10993/62154/1/JSS__On_the_Effectiveness_of_Hybrid_Pooling_in_Mixup_Based_Graph_Learning_for_Language_Processing.pdf
- Rights:
  open access ; http://purl.org/coar/access_right/c_abf2 ; info:eu-repo/semantics/openAccess
- Accession number:
  edsbas.FD20115
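
The abstract describes two ingredients: a pooling operator that reduces a graph's node features to one vector, and Manifold-Mixup, which linearly mixes a pair of graphs and their labels. The following is a minimal illustrative sketch in plain Python, not the authors' implementation. It assumes that a "hybrid" operator can be modelled as the concatenation of max and mean readouts, and that mixing is applied to the pooled graph-level vectors; the function names (`hybrid_pool`, `mixup`) and the toy data are hypothetical.

```python
from typing import List, Sequence

Vector = List[float]


def max_pool(nodes: Sequence[Vector]) -> Vector:
    """Element-wise maximum over all node feature vectors of one graph."""
    return [max(column) for column in zip(*nodes)]


def mean_pool(nodes: Sequence[Vector]) -> Vector:
    """Element-wise mean over all node feature vectors of one graph."""
    return [sum(column) / len(column) for column in zip(*nodes)]


def hybrid_pool(nodes: Sequence[Vector]) -> Vector:
    """Assumed hybrid readout: concatenate the max and mean readouts."""
    return max_pool(nodes) + mean_pool(nodes)


def mixup(a: Vector, b: Vector, lam: float) -> Vector:
    """Linear interpolation lam * a + (1 - lam) * b, with lam in [0, 1]."""
    if not 0.0 <= lam <= 1.0:
        raise ValueError("lam must lie in [0, 1]")
    return [lam * x + (1.0 - lam) * y for x, y in zip(a, b)]


# Toy graphs with different node counts; each row is one node's features.
graph_a = [[1.0, 4.0], [3.0, 2.0]]
graph_b = [[0.0, 0.0], [2.0, 6.0], [4.0, 0.0]]
label_a = [1.0, 0.0]  # one-hot label of graph_a
label_b = [0.0, 1.0]  # one-hot label of graph_b
lam = 0.5             # mixing ratio, fixed here for reproducibility

# Pooling yields fixed-size vectors, so graphs of different sizes can be mixed.
h_a = hybrid_pool(graph_a)
h_b = hybrid_pool(graph_b)

mixed_h = mixup(h_a, h_b, lam)          # synthetic graph representation
mixed_y = mixup(label_a, label_b, lam)  # matching soft label
print(mixed_h, mixed_y)
```

In practice the mixing ratio is usually sampled per batch rather than fixed, and the pooled vectors come from trained GNN layers; both are left out to keep the sketch short.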