Existence, Stability and Scalability of Orthogonal Convolutional Neural Networks

Item request has been placed!

Item request cannot be made.

Processing Request

اقرأ أكثر حفظ في قائمتي

المؤلفون: Mehdi Achour, El; Malgouyres, François; Mamalet, Franck
المصدر:
ISSN: 1532-4435.
الموضوع:
Convolutional layers; Orthogonality; Deep learning theory; Vanishing/Exploding gradient; Robustness; [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG]; [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]; [INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]; [MATH.MATH-OC]Mathematics [math]/Optimization and Control [math.OC]; [STAT.ML]Statistics [stat]/Machine Learning [stat.ML]
نوع التسجيلة:
article in journal/newspaper
اللغة:
English

معلومة اضافية
- Contributors:
  Institut de Mathématiques de Toulouse UMR5219 (IMT); Université Toulouse Capitole (UT Capitole); Université de Toulouse (UT)-Université de Toulouse (UT)-Institut National des Sciences Appliquées - Toulouse (INSA Toulouse); Institut National des Sciences Appliquées (INSA)-Université de Toulouse (UT)-Institut National des Sciences Appliquées (INSA)-Université Toulouse - Jean Jaurès (UT2J); Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3); Université de Toulouse (UT)-Centre National de la Recherche Scientifique (CNRS); IRT Saint Exupéry - Institut de Recherche Technologique; ANR-19-P3IA-0004,ANITI,Artificial and Natural Intelligence Toulouse Institute(2019)
- بيانات النشر:
  HAL CCSD
  Microtome Publishing
- الموضوع:
  2022
- نبذة مختصرة :
  International audience ; Imposing orthogonality on the layers of neural networks is known to facilitate the learning by limiting the exploding/vanishing of the gradient; decorrelate the features; improve the robustness. This paper studies the theoretical properties of orthogonal convolutional layers.We establish necessary and sufficient conditions on the layer architecture guaranteeing the existence of an orthogonal convolutional transform. The conditions prove that orthogonal convolutional transforms exist for almost all architectures used in practice for 'circular' padding.We also exhibit limitations with 'valid' boundary conditions and 'same' boundary conditions with zero-padding.Recently, a regularization term imposing the orthogonality of convolutional layers has been proposed, and impressive empirical results have been obtained in different applications (Wang et al. 2020).The second motivation of the present paper is to specify the theory behind this.We make the link between this regularization term and orthogonality measures. In doing so, we show that this regularization strategy is stable with respect to numerical and optimization errors and that, in the presence of small errors and when the size of the signal/image is large, the convolutional layers remain close to isometric.The theoretical results are confirmed with experiments and the landscape of the regularization term is studied. Experiments on real data sets show that when orthogonality is used to enforce robustness, the parameter multiplying the regularization termcan be used to tune a tradeoff between accuracy and orthogonality, for the benefit of both accuracy and robustness.Altogether, the study guarantees that the regularization proposed in Wang et al. (2020) is an efficient, flexible and stable numerical strategy to learn orthogonal convolutional layers.
- Relation:
  info:eu-repo/semantics/altIdentifier/arxiv/2108.05623; hal-03315801; https://hal.science/hal-03315801; https://hal.science/hal-03315801v3/document; https://hal.science/hal-03315801v3/file/main.pdf; ARXIV: 2108.05623
- Rights:
  info:eu-repo/semantics/OpenAccess
- الرقم المعرف:
  edsbas.A7DE9019

تعليقات

No Comments.

Existence, Stability and Scalability of Orthogonal Convolutional Neural Networks

اتصل بنا

اتبع