A Halfspace-Mass Depth-Based Method for Adversarial Attack Detection - Pôle Télécoms et Réseaux Access content directly
Journal Articles Transactions on Machine Learning Research Journal Year : 2023

A Halfspace-Mass Depth-Based Method for Adversarial Attack Detection

Abstract

Despite the widespread use of deep learning algorithms, vulnerability to adversarial attacks is still an issue limiting their use in critical applications. Detecting these attacks is thus crucial to build reliable algorithms and has received increasing attention in the last few years. In this paper, we introduce the HalfspAce Mass dePth dEtectoR (HAMPER), a new method to detect adversarial examples by leveraging the concept of data depths, a statistical notion that provides center-outward ordering of points with respect to (w.r.t.) a probability distribution. In particular, the halfspace-mass (HM) depth exhibits attractive properties which makes it a natural candidate for adversarial attack detection in high-dimensional spaces. Additionally, HM is non differentiable making it harder for attackers to directly attack HAMPER via gradient based-methods. We evaluate HAMPER in the context of supervised adversarial attacks detection across four benchmark datasets. Overall, we empirically show that HAMPER consistently outperforms SOTA methods. In particular, the gains are 13.1% (29.0%) in terms of AUROC↑ (resp. FPR ↓95%) on SVHN, 14.6% (25.7%) on CIFAR10 and 22.6% (49.0%) on CIFAR100 compared to the best performing method.
Fichier principal
Vignette du fichier
451_a_halfspace_mass_depth_based_m.pdf (3.27 Mo) Télécharger le fichier
Origin Files produced by the author(s)

Dates and versions

hal-04575113 , version 1 (14-05-2024)

Identifiers

  • HAL Id : hal-04575113 , version 1

Cite

Marine Picot, Federica Granese, Guillaume Staerman, Marco Romanelli, Francisco Messina, et al.. A Halfspace-Mass Depth-Based Method for Adversarial Attack Detection. Transactions on Machine Learning Research Journal, 2023. ⟨hal-04575113⟩
39 View
16 Download

Share

Gmail Mastodon Facebook X LinkedIn More