A lower bound and a near-optimal algorithm for bilevel empirical risk minimization

Mathieu Dagréou; Thomas Moreau; Samuel Vaiter; Pierre Ablin

doi:10.48550/arXiv.2302.08766

Conference paper, Year: 2024

Mathieu Dagréou

Role: Corresponding author
PersonId: 1125582
IdHAL: mathieu-dagreou
ORCID: 0000-0002-6578-2213

Modèles et inférence pour les données de Neuroimagerie

Thomas Moreau

Role: Author
PersonId: 171108
IdHAL: tommoral
ORCID: 0000-0002-1523-3419

Modèles et inférence pour les données de Neuroimagerie

Samuel Vaiter

Role: Author
PersonId: 1995
IdHAL: samuel-vaiter
ORCID: 0000-0002-4077-708X
IdRef: 182993116

Centre National de la Recherche Scientifique

Laboratoire Jean Alexandre Dieudonné

Pierre Ablin

Role: Author

Apple Inc

Abstract

Bilevel optimization problems, in which one optimization problem is nested inside another, have a growing number of applications in machine learning. In many practical cases, the upper and the lower objectives correspond to empirical risk minimization problems and therefore have a sum structure. In this context, we propose a bilevel extension of the celebrated SARAH algorithm. We demonstrate that the algorithm requires $\mathcal{O}((n+m)^{\frac12}\varepsilon^{-1})$ gradient computations to achieve $\varepsilon$-stationarity, where $n+m$ is the total number of samples, which improves over all previous bilevel algorithms. Moreover, we provide a lower bound on the number of oracle calls required to get an approximate stationary point of the objective function of the bilevel problem. This lower bound is attained by our algorithm, which is therefore optimal in terms of sample complexity.
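
For readers who want the abstract's setting in symbols, a minimal sketch follows. The notation ($F_j$, $G_i$, $z^*$, $h$, $d$, $p$) is an assumption introduced here for illustration and does not appear on this page; it follows a common convention for bilevel problems with finite-sum objectives. Which of $m$ and $n$ counts upper-level samples and which counts lower-level samples is likewise assumed.

$$
\min_{x \in \mathbb{R}^d} \; h(x) = F(z^*(x), x) = \frac{1}{m} \sum_{j=1}^{m} F_j(z^*(x), x)
\quad \text{subject to} \quad
z^*(x) \in \operatorname*{arg\,min}_{z \in \mathbb{R}^p} \; G(z, x) = \frac{1}{n} \sum_{i=1}^{n} G_i(z, x).
$$

Under this notation, and assuming stationarity is measured in squared gradient norm (a convention consistent with the $\varepsilon^{-1}$ rate, but not stated in the abstract), a point $x$ would be called $\varepsilon$-stationary when

$$
\mathbb{E}\left[\|\nabla h(x)\|^2\right] \le \varepsilon .
$$

For reference, the single-level SARAH estimator that the abstract mentions maintains a recursive gradient estimate for a finite sum $f = \frac{1}{N}\sum_{i=1}^{N} f_i$, with a sample index $i_t$ drawn at each step and a step size $\eta$ (both symbols are illustrative):

$$
v_t = \nabla f_{i_t}(w_t) - \nabla f_{i_t}(w_{t-1}) + v_{t-1},
\qquad
w_{t+1} = w_t - \eta \, v_t ,
$$

with $v_0 = \nabla f(w_0)$ recomputed periodically. This is the classical single-level recursion only; the bilevel extension proposed by the authors is described in the paper itself and is not reproduced here.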

Keywords

Machine Learning (stat.ML); Machine Learning (cs.LG); Optimization and Control (math.OC); FOS: Computer and information sciences; FOS: Mathematics

Domains

Mathematics [math]; Computer Science [cs]

Main file

56.pdf (1)

Origin: Files produced by the author(s)

Contributor: Mathieu Dagréou

https://hal.science/hal-04302861

Submitted on: Monday, February 19, 2024, 14:45:27

Last modified on: Friday, February 7, 2025, 18:37:43

Dates and versions

hal-04302861, version 1 (23-11-2023)

hal-04302861, version 2 (19-02-2024)

hal-04302861, version 3 (19-02-2024)

License

Attribution

Identifiers

HAL Id: hal-04302861, version 3
arXiv: 2302.08766
DOI: 10.48550/arXiv.2302.08766

Cite

Mathieu Dagréou, Thomas Moreau, Samuel Vaiter, Pierre Ablin. A lower bound and a near-optimal algorithm for bilevel empirical risk minimization. International Conference on Artificial Intelligence and Statistics (AISTATS), May 2024, Valencia, Spain. ⟨10.48550/arXiv.2302.08766⟩. ⟨hal-04302861v3⟩

Collections

CEA CNRS INRIA INSMI DIEUDONNE INRIA2 UNIV-COTEDAZUR ANR GS-COMPUTER-SCIENCE ANR-IA

235 views

120 downloads
