From

Time -

Location LISN Site Plaine - Digitéo

Algorithmes Learning and Computation, Data Science, Thesis

Spotting expressivity bottlenecks in neural networks and fixing them by optimal architecture growth

Thesis directed by Sylvain Chevallier and Guillaume Charpiat

Speaker : Manon Verbockhaven

Abstract

We aim at adapting a neural network architecture during training, by detecting expressivity bottlenecks. This would allow to start from tiny architectures and make them grow as necessary, which would gain considerable training time and computational ressources compared to the standard approach where over-sized models are trained and then reduced. A promising mathematical formulation of the problem yields a way to directly precisely locate expressivity bottlenecks, at low computational cost, to the opposite of standard Auto-Deep-Learning techniques which proceed by trial and error via random architecture modifications (which is very costly).

Jury

  • Rémi Gribonval, Directeur de recherche Inria, ENS Lyon, rapporteur
  • Hervé Luga, Professeur Université Toulouse Jean Jaurès, rapporteur
  • David Picard, Directeur de recherche École des Ponts ParisTech
  • Aurélie Névéol, Directrice de recherche, CNRS, examinatrice
  • Guillaume Lecué, Professeur ESSEC, CY Cergy Paris Université, examinateur

Keywords

Neural networks,expressive power,low-rank approximation,optimization,

Publications

  • Article dans une revue, Article dans une revue

    Manon Verbockhaven, Théo Rudkiewicz, Sylvain Chevallier, Guillaume Charpiat. Growing Tiny Networks: Spotting Expressivity Bottlenecks and Fixing Them Optimally. Transactions on Machine Learning Research Journal, 2024. ⟨hal-04591472v2⟩

    AO, AO

    Year of publication

    Available in free access

  • Communication dans un congrès

    Stella Douka, Manon Verbockhaven, Théo Rudkiewicz, Stéphane Rivaud, François P. Landes, et al.. Growth strategies for arbitrary DAG neural architectures. ESANN 2025 – 33th European Symposium on Artificial Neural Networks, Apr 2025, Bruges, Belgium. ⟨hal-04902059v2⟩

    AO

    Year of publication

    Available in free access