An empirical approach to machine learning: algorithm selection, hyperparameter optimization and automatic principal design
Auteur / Autrice : | Sourava prasad Mishra |
Direction : | Balazs Kegl, Michèle Sebag |
Type : | Projet de thèse |
Discipline(s) : | Informatique |
Date : | Inscription en doctorat le 01/11/2014 |
Etablissement(s) : | université Paris-Saclay |
Ecole(s) doctorale(s) : | École doctorale Sciences et technologies de l'information et de la communication (Orsay, Essonne ; 2015-....) |
Partenaire(s) de recherche : | Laboratoire : LAL - Laboratoire de l'Accélérateur Linéaire |
Equipe de recherche : Applied Statistics and machine learning (AppStat) | |
établissement de préparation de la thèse : Université Paris-Sud (1970-2019) |
Mots clés
Résumé
In this thesis project we propose to apply the scientific method to machine learning. We will explore two lines of research. In the first we will build on recent work applying modern experimental design for algorithm selection and hyperparameter tuning. The main thrust of this sub-project is the multi-problem approach: we will explore the interaction between methods (and hyperparameters) and data sets to find out whether and to what extent experience can be generalized across data sets. The output of this project is a toolbox for practitioners and a stockpile of knowledge on what algorithm works on what (kind of) data sets. This second output will feed into the second line of research: we will ask the question of emph{why} certain methods work on certain data sets. We will study algorithms as natural phenomena, form hypotheses, design and evaluate experiments, and carry out measurements that could validate or refute our hypotheses.