Séminaire de l’équipe SOP présenté par Xavier Erny à 14h et Elouan Argouarc’h à 15h le 16/05/2024, amphi C06 à Evry

14h00 Xavier Erny

Titre : Équations différentielles stochastiques avec des sauts et modèles champ-moyens

Résumé : Le but de la présentation est d’introduire la notion de processus ponctuels et d’équations différentielles stochastiques à sauts, ainsi que leurs utilisations dans les modèles. Nous nous intéresserons principalement à des modèles de systèmes de particules de type champ-moyen (que nous introduirons), et à l’étude de ces systèmes quand le nombre de particules tend vers l’infini. Pour cela nous utiliserons la notion de générateur infinitésimal de processus de Markov à la fois pour des processus à valeurs réelles et des processus à valeurs mesures. Il n’y a pas de prérequis pour suivre la présentation, les concepts principaux seront introduits.

15h00 Elouan Argouarc’h

Titre: Generative vs. discriminative modelling under the lens of uncertainty quantification

Abstract: Learning a parametric model from a given dataset indeed enables to capture intrinsic dependencies between random variables via a parametric conditional probability distribution and in turn predict the value of a label variable given observed variables. In this paper, we undertake a comparative analysis of generative and discriminative approaches which differ in their construction and the structure of the underlying inference problem. Our objective is to compare the ability of both approaches to leverage information from various sources in an epistemic uncertainty aware inference via the posterior predictive distribution. We assess the role of a prior distribution, explicit in the generative case and implicit in the discriminative case, leading to a discussion about discriminative models suffering from imbalance dataset. We then discuss the role of the observed variable which plays a double part in the generative case since the model parameters also indeed depend on this random variable, and we subsequently build upon this argument to discuss the compatibility of both approaches with the problem of semi-supervised learning. We also provide with practical insights and we examine how the modelling choice impacts the sampling from the posterior predictive distribution. With regard to this, we propose a general sampling scheme enabling supervised learning for both approaches, as well as semi-supervised learning when compatible with the considered modelling approach. Throughout this paper, we illustrate our arguments and conclusions using the example of affine regression and validate our comparative analysis through classification simulations using neural network-based models.