{"id":1200,"date":"2019-06-14T15:37:00","date_gmt":"2019-06-14T13:37:00","guid":{"rendered":"https:\/\/samovar2022.int-evry.fr\/index.php\/2019\/06\/14\/detection-des-anomalies-sur-les-donnees-en-vol-en-temps-reel-avec-des-agents-communicants-heterogenes\/"},"modified":"2020-09-04T18:45:19","modified_gmt":"2020-09-04T16:45:19","slug":"detection-des-anomalies-sur-les-donnees-en-vol-en-temps-reel-avec-des-agents-communicants-heterogenes","status":"publish","type":"post","link":"https:\/\/samovar.telecom-sudparis.eu\/index.php\/2019\/06\/14\/detection-des-anomalies-sur-les-donnees-en-vol-en-temps-reel-avec-des-agents-communicants-heterogenes\/","title":{"rendered":"D\u00e9tection des anomalies sur les donn\u00e9es en vol en temps r\u00e9el avec des agents communicants h\u00e9t\u00e9rog\u00e8nes"},"content":{"rendered":"<p>L&rsquo;Ecole doctorale : Sciences et Technologies de l&rsquo;Information et de la Communication et le Laboratoire de recherche SAMOVAR pr\u00e9sentent l\u2019AVIS DE SOUTENANCE de Monsieur Nicolas AUSSEL<\/p>\n<p>Autoris\u00e9 \u00e0 pr\u00e9senter ses travaux en vue de l\u2019obtention du Doctorat de l&rsquo;Universit\u00e9 Paris-Saclay, pr\u00e9par\u00e9 \u00e0 T\u00e9l\u00e9com SudParis en :<br \/>\nMath\u00e9matiques et Informatique<br \/>\n\u00ab D\u00e9tection des anomalies sur les donn\u00e9es en vol en temps r\u00e9el avec des agents communicants h\u00e9t\u00e9rog\u00e8nes \u00bb<\/p>\n<p><strong>le VENDREDI 21 JUIN 2019 \u00e0 10h00<\/p>\n<p>Salle A003<\/p>\n<p>T\u00e9l\u00e9com SudParis &#8211; 9 Rue Charles Fourier 91011 Evry<\/strong><\/p>\n<p><strong>Membres du jury :<\/strong><\/p>\n<table>\n<tbody>\n<tr class='row_even'>\n<td>Mme Sophie CHABRIDON,<br \/>\nDirectrice d&rsquo;\u00e9tudes, T\u00e9l\u00e9com SudParis, FRANCE <\/td>\n<td>Directrice de th\u00e8se<\/td>\n<\/tr>\n<tr class='row_odd'>\n<td>M. Yohan PETETIN,<br \/>\nMa\u00eetre de conf\u00e9rences, T\u00e9l\u00e9com SudParis, FRANCE<\/td>\n<td>Encadrant<\/td>\n<\/tr>\n<tr class='row_even'>\n<td>M. Pierre SENS,<br \/>\nProfesseur, Universit\u00e9 Paris 6, FRANCE<\/td>\n<td>Rapporteur<\/td>\n<\/tr>\n<tr class='row_odd'>\n<td>M. Mustapha LEBBAH,<br \/>\nAssociate Professor, Universit\u00e9 Paris 13, FRANCE<\/td>\n<td>Rapporteur<\/td>\n<\/tr>\n<tr class='row_even'>\n<td>M. Eric GRESSIER-SOUDAN,<br \/>\nProfesseur, CNAM, FRANCE <\/td>\n<td>Examinateur<\/td>\n<\/tr>\n<tr class='row_odd'>\n<td>Mme Mathilde MOUGEOT,<br \/>\nProfesseur, ENSIIE, FRANCE<\/td>\n<td>Examinatrice<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p><strong>R\u00e9sum\u00e9 :<\/strong><\/p>\n<p>Avec l&rsquo;augmentation du nombre de capteurs et d&rsquo;actuateurs dans les avions et le d\u00e9veloppement de liaisons de donn\u00e9es fiables entre les avions et le sol, il est devenu possible d&rsquo;am\u00e9liorer la s\u00e9curit\u00e9 et la fiabilit\u00e9 des syst\u00e8mes \u00e0 bord en appliquant des techniques d&rsquo;analyse en temps r\u00e9el. Cependant, \u00e9tant donn\u00e9 la disponibilit\u00e9 limit\u00e9 des ressources de calcul embarqu\u00e9es et le co\u00fbt \u00e9lev\u00e9 des liaisons de donn\u00e9es, les solutions architecturelles actuelles ne peuvent pas exploiter pleinement toutes les ressources disponibles, limitant leur pr\u00e9cision. Notre but est de proposer un algorithme distribu\u00e9 de pr\u00e9diction de panne qui pourrait \u00eatre ex\u00e9cut\u00e9 \u00e0 la fois \u00e0 bord de l&rsquo;avion et dans une station au sol tout en respectant un budget de communication. Dans cette approche, la station au sol disposerait de ressources de calcul rapides et de donn\u00e9es historiques et l&rsquo;avion disposerait de ressources de calcul limit\u00e9es et des donn\u00e9es de vol actuelles. Dans cette th\u00e8se, nous \u00e9tudierons les sp\u00e9cificit\u00e9s des donn\u00e9es a\u00e9ronautiques et les m\u00e9thodes d\u00e9j\u00e0 existantes pour produire des pr\u00e9dictions de pannes \u00e0 partir de ces derni\u00e8res et nous proposerons une solution au probl\u00e8me pos\u00e9. Notre contribution sera d\u00e9taill\u00e9 en trois parties. Premi\u00e8rement, nous \u00e9tudierons le probl\u00e8me de pr\u00e9diction d&rsquo;\u00e9v\u00e9nements rares cr\u00e9\u00e9 par la haute fiabilit\u00e9 des syst\u00e8mes a\u00e9ronautiques. Beaucoup de m\u00e9thodes d&rsquo;apprentissage en classification reposent sur des jeux de donn\u00e9es \u00e9quilibr\u00e9s. Plusieurs approches existent pour corriger le d\u00e9s\u00e9quilibre d&rsquo;un jeu de donn\u00e9e et nous \u00e9tudierons leur efficacit\u00e9 sur des jeux de donn\u00e9es extr\u00eamement d\u00e9s\u00e9quilibr\u00e9s. Deuxi\u00e8mement, nous \u00e9tudierons le probl\u00e8me d&rsquo;analyse textuelle de journaux car de nombreux syst\u00e8mes a\u00e9ronautiques ne produisent pas d&rsquo;\u00e9tiquettes ou de valeurs num\u00e9riques faciles \u00e0 interpr\u00e9ter mais des messages de journaux textuels. Nous \u00e9tudierons les m\u00e9thodes existantes bas\u00e9es sur une approche statistique et sur l&rsquo;apprentissage profond pour convertir des messages de journaux textuels en une forme utilisable en entr\u00e9e d&rsquo;algorithmes d&rsquo;apprentissage pour classification. Nous proposerons notre propre m\u00e9thode bas\u00e9e sur le traitement du langage naturel et montrerons comment ses performances d\u00e9passent celles des autres m\u00e9thodes sur un jeu de donn\u00e9e public standard. Enfin, nous offrirons une solution au probl\u00e8me pos\u00e9 en proposant un nouvel algorithme d&rsquo;apprentissage distribu\u00e9 s&rsquo;appuyant sur deux paradigmes d&rsquo;apprentissage existant, l&rsquo;apprentissage actif et l&rsquo;apprentissage f\u00e9d\u00e9r\u00e9. Nous d\u00e9taillerons notre algorithme, son impl\u00e9mentation et fournirons une comparaison de ses performances avec les m\u00e9thodes existantes.<\/p>\n<p><strong>Abstract :<\/strong><\/p>\n<p>With the rise of the number of sensors and actuators in an aircraft and the development of reliable data links from the aircraft to the ground, it becomes possible to improve aircraft security and maintainability by applying real-time analysis techniques. However, given the limited availability of on-board computing and the high cost of the data links, current architectural solutions cannot fully leverage all the available resources limiting their accuracy. Our goal is to provide a distributed algorithm for failure prediction that could be executed both on-board of the aircraft and on a ground station and that would produce on-board failure predictions in near real-time under a communication budget. In this approach, the ground station would hold fast computation resources and historical data and the aircraft would hold limited computational resources and current flight&rsquo;s data. In this thesis, we will study the specificities of aeronautical data and what methods already exist to produce failure prediction from them and propose a solution to the problem stated. Our contribution will be detailed in three main parts. First, we will study the problem of rare event prediction created by the high reliability of aeronautical systems. Many learning methods for classifiers rely on balanced datasets. Several approaches exist to correct a dataset imbalance and we will study their efficiency on extremely imbalanced datasets. Second, we study the problem of log parsing as many aeronautical systems do not produce easy to classify labels or numerical values but log messages in full text. We will study existing methods based on a statistical approach and on Deep Learning to convert full text log messages into a form usable as an input by learning algorithms for classifiers. We will then propose our own method based on Natural Language Processing and show how it outperforms the other approaches on a public benchmark. Last, we offer a solution to the stated problem by proposing a new distributed learning algorithm that relies on two existing learning paradigms Active Learning and Federated Learning. We detail our algorithm, its implementation and provide a comparison of its performance with existing methods<\/p>\n","protected":false},"excerpt":{"rendered":"<p>L&rsquo;Ecole doctorale : Sciences et Technologies de l&rsquo;Information et de la Communication et le Laboratoire de recherche SAMOVAR pr\u00e9sentent l\u2019AVIS DE SOUTENANCE de Monsieur Nicolas AUSSEL Autoris\u00e9 \u00e0 pr\u00e9senter ses travaux en vue de l\u2019obtention du Doctorat de l&rsquo;Universit\u00e9 Paris-Saclay, pr\u00e9par\u00e9 \u00e0 T\u00e9l\u00e9com SudParis en : Math\u00e9matiques et Informatique \u00ab D\u00e9tection des anomalies sur les [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"ocean_post_layout":"","ocean_both_sidebars_style":"","ocean_both_sidebars_content_width":0,"ocean_both_sidebars_sidebars_width":0,"ocean_sidebar":"","ocean_second_sidebar":"","ocean_disable_margins":"enable","ocean_add_body_class":"","ocean_shortcode_before_top_bar":"","ocean_shortcode_after_top_bar":"","ocean_shortcode_before_header":"","ocean_shortcode_after_header":"","ocean_has_shortcode":"","ocean_shortcode_after_title":"","ocean_shortcode_before_footer_widgets":"","ocean_shortcode_after_footer_widgets":"","ocean_shortcode_before_footer_bottom":"","ocean_shortcode_after_footer_bottom":"","ocean_display_top_bar":"default","ocean_display_header":"default","ocean_header_style":"","ocean_center_header_left_menu":"","ocean_custom_header_template":"","ocean_custom_logo":0,"ocean_custom_retina_logo":0,"ocean_custom_logo_max_width":0,"ocean_custom_logo_tablet_max_width":0,"ocean_custom_logo_mobile_max_width":0,"ocean_custom_logo_max_height":0,"ocean_custom_logo_tablet_max_height":0,"ocean_custom_logo_mobile_max_height":0,"ocean_header_custom_menu":"","ocean_menu_typo_font_family":"","ocean_menu_typo_font_subset":"","ocean_menu_typo_font_size":0,"ocean_menu_typo_font_size_tablet":0,"ocean_menu_typo_font_size_mobile":0,"ocean_menu_typo_font_size_unit":"px","ocean_menu_typo_font_weight":"","ocean_menu_typo_font_weight_tablet":"","ocean_menu_typo_font_weight_mobile":"","ocean_menu_typo_transform":"","ocean_menu_typo_transform_tablet":"","ocean_menu_typo_transform_mobile":"","ocean_menu_typo_line_height":0,"ocean_menu_typo_line_height_tablet":0,"ocean_menu_typo_line_height_mobile":0,"ocean_menu_typo_line_height_unit":"","ocean_menu_typo_spacing":0,"ocean_menu_typo_spacing_tablet":0,"ocean_menu_typo_spacing_mobile":0,"ocean_menu_typo_spacing_unit":"","ocean_menu_link_color":"","ocean_menu_link_color_hover":"","ocean_menu_link_color_active":"","ocean_menu_link_background":"","ocean_menu_link_hover_background":"","ocean_menu_link_active_background":"","ocean_menu_social_links_bg":"","ocean_menu_social_hover_links_bg":"","ocean_menu_social_links_color":"","ocean_menu_social_hover_links_color":"","ocean_disable_title":"default","ocean_disable_heading":"default","ocean_post_title":"","ocean_post_subheading":"","ocean_post_title_style":"","ocean_post_title_background_color":"","ocean_post_title_background":0,"ocean_post_title_bg_image_position":"","ocean_post_title_bg_image_attachment":"","ocean_post_title_bg_image_repeat":"","ocean_post_title_bg_image_size":"","ocean_post_title_height":0,"ocean_post_title_bg_overlay":0.5,"ocean_post_title_bg_overlay_color":"","ocean_disable_breadcrumbs":"default","ocean_breadcrumbs_color":"","ocean_breadcrumbs_separator_color":"","ocean_breadcrumbs_links_color":"","ocean_breadcrumbs_links_hover_color":"","ocean_display_footer_widgets":"default","ocean_display_footer_bottom":"default","ocean_custom_footer_template":"","ocean_post_oembed":"","ocean_post_self_hosted_media":"","ocean_post_video_embed":"","ocean_link_format":"","ocean_link_format_target":"self","ocean_quote_format":"","ocean_quote_format_link":"post","ocean_gallery_link_images":"on","ocean_gallery_id":[],"footnotes":""},"categories":[276],"tags":[],"class_list":["post-1200","post","type-post","status-publish","format-standard","hentry","category-theses-2020-fr","entry"],"_links":{"self":[{"href":"https:\/\/samovar.telecom-sudparis.eu\/index.php\/wp-json\/wp\/v2\/posts\/1200","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/samovar.telecom-sudparis.eu\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/samovar.telecom-sudparis.eu\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/samovar.telecom-sudparis.eu\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/samovar.telecom-sudparis.eu\/index.php\/wp-json\/wp\/v2\/comments?post=1200"}],"version-history":[{"count":1,"href":"https:\/\/samovar.telecom-sudparis.eu\/index.php\/wp-json\/wp\/v2\/posts\/1200\/revisions"}],"predecessor-version":[{"id":1453,"href":"https:\/\/samovar.telecom-sudparis.eu\/index.php\/wp-json\/wp\/v2\/posts\/1200\/revisions\/1453"}],"wp:attachment":[{"href":"https:\/\/samovar.telecom-sudparis.eu\/index.php\/wp-json\/wp\/v2\/media?parent=1200"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/samovar.telecom-sudparis.eu\/index.php\/wp-json\/wp\/v2\/categories?post=1200"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/samovar.telecom-sudparis.eu\/index.php\/wp-json\/wp\/v2\/tags?post=1200"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}