{"id":766,"date":"2016-09-16T15:24:00","date_gmt":"2016-09-16T13:24:00","guid":{"rendered":"https:\/\/samovar2022.int-evry.fr\/index.php\/2016\/09\/16\/modelisation-integratrice-du-traitement-big-data\/"},"modified":"2020-09-04T18:46:11","modified_gmt":"2020-09-04T16:46:11","slug":"modelisation-integratrice-du-traitement-big-data","status":"publish","type":"post","link":"https:\/\/samovar.telecom-sudparis.eu\/index.php\/2016\/09\/16\/modelisation-integratrice-du-traitement-big-data\/","title":{"rendered":"\u00ab Mod\u00e9lisation int\u00e9gratrice du traitement Big Data \u00bb"},"content":{"rendered":"<p>Quand: LUNDI 19 SEPTEMBRE 2016 \u00e0 14h00<br \/>\nO\u00f9: en A003 \u00e0 T\u00e9l\u00e9com SudParis 9 rue Charles Fourier 91011 Evry Cedex<\/p>\n<p><strong>Membres du jury :<\/strong><\/p>\n<p>Mme Ana CAVALLI, Professeure, T\u00e9l\u00e9com SudParis, FRANCE &#8211; Directeur de these<br \/>\nMme No\u00ebmie SIMONI, Professeure, Telecom ParisTech, FRANCE &#8211; Examinateur<br \/>\nMme Karine ZEITOUNI, Professeure, Universit\u00e9 de Versaille St Quentin (UVSQ), FRANCE &#8211; Examinateur<br \/>\nM. Daniel RANC, Ing\u00e9nieur d&rsquo;\u00e9tudes, T\u00e9l\u00e9com SudParis, FRANCE &#8211; Co-encadrant de these<br \/>\nMme Genoveva VARGAS-SOLAR, Professeure, Laboratory of Informatics of Grenoble, FRANCE &#8211; Examinateur<br \/>\nM. Florent MASSEGLIA, Professeur, Universit\u00e9 Montpellier 2 &#8211; Campus St Priest, FRANCE &#8211; Examinateur<\/p>\n<p><strong>Rapporteurs :<\/strong><\/p>\n<p>Madame Michelle SIBILLA, Professeure, IRIT &#8211; Toulouse &#8211; FRANCE<\/p>\n<p>Monsieur Laurent D&rsquo;ORAZIO, Ma\u00eetre de conf\u00e9rences, HDR &#8211; Universit\u00e9 de Clermont-Ferrand &#8211; FRANCE<\/p>\n<p><em>R\u00e9sum\u00e9 :<\/em><\/p>\n<p>Dans le monde d\u2019aujourd\u2019hui de multiples acteurs de la technologie num\u00e9rique produisent des quantit\u00e9s infinies de donn\u00e9es. Capteurs, r\u00e9seaux sociaux ou e-commerce, ils g\u00e9n\u00e8rent tous de l\u2019information qui s\u2019incr\u00e9mente en temps-r\u00e9el selon les 3 V de Gartner : en Volume, en Vitesse et en Variabilit\u00e9. Afin d\u2019exploiter efficacement et durablement ces donn\u00e9es, il est important de respecter la dynamicit\u00e9 de leur \u00e9volution chronologique au moyen de deux approches : le polymorphisme d\u2019une part, au moyen d\u2019un mod\u00e8le dynamique capable de supporter le changement de type \u00e0 chaque instant sans failles de traitement ; d\u2019autre part le support de la volatilit\u00e9 par un mod\u00e8le intelligent prenant en compte des donn\u00e9es cl\u00e9 seulement interpr\u00e9tables \u00e0 un instant \u00ab t \u00bb, au lieu de traiter toute la volum\u00e9trie des donn\u00e9es actuelle et historique. L\u2019objectif premier de cette \u00e9tude est de pouvoir \u00e9tablir au moyen de ces approches une vision int\u00e9gratrice du cycle de vie des donn\u00e9es qui s\u2019\u00e9tablit selon 3 \u00e9tapes, (1) la synth\u00e8se des donn\u00e9es via la s\u00e9lection des valeurs-cl\u00e9s des micro-donn\u00e9es acquises par les diff\u00e9rents op\u00e9rateurs au niveau de la source, (2) la fusion en faisant le tri des valeurs-cl\u00e9s s\u00e9lectionn\u00e9es et les dupliquant suivant un aspect de d\u00e9-normalisation afin d\u2019obtenir un traitement plus rapide des donn\u00e9es et (3) la transformation en un format particulier de carte de cartes de cartes, via Hadoop dans le processus classique de MapReduce afin d\u2019obtenir un graphe d\u00e9fini dans la couche applicative. Cette r\u00e9flexion est en outre soutenue par un prototype logiciel mettant en \u0153uvre les op\u00e9rateurs de mod\u00e9lisation sus-d\u00e9crits et aboutissant \u00e0 une bo\u00eete \u00e0 outils de mod\u00e9lisation comparable \u00e0 un AGL et, permettant une mise en place assist\u00e9e d&rsquo;un ou plusieurs traitements sur BigData.<\/p>\n<p><em>Abstract :<\/em><\/p>\n<p>Nowadays, multiple actors of Internet technology are producing very large amounts of data. Sensors, social media or e-commerce, all generate real-time extending information based on the 3 Vs of Gartner: Volume, Velocity and Variety. In order to efficiently exploit this data, it is important to keep track of the dynamic aspect of their chronological evolution by means of two main approaches: the polymorphism, a dynamic model able to support type changes every second with a successful processing and second, the support of data volatility by means of an intelligent model taking in consideration key-data, salient and valuable at a specific moment without processing all volumes of history and up to date data. The primary goal of this study is to establish, based on these approaches, an integrative vision of data life cycle set on 3 steps, (1) data synthesis by selecting key-values of micro-data acquired by different data source operators, (2) data fusion by sorting and duplicating the selected key-values based on a de-normalization aspect in order to get a faster processing of data and (3) the data transformation into a specific format of map of maps of maps, via Hadoop in the standard MapReduce process, in order to define the related graph in applicative layer. In addition, this study is supported by a software prototype using the already described modeling tools, as a toolbox compared to an automatic programming software and allowing to create a customized processing chain of BigData.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Quand: LUNDI 19 SEPTEMBRE 2016 \u00e0 14h00 O\u00f9: en A003 \u00e0 T\u00e9l\u00e9com SudParis 9 rue Charles Fourier 91011 Evry Cedex Membres du jury : Mme Ana CAVALLI, Professeure, T\u00e9l\u00e9com SudParis, FRANCE &#8211; Directeur de these Mme No\u00ebmie SIMONI, Professeure, Telecom ParisTech, FRANCE &#8211; Examinateur Mme Karine ZEITOUNI, Professeure, Universit\u00e9 de Versaille St Quentin (UVSQ), FRANCE [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":765,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"ocean_post_layout":"","ocean_both_sidebars_style":"","ocean_both_sidebars_content_width":0,"ocean_both_sidebars_sidebars_width":0,"ocean_sidebar":"","ocean_second_sidebar":"","ocean_disable_margins":"enable","ocean_add_body_class":"","ocean_shortcode_before_top_bar":"","ocean_shortcode_after_top_bar":"","ocean_shortcode_before_header":"","ocean_shortcode_after_header":"","ocean_has_shortcode":"","ocean_shortcode_after_title":"","ocean_shortcode_before_footer_widgets":"","ocean_shortcode_after_footer_widgets":"","ocean_shortcode_before_footer_bottom":"","ocean_shortcode_after_footer_bottom":"","ocean_display_top_bar":"default","ocean_display_header":"default","ocean_header_style":"","ocean_center_header_left_menu":"","ocean_custom_header_template":"","ocean_custom_logo":0,"ocean_custom_retina_logo":0,"ocean_custom_logo_max_width":0,"ocean_custom_logo_tablet_max_width":0,"ocean_custom_logo_mobile_max_width":0,"ocean_custom_logo_max_height":0,"ocean_custom_logo_tablet_max_height":0,"ocean_custom_logo_mobile_max_height":0,"ocean_header_custom_menu":"","ocean_menu_typo_font_family":"","ocean_menu_typo_font_subset":"","ocean_menu_typo_font_size":0,"ocean_menu_typo_font_size_tablet":0,"ocean_menu_typo_font_size_mobile":0,"ocean_menu_typo_font_size_unit":"px","ocean_menu_typo_font_weight":"","ocean_menu_typo_font_weight_tablet":"","ocean_menu_typo_font_weight_mobile":"","ocean_menu_typo_transform":"","ocean_menu_typo_transform_tablet":"","ocean_menu_typo_transform_mobile":"","ocean_menu_typo_line_height":0,"ocean_menu_typo_line_height_tablet":0,"ocean_menu_typo_line_height_mobile":0,"ocean_menu_typo_line_height_unit":"","ocean_menu_typo_spacing":0,"ocean_menu_typo_spacing_tablet":0,"ocean_menu_typo_spacing_mobile":0,"ocean_menu_typo_spacing_unit":"","ocean_menu_link_color":"","ocean_menu_link_color_hover":"","ocean_menu_link_color_active":"","ocean_menu_link_background":"","ocean_menu_link_hover_background":"","ocean_menu_link_active_background":"","ocean_menu_social_links_bg":"","ocean_menu_social_hover_links_bg":"","ocean_menu_social_links_color":"","ocean_menu_social_hover_links_color":"","ocean_disable_title":"default","ocean_disable_heading":"default","ocean_post_title":"","ocean_post_subheading":"","ocean_post_title_style":"","ocean_post_title_background_color":"","ocean_post_title_background":0,"ocean_post_title_bg_image_position":"","ocean_post_title_bg_image_attachment":"","ocean_post_title_bg_image_repeat":"","ocean_post_title_bg_image_size":"","ocean_post_title_height":0,"ocean_post_title_bg_overlay":0.5,"ocean_post_title_bg_overlay_color":"","ocean_disable_breadcrumbs":"default","ocean_breadcrumbs_color":"","ocean_breadcrumbs_separator_color":"","ocean_breadcrumbs_links_color":"","ocean_breadcrumbs_links_hover_color":"","ocean_display_footer_widgets":"default","ocean_display_footer_bottom":"default","ocean_custom_footer_template":"","ocean_post_oembed":"","ocean_post_self_hosted_media":"","ocean_post_video_embed":"","ocean_link_format":"","ocean_link_format_target":"self","ocean_quote_format":"","ocean_quote_format_link":"post","ocean_gallery_link_images":"on","ocean_gallery_id":[],"footnotes":""},"categories":[350],"tags":[],"class_list":["post-766","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-theses-2016-fr","entry","has-media"],"_links":{"self":[{"href":"https:\/\/samovar.telecom-sudparis.eu\/index.php\/wp-json\/wp\/v2\/posts\/766","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/samovar.telecom-sudparis.eu\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/samovar.telecom-sudparis.eu\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/samovar.telecom-sudparis.eu\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/samovar.telecom-sudparis.eu\/index.php\/wp-json\/wp\/v2\/comments?post=766"}],"version-history":[{"count":1,"href":"https:\/\/samovar.telecom-sudparis.eu\/index.php\/wp-json\/wp\/v2\/posts\/766\/revisions"}],"predecessor-version":[{"id":1667,"href":"https:\/\/samovar.telecom-sudparis.eu\/index.php\/wp-json\/wp\/v2\/posts\/766\/revisions\/1667"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/samovar.telecom-sudparis.eu\/index.php\/wp-json\/wp\/v2\/media\/765"}],"wp:attachment":[{"href":"https:\/\/samovar.telecom-sudparis.eu\/index.php\/wp-json\/wp\/v2\/media?parent=766"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/samovar.telecom-sudparis.eu\/index.php\/wp-json\/wp\/v2\/categories?post=766"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/samovar.telecom-sudparis.eu\/index.php\/wp-json\/wp\/v2\/tags?post=766"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}