The Data Science department brings together four teams with recognized and complementary expertise, covering the modeling, collection, management, analysis and construction of data and knowledge (A&O, Bioinfo, LaHDAK, Rocs), making it possible to explore synergies between expertise related to data, learning and optimization, particularly in connection with the fields of bioinformatics, IoT and data graphs.
Digital traces of all human activities are now available in all fields, data that is often massive, heterogeneous, dynamic and of variable quality (the 4 V’s of Big Data: Volume, Variety, Velocity, Veracity). Their exploitation leads to the definition of a fourth scientific paradigm: the design and validation of hypotheses, theoretical models and algorithms, guided by the data and in interaction with domain experts. The Data Science department is interested in robustly addressing the challenges of the 4Vs, in terms of scaling up in the face of data volume and velocity, and resisting diversity and quality bias. These goals define new computational issues in storage, communication, analysis and processing optimization, data query and enrichment, knowledge discovery, and model learning.
With nearly 40 researchers and teacher-researchers, the Data Science department covers a broad spectrum of fundamental and application-related topics: databases, data mining, semantic web, knowledge representation, algorithms, combinatorics, stochastic and distributed optimization, statistical learning and neural networks, communication networks, simulation. It also has extensive expertise in interdisciplinary research and dialogue with experts in the application domains (particularly in biology, medicine, human and social sciences, and experimental physics), allowing privileged access to data of interest and to the evaluation of models and algorithms.
Coordination
Algorithmes, apprentissage et calcul, Sciences des Données
Bang Liu, Jun Yao, Bo Ma, Zhihui Chen, Xiaozhe Zhu, et al.. Metal(loid)s diffusion pathway triggers distinct microbiota responses in key regions of typical karst non-ferrous smelting assembly. Journal of Hazardous Materials, 2022, 423, pp.127164. ⟨10.1016/j.jhazmat.2021.127164⟩. ⟨hal-05426585⟩
Abdulhakeem Abdulazeez. Development of an autonomous control protocol for a UAV fleet network using machine learning. Robotics [cs.RO]. Université Paris-Saclay, 2024. English. ⟨NNT : 2024UPASG107⟩. ⟨tel-05426943⟩
Petra Isenberg, Gunther Weber, Niklas Elmqvist, Narges Mahyar. Panel Proposal: IEEE VIS Reviewing -On a Path to Self-Destruction?. 2025. ⟨hal-05424383⟩
Luc Lebon, Paul Boniface, Chi-Tuong Pham, Laurent Limat. Allée de vortex de Bénard-von Kármán confinée : selection de longueur d’onde par les instabilités de kelvin-Helmholtz. 28e Rencontre du Non Linéaire, Mar 2025, Paris, France. ⟨hal-05412081⟩
Shwetha Salimath, Francesca Bugiotti, Sylvain Wlodarczyk. Responsible AI: Training Deep Learning Model Efficiently. 29th European Conference on Advances in Databases and Information Systems (ADBIS 2025), Sep 2025, Tampere, Finland. pp.63-78, ⟨10.1007/978-3-032-05281-0_5⟩. ⟨hal-05422803⟩
Tim Schneider, Charles Ménard-Wendling. Perturbatio — An artistic framework for the visualisation of the cumulative eco-impact of human activities. 12th International Conference on Digital and Interactive Arts: Media Art Cultures, Communities & Territories (ARTECH 2025), Nov 2025, Braga, Portugal. ⟨10.1145/3773699.3773920⟩. ⟨hal-05409627⟩
Pierre Fraigniaud, Minh Hang Nguyen, AmiArchitectures et modèles pour l'Interaction Paz. A Simple Lower Bound for Set Agreement in Dynamic Networks. 2025 Symposium on Simplicity in Algorithms (SOSA), Jan 2025, New Orleans, United States. pp.253-262, ⟨10.1137/1.9781611978315.20⟩. ⟨hal-05403931⟩
Alexandre Combeau, Fatiha Saïs, Nageeta Kumari, Stéphane Dervaux, Cristina Manfredotti, et al.. NutriKG -Un Graphe de Connaissances pour Modéliser les Préférences et les Besoins Nutritionnels. 36es Journées francophones d’Ingénierie des Connaissances, IC 2025, Dijon, France, July 2-4, 2025, Jul 2025, Dijon, France. pp.8-17. ⟨hal-05410354⟩
Fernando A Villanea, David Peede, Eli J Kaufman, Valeria Añorve-Garibay, Elizabeth T Chevy, et al.. The MUC19 gene: An evolutionary history of recurrent introgression and natural selection. Science, 2025, 389 (6762), pp.eadl0882. ⟨10.1126/science.adl0882⟩. ⟨hal-05410793⟩
Pierre Jehel, Stéphane Vialle. Collaborative Platform for Railway Projects – Business Needs Analysis and Their Formalization as Functional Requirements. 2023. ⟨hal-05371720⟩
Luc Pommeret, Sophie Rosset, Christophe Servan, Sahar Ghannay. AtomicEval: Evaluation Framework for Atomic Proposition Autonomy with French Propositioner. 10th Junior Conference on Data Sciences and Engineering, Sep 2025, Gif-sur-Yvette, France. . ⟨hal-05414939⟩
Nina Vittorelli, Cintia Gómez-Muñoz, Irina Andriushchenko, Louis Ollivier, Nicolas Agier, et al.. Repeated losses of self-fertility shaped heterozygosity and polyploidy in yeast evolution. 2025. ⟨hal-05404528⟩
Luc Lebon, Paul Boniface, Chi-Tuong Pham, Laurent Limat. Allée de vortex de Bénard-von Kármán confinée : sélection de longueur d’onde par les instabilités de kelvin-Helmholtz. Rencontre du Non Linéaire 2025 (RNL 2025), Mar 2025, Paris, France. . ⟨hal-05412053⟩
Tim Schneider, Céline Clavel, Gérard Kubryk, Michèle Gouiffès, Emmanuelle Frenoux, et al.. Creating and measuring immersion in open public spaces with Ariadne’s Fibres. IFAC-PapersOnLine, 59 (11), pp.252-257, 2025, Part of special issue : 2nd IFAC Workshop on Control of Complex Systems COSY 2025: Gif-sur-Yvette, France, June 30 – July 02, 2025, ⟨10.1016/j.ifacol.2025.09.557⟩. ⟨hal-05409345v2⟩
Tim Schneider, Gérard Kubryk, Fayçal Bouiddouh, Matthieu Courgeon, Vincent Hulot, et al.. A multi-touch multi-sensor multi-user visual and sound projection sphere to render $the eye of the sun$. 2nd IFAC Workshop on Control of Complex Systems COSY 2025, Jun 2025, Gif-sur Yvette, France. ⟨hal-05409518⟩
Béatrice Albert, Gabriel Poirier-Quinot, Gaële Misiak, Guillaume Junot, Izabela Faguet, et al.. A 360-degree immersive and interactive stage for collective Space-Time experiences. 2nd IFAC Workshop on Control of Complex Systems COSY 2025, Jun 2025, Gif-sur Yvette, France. ⟨hal-05409590⟩
Hao Li, Hao Lin, Guang-Hui Wang, Wen-Ling Zhou. Hypergraphs with a Quarter Uniform Turán Density. Journal of the Operations Research Society of China, 2025, ⟨10.1007/s40305-025-00619-7⟩. ⟨hal-05392096⟩
Nathalie Abadie, Ghislain Atemezing, Grégory Bonnet, Tristan Cazenave, Antoine Cornuéjols, et al.. Conférence Nationale d’Intelligence Artificielle Année 2025. Association Française pour l’Intelligence Artificielle, 2025. ⟨hal-05409313⟩
Tim Schneider, Céline Clavel, Gérard Kubryk, Michèle Gouiffès, Emmanuelle Frenoux, et al.. Triggering immersion in public spaces: A comparative study of interactive digital art installations. EuroXR 2025 – 22nd EuroXR International Conference, Sep 2025, Winterthur, Switzerland. pp.267-291, ⟨10.1007/978-3-032-03805-0_15⟩. ⟨hal-05409323⟩
Béatrice Albert, Emmanuelle Frenoux, Guillaume Junot, Gaële Misiak, Gérard Kubryk, et al.. The L∞p — A 360-degree immersive and interactive stage for collective space-time experiences. 12th International Conference on Digital and Interactive Arts: Media Art Cultures, Communities & Territories (ARTECH 2025), Nov 2025, Braga, Portugal. ⟨10.1145/3773699.3774364⟩. ⟨hal-05409683⟩
Tim Schneider, Gérard Kubryk, Vincent Hulot, Matthieu Courgeon, David Poirier-Quinot, et al.. The Eye of the Sun — A touch- and motion-sensitive, interactive, audio-visual sculpture combining curiosity and solar physics. 12th International Conference on Digital and Interactive Arts: Media Art Cultures, Communities & Territories (ARTECH 2025), Nov 2025, Braga, Portugal. ⟨10.1145/3773699.3773925⟩. ⟨hal-05409657⟩
Salim Khazem, Ludovic Arnould, Hugues Ali Mehenni. BYO-Eval: Build Your Own Dataset for Fine-Grained Visual Assessment of Multimodal Language Models. 2025. ⟨hal-05395052⟩
Romain Mussard, Aurélien Gauffre, Ihsan Ullah, Thanh Gia Hieu Khuong, Massih-Reza Amini, et al.. Stylized Meta-Album: Group-bias injection with style transfer to study robustness against distribution shifts. 2025. ⟨hal-05371736⟩
Marie Thérèse El Fakhry, Virginie Demulier, Gérard Uzan, Karine Gros. Co-conception d’outils numériques : illustration parmi des apprenants en situation de handicap et leurs formateurs. Technologies, Insertion, Handicap, Autonomie, Vieillissement, Jun 2025, Aubervilliers, France. pp.68-73. ⟨hal-05371267⟩
Miriam Bravo-Lopez, Eduardo Arrieta-Donato, Viridiana Villa-Islas, Åshild Joanne Vågene, Ana Villaseñor-Altamirano, et al.. Ancient genomic insights into Salmonella enterica Paratyphi C from Central Mexico. 2025. ⟨hal-05407381⟩
Luigi Marra, Onofrio Semeraro, Lionel Mathelin, Andrea Meilán-Vila, Stefano Discetti. Latent-Space Non-Linear Model Predictive Control for Partially-Observable Systems. 2025. ⟨hal-05394151⟩
Louis Ollivier, Brian Charlesworth, Fanny Pouyet. Beyond recombination: Exploring the impact of meiotic frequency on genome-wide genetic diversity. PLoS Genetics, 2025, 21 (8), pp.e1011798. ⟨10.1371/journal.pgen.1011798⟩. ⟨hal-05405132⟩
T. Dusserre, H. Dole, F. Sarron, G. Castignani, N. Ramos-Chernenko, et al.. Euclid Quick Data Release (Q1). The Euclid view on Planck galaxy protocluster candidates: towards a probe of the highest sites of star formation at cosmic noon. 2025. ⟨hal-05017542⟩
Onofrio Semeraro, Michele A Bucci, Remy Hosseinkhan-Boucher, Sergio Chibbaro, Alexandre Allauzen, et al.. On the use of entropy-based metrics for data-driven modeling and reinforcement learning control. Joint event Euromech Colloquium on Data-Driven Fluid Dynamics/2nd ERCOFTAC Workshop on Machine Learning for Fluid Dynamics, Apr 2025, Londres, United Kingdom. ⟨hal-05379611⟩
Yanis Ouakrim. Vers la traduction automatique de la Langue des Signes Française (LSF). Traitement du signal et de l’image [eess.SP]. Université Grenoble-Alpes (UGA), 2025. Français. ⟨NNT : ⟩. ⟨tel-05400716⟩
Hagit Attiya, Pierre Fraigniaud, AmiArchitectures et modèles pour l'Interaction Paz, Sergio Rajsbaum. On the Existence of Extension-Based Proofs of Impossibility for Set-Agreement. SIROCCO 2025 – Structural Information and Communication Complexity, Jun 2025, Delphi, Greece. pp.56-73, ⟨10.1007/978-3-031-91736-3_4⟩. ⟨hal-05403686⟩
Hagit Attiya, Pierre Fraigniaud, AmiArchitectures et modèles pour l'Interaction Paz, Sergio Rajsbaum. Solvability Characterization for General Three-Process Tasks. PODC ’25: ACM Symposium on Principles of Distributed Computing, Jun 2025, Huatulco, Mexico. pp.488-498, ⟨10.1145/3732772.3733548⟩. ⟨hal-05403672⟩
Onofrio Semeraro, Michele Alessandro Bucci, Lionel Mathelin, Luigi Marra, Amine Saibi. From robotics to fluid dynamics: opportunities and pitfalls of Reinforcement Learning in flow control. iTi Workshop on Structure and control of wall-bounded turbulent flows, Jul 2025, Bertinoro, Italy. ⟨hal-05379587⟩
Andrea Palumbo, Onofrio Semeraro, Luigi de Luca. Transition to turbulence in planar synthetic jets: numerical simulations and coherent structures eduction. Coherent structures and instabilities in transitional and turbulent wall-bounded flows, Euromech Colloquium 658, Sep 2025, Bari, Italy. ⟨hal-05379576⟩
Alice Delbosc, Nicolas Sabouret, Brian Ravenet, Stéphane Ayache, Magalie Ochs. Automatic objective metric for the optimization of nonverbal behavior generative models. IVA ’25: ACM International Conference on Intelligent Virtual Agents, Sep 2025, Berlin, Germany. pp.1-4, ⟨10.1145/3717511.3749302⟩. ⟨hal-05390112⟩
Nicolas Saint-Léger, Nicolas Férey, Joe Raad, Patrick Bourdot. An overview study on the use of Semantics in Immersive Environments. Journal on Multimodal User Interfaces, 2025, 19, pp.305-322. ⟨10.1007/s12193-025-00465-0⟩. ⟨hal-05383687⟩
Mackenzie Hisako Dalto, Charles Perin, Lora Oehlberg, Petra Isenberg, Sheelagh Carpendale, et al.. VisFutures. alt.vis workshop held at IEEE VIS 2023, 2023. ⟨hal-05361293⟩
Nathan Carbonneau, Julien Salort, Anne Sergent. Small coherent structures in rough turbulent convection. 27e Rencontre du Non-Linéaire, Mar 2024, Paris, France. ⟨hal-05349336⟩
Daniele Noto, Alexandre Allauzen, Sergio Chibbaro. An efficient training method to learn a model of turbulence. The European Physical Journal Plus, 2024, 139 (3), pp.298. ⟨10.1140/epjp/s13360-024-05056-8⟩. ⟨hal-05356011⟩
Michael Filhol. AZVD as a Sign Language writing system proxy, and the potential evolution. Proceedings of Grapholinguistics in the 21st century, Oct 2024, Venice, Italy. ⟨hal-05344585⟩
Bran Knowles, Vicki L Hanson, Christoph Becker, Mike Berners-Lee, Andrew A Chien, et al.. Climate Change: What is Computing’s Responsibility?. 2025, pp.1-18. ⟨10.4230/DagMan.11.1.1⟩. ⟨hal-05369257⟩
Radhouan Belgacem El Zrelli, Jessica K Klar, Sylvie Castet, Michel Grégoire, Pierre Courjault-Radé, et al.. Spatial Distribution Patterns, Eco-Environmental Risk Assessment, and Human Health Impacts of Uranium and Thorium in Beach Sediments in the Central Gulf of Gabes (Southern Mediterranean Sea). Sustainability, 2025, 17 (3), pp.1283. ⟨10.3390/su17031283⟩. ⟨hal-05381639⟩
Akash Malhotra, Nacéra Seghouani, Gilbert Badaro, Christophe Blaya. ConMax3D: Frame Selection for 3D Reconstruction Through Concept Maximization. 20th International Conference on Computer Vision Theory and Applications, Feb 2025, Porto, France. pp.598-609, ⟨10.5220/0013258800003912⟩. ⟨hal-05369558⟩
Akash Malhotra, Nacéra Seghouani, Ahmad Saiid, Alaa Almatuwa, Koumudi Ganepola. Rethinking Deblurring Strategies for 3D Reconstruction: Joint Optimization vs. Modular Approaches. 20th International Conference on Computer Vision Theory and Applications, Feb 2025, Porto, France. pp.816-823, ⟨10.5220/0013378800003912⟩. ⟨hal-05369551⟩
Nathan Carbonneau, Julien Salort, Yann Fraigneau, Anne Sergent. Effet de la suppression du vent sur la convection turbulente de Rayleigh-Bénard. 28e Rencontre du Non-Linéaire, Mar 2025, Paris, France. ⟨hal-05349347⟩
Nicolas Atienza, Johanne Cohen, Christophe Labreuche, Michèle Sébag. Provably safeguarding a classifier from OOD and adversarial samples. ICLR 2025 : International Conference on Representation Learning, Apr 2025, Singapore, Singapore. ⟨hal-05361525⟩
Quentin Le Tellier, Marc Evrard, Albert Rilliard, Jean-Sylvain Liénard. Impact de la parole expressive sur l’estimation de l’intensité vocale. CFA 2025 – 17e Congrès Français d’Acoustique, Société Française d’Acoustique (SFA), Apr 2025, Paris, France. ⟨hal-05365670⟩
Jingyi Sun, Nicolas Audibert, Yaru Wu, Martine Adda-Decker. Effets du Style de Parole et de la Durée sur le Ton Neutre en Mandarin. 17e Congrès Français d’Acoustique (CFA 2025), Apr 2025, Paris, France. , 2025. ⟨hal-05365563⟩
Jean-Sylvain Liénard, Albert Rilliard, Marc Evrard, Quentin Le Tellier. Variabilité du signal de parole en fonction de la Force de Voix en situation d’interaction orale. CFA 2025 – 17e Congrès Français d’Acoustique, Société Française d’Acoustique (SFA), Apr 2025, Paris, France. ⟨hal-05366097⟩
Catherine Weisman, Yann Fraigneau, Diana G. Baltean Carlès. Simulation numérique des effets d’inclinaison sur le pompage de chaleur thermoacoustique en cavité compacte. CFA 2025 – 17e Congrès Français d’Acoustique, Société Française d’Acoustique (SFA), Apr 2025, Paris, France. ⟨hal-05365998⟩
Nathan Carbonneau, Julien Salort, Yann Fraigneau, Anne Sergent. Effet de la suppression du vent sur l’émission de panaches en convection turbulente rugueuse. GdR Navier-Stokes 2.00, Jun 2025, Marseille, France. ⟨hal-05353144⟩
Nicolas Agier, Nina Vittorelli, Louis Ollivier, Frédéric Chaux, Alexandre Gillet-Markowska, et al.. A transient mutational burst occurs during yeast colony development. Molecular Systems Biology, 2025, 21 (9), pp.1214-1236. ⟨10.1038/s44320-025-00117-1⟩. ⟨hal-05389997⟩
Mizuki Akiyama, Christian Sandor, Yuki Igarashi. Pretraining Support for Cheerleading Stunts using Virtual Reality. SIGGRAPH Posters ’25: Special Interest Group on Computer Graphics and Interactive Techniques Conference Posters, Aug 2025, Vancouver, Canada. pp.1-3, ⟨10.1145/3721250.3743007⟩. ⟨hal-05358588⟩
Emil Larose, Franck Kerhervé, Yann Fraigneau, Bérengère Podvin, Chris Morton, et al.. Effects of oncoming flow conditions on the flow upstream of a forward-facing step subject to a thin boundary layer. International Journal of Heat and Fluid Flow, 2025, 115, pp.109866. ⟨10.1016/j.ijheatfluidflow.2025.109866⟩. ⟨hal-05366300⟩
Nathan Carbonneau, Julien Salort, Yann Fraigneau, Anne Sergent. Effect of the wind depletion on the turbulent Rayleigh-Bénard convection. 2nd European Fluid Dynamics Conference (EFDC2), Aug 2025, Dublin, Ireland. ⟨hal-05349321⟩
Amine Saibi, Lionel Mathelin, Onofrio Semeraro. A Multistep Reinforcement Learning Control of Shear Flows in Minimal Input–Output Plants Under Large Time-delays. Flow, Turbulence and Combustion, 2025, 115 (3), pp.1379-1402. ⟨10.1007/s10494-025-00697-w⟩. ⟨hal-05379450⟩
Hugo de Oliveira. Reinforcement Learning and Federated Learning-based Multi-Band Assignment for IoT Short Packet Communications. Networking and Internet Architecture [cs.NI]. Université Paris-Saclay; The Graduate University for Advanced Studies (Hayama (Japon) ; 1988-..), 2025. English. ⟨NNT : 2025UPASG046⟩. ⟨tel-05351603⟩
Fabrizio Nunnari, Cristina Luna Jiménez, Rosalee Wolfe, John Mcdonald, Michael Filhol, et al.. 9th Workshop on Sign Language Translation and Avatar Technologies (SLTAT 2025). 9th workshop on Sign Language Translation and Avatar Technologies (SLTAT), Sep 2025, Berlin, Germany. ⟨10.1145/3742886.3759656⟩. ⟨hal-05344671⟩
Emmanuella Martinod, Michael Filhol. Exclamation in French Sign Language through the AZee approach. Workshop Exclamatives in Sign Languages, Structures Formelles du Langage (SFL, CNRS and Paris 8 University); Laboratoire de Linguistique Formelle (LLF, CNRS and Paris Cité University); IRN Typologie à travers les modalités (CNRS), Sep 2025, Paris, France. ⟨hal-05344608⟩
Jennifer Hamet Bagnou. Différences interindividuelles sous-jacentes aux compétences sociales lors d’interaction avec autrui. Psychologie. Université Paris Saclay, 2025. Français. ⟨NNT : 2025UPASW005⟩. ⟨tel-05348791⟩
Albert Rilliard, João Antônio De Moraes, Donna Erickson, Marine Guerry, Angelika Hönemann, et al.. Cross-cultural dimensions organizing prosodic attitudes reception. Journal of Speech Sciences, 2025, 14, pp.e025012. ⟨10.20396/joss.v14i00.20379⟩. ⟨hal-05359361⟩
Thibault Fabacher, Erik-Andre Sauleau, Emmanuelle Arcay, Bineta Faye, Maxime Alter, et al.. Efficient extraction of medication information from clinical notes: an evaluation in 2 languages. Journal of the American Medical Informatics Association, 2025, pp.ocaf113. ⟨10.1093/jamia/ocaf113⟩. ⟨hal-05375038⟩
Jean-Baptiste Ly. Une approche fondée sur les déterminants de l’activité humaine pour la simulation multi-agent de l’activité des ménages en lien avec la consommation énergétique. Intelligence artificielle [cs.AI]. Université Paris-Saclay, 2025. Français. ⟨NNT : 2025UPASG058⟩. ⟨tel-05360764⟩
Eliott Pradeleix, Rémy Hosseinkhan-Boucher, Alena Shilova, Onofrio Semeraro, Lionel Mathelin. Learning Meets Differential Equations: From Theory to Applications Learning non-Markovian Dynamical Systems with Signature-based Encoders. ML-DE 2025 – 2nd Workshop on “Machine Learning Meets Differential Equations: From Theory to Applications”,, Oct 2025, Bologna, Italy. pp.1-25. ⟨hal-05379481⟩
David Doukhan, Anissa-Claire Adgharouamane, Marlène Coulomb-Gully, Simon Devauchelle, Benjamin Elie, et al.. Voyage dans le temps : des archives télévision et radio pour observer l’évolution des voix. Culture et recherche, 2025, 149, pp.104-107. ⟨hal-05373155⟩
Fanny Pouyet, Ferdinand Petit, Jérémy Guez, Léo Planche, Evelyne Heyer, et al.. Methods for inferring coalescent tree topologies from genomic data: a comparison based on the transmission of reproductive success. 2025. ⟨hal-05347078⟩
Ambre Assor, Michael Mcguffin, Arnaud Prouzeau, Pierre Dragicevic, Martin Hachet. Animated Transitions for Abstract and Concrete Immersive Visualizations: A Design Space and Experiment. VRST 2025 – 31st ACM Symposium on Virtual Reality Software and Technology, Nov 2025, Montreal, Canada. ⟨10.1145/3756884.3765974⟩. ⟨hal-05363857⟩
Elsa Denakpo, Nicolas Dias, Johann Pitout, Thierry Naas, Dylan Pillai, et al.. Imputing missing minimum inhibitory concentration (MIC) values for Pseudomonas aeruginosa strains with a Denoising AutoEncoder. 2025. ⟨hal-05361361⟩
Lyes Kahouadji, Mosayeb Shams, Debashis Panda, Abdullah M. Abdal, Seungwon Shin, et al.. The crown: Rolling splash. Physical Review Fluids, 2025, 10, pp.110511. ⟨10.1103/t2zw-4577⟩. ⟨hal-05375558⟩