SDD

Data Science

The Data Science department brings together four teams with recognized and complementary expertise, covering the modeling, collection, management, analysis and construction of data and knowledge (A&O, Bioinfo, LaHDAK, Rocs), making it possible to explore synergies between expertise related to data, learning and optimization, particularly in connection with the fields of bioinformatics, IoT and data graphs.

Digital traces of all human activities are now available in all fields, data that is often massive, heterogeneous, dynamic and of variable quality (the 4 V’s of Big Data: Volume, Variety, Velocity, Veracity). Their exploitation leads to the definition of a fourth scientific paradigm: the design and validation of hypotheses, theoretical models and algorithms, guided by the data and in interaction with domain experts. The Data Science department is interested in robustly addressing the challenges of the 4Vs, in terms of scaling up in the face of data volume and velocity, and resisting diversity and quality bias. These goals define new computational issues in storage, communication, analysis and processing optimization, data query and enrichment, knowledge discovery, and model learning.

With nearly 40 researchers and teacher-researchers, the Data Science department covers a broad spectrum of fundamental and application-related topics: databases, data mining, semantic web, knowledge representation, algorithms, combinatorics, stochastic and distributed optimization, statistical learning and neural networks, communication networks, simulation. It also has extensive expertise in interdisciplinary research and dialogue with experts in the application domains (particularly in biology, medicine, human and social sciences, and experimental physics), allowing privileged access to data of interest and to the evaluation of models and algorithms.

Coordination

Research Teams

Recent publications

  • Poster de conférence

    Nicolas Férey, Bastien Vincke, Mohamed Anis Ghaoui. Interface tangible modulaire, articulée et sans marqueur dédiée à la pédagogie et à la recherche en biologie moléculaire. Congrès du Groupe Graphisme et Modélisation Moléculaire, Sep 2021, Lille, France. , Actes du congrès du GGMM 2021, 2021. ⟨hal-04582042⟩

    VENISE

    Year of publication

    Available in free access

  • Pré-publication, Document de travail

    Leticia Rebollo Couto, Albert Rilliard. Variación pragmática, traducción audiovisual y estrategias conversacionales para el doblaje: léxico coloquial y palabras tabús – Anexos. 2024. ⟨hal-04578522⟩

    STL

    Year of publication

  • Article dans une revue

    Benoît Dabouis, V Boccara, Bernard Yannou. Vers un langage de conception d’UX − Proposition d’un lexique pour représenter l’expérience utilisateur conçue dans des activités de conception. Activités, 2024, 21-1, ⟨10.4000/activites.9374⟩. ⟨hal-04580505⟩

    CPUCognition Perception et Usages

    Year of publication

    Available in free access

  • Communication dans un congrès

    Victor Spitzer, Céline Gicquel, Evgeny Gurevsky, François Sanson. Day-ahead lot-sizing under uncertainty: An application to green hydrogen production. 8th International Symposium on Combinatorial Optimization (ISCO 2024), May 2024, La Laguna, Tenerife, Canary Islands, Spain. ⟨10.1007/978-3-031-60924-4_30⟩. ⟨hal-04580335⟩

    Year of publication

  • Communication dans un congrès

    Jean-François Nominé, Martine Garnier-Rizet. Rapport sur la septième session de la conférence Tralogy I: La disponibilité des ressources. Tralogy I. Métiers et technologies de la traduction : quelles convergences pour l’avenir ?, Mar 2011, Paris, France. 6 p. ⟨hal-02496072⟩

    Year of publication

    Available in free access

  • Proceedings/Recueil des communications

    Nicolas Froeliger, Joseph Mariani, Jean-François Nominé, Wallon Alain, Subra-Itsusutji Caroline. Métiers et technologies de la traduction : quelles convergences pour l’avenir ?. Tralogy I : Métiers et technologies de la traduction : quelles convergences pour l’avenir ?, 2014. ⟨hal-01371210⟩

    Year of publication

  • Ouvrages

    Nicolas Froeliger, Joseph Mariani, Wallon Alain, Meunier Mikaël, Durand-Fleischer Dominique (Dir.). Trouver le sens : où sont nos manques et nos besoins respectifs ? . INIST-CNRS, http://lodel.irevues.inist.fr/tralogy/index.php?id=188, 2015. ⟨hal-01371212⟩

    Year of publication

  • Article dans une revue

    Céline Loot, Gael A Millot, Egill Richard, Eloi Littner, Claire Vit, et al.. Integron cassettes commonly integrate into bacterial genomes via widespread non-classical attG sites. Nature Microbiology, 2024, 9 (1), pp.228-240. ⟨10.1038/s41564-023-01548-y⟩. ⟨pasteur-04384854⟩

    AO

    Year of publication

    Available in free access

  • Communication dans un congrès

    Rabab Alkhalifa, Hsuvas Borkakoty, Romain Deveaud, Alaa El-Ebshihy, Luis Espinosa-Anke, et al.. LongEval: Longitudinal Evaluation of Model Performance at CLEF 2024. Advances In Information Retrieval (ECIR 2024), Mar 2024, Glasgow (Ecosse), United Kingdom. pp.60-66, ⟨10.1007/978-3-031-56072-9_8⟩. ⟨hal-04577466⟩

    STL

    Year of publication

  • Thèse

    Wissal Sahel. Participatory design to support power grid operators in control rooms. Human-Computer Interaction [cs.HC]. Université Paris-Saclay, 2024. English. ⟨NNT : 2024UPASG022⟩. ⟨tel-04577446⟩

    Year of publication

    Available in free access

  • Communication dans un congrès

    Luc Lecointre, Sergey Kudriakov, Etienne Studer, Ronan Vicquelin, Christian Tenaud. High-order extension of Roe’s solver for compressible multicomponent real gas flows. 73rd Annual Meeting of the APS Division of Fluid Dynamics APS DFD 2020, Nov 2020, Chicago (virtual), United States. ⟨hal-03168791⟩

    Year of publication

  • Communication dans un congrès

    Luc Lecointre, Sergey Kudriakov, Etienne Studer, Ronan Vicquelin, Christian Tenaud. Numerical tools to study hydrogen flame acceleration. ECCOMAS, Jan 2021, Paris, France. ⟨hal-03168817⟩

    Year of publication

  • Communication dans un congrès

    Jiayi Cai, Pierre-Emmanuel Angeli, Jean-Marc Martinez, Guillaume Damblin, Didier Lucor. Reynolds stress anisotropy tensor predictions using neural networks. Journées Écoulements & Fluides à Saclay, Jun 2023, Saclay, France. ⟨cea-04467629⟩

    Year of publication

  • Proceedings/Recueil des communications

    Nicolas Sabouret. Actes des 28es Journées Francophones sur les Systèmes Multi-Agents : JFSMA 2020. Plate-Forme Intelligence Artificielle, Association Française pour l’Intelligence Artificielle, 2020. ⟨hal-04573992⟩

    CPUCognition Perception et Usages

    Year of publication

    Available in free access

  • Communication dans un congrès

    Gauthier Leclercq, François Lusseyran, Guillaume Dufour, Jean-Maxime Orlac’H. Étude expérimentale et numérique de l’effet thermique d’une décharge à barrière diélectrique Numerical and experimental study of the thermal effect of a dielectric barrier discharge. Journées scientifiques 2024 d’URSI-France, Mar 2024, Paris, France. pp.92-95. ⟨hal-04573776⟩

    DATAFLOT

    Year of publication

    Available in free access

  • Poster de conférence

    Emma Tison, Solène Delsuc, Claudia Krogmeier, Arnaud Prouzeau, Martin Hachet, et al.. Experiencing schizophrenia symptoms through augmented reality: from assessing students needs to prototyping a simulation. SIRS 2024 – Congress of the Schizophrenia International Research Society, Apr 2024, Florence, Italy. . ⟨hal-04574421⟩

    ILDA

    Year of publication

    Available in free access

  • Article dans une revue

    Boya Zhang, Nona Naderi, Rahul Mishra, Douglas Teodoro. Online Health Search Via Multidimensional Information Quality Assessment Based on Deep Language Models: Algorithm Development and Validation. JMIR AI, 2024, 3, pp.e42630. ⟨10.2196/42630⟩. ⟨hal-04574791⟩

    STL

    Year of publication

    Available in free access

  • Article dans une revue

    Hossein Rouhizadeh, Irina Nikishina, Anthony Yazdani, Alban Bornet, Boya Zhang, et al.. A Dataset for Evaluating Contextualized Representation of Biomedical Concepts in Language Models. Scientific Data , 2024, 11 (1), pp.455. ⟨10.1038/s41597-024-03317-w⟩. ⟨hal-04574786⟩

    STL

    Year of publication

    Available in free access

  • Communication dans un congrès

    Nemo Malhomme, Davide Faranda, Bérengère Podvin, Lionel Mathelin. Latent Dirichlet Allocation: a new machine learning tool to evaluate CMIP6 climate models atmospheric circulation. European Meteorological Society annual meeting, Sep 2022, Bonn, Germany. ⟨hal-04406521⟩

    Year of publication

  • Communication dans un congrès

    Mathurin Videau, Nickolai Knizev, Alessandro Leite, Marc Schoenauer, Olivier Teytaud. Interactive Latent Diffusion Model. GECCO 2023 – Genetic and Evolutionary Computation Conference, ACM SIGEVO, Jul 2023, Lisbon, Portugal. pp.586-596, ⟨10.1145/3583131.3590471⟩. ⟨hal-04570089⟩

    AO

    Year of publication

    Available in free access

  • Communication dans un congrès

    Philippe Rambaud, Adel Taleb, Raphael Fauches, Arpad Rimmel, Joanna Tomasik, et al.. Binary Classification vs. Anomaly Detection on Imbalanced Tabular Medical Datasets. 2023 Congress in Computer Science, Computer Engineering, & Applied Computing (CSCE), Jul 2023, Las Vegas, France. pp.01-05, ⟨10.1109/CSCE60160.2023.00220⟩. ⟨hal-04567598⟩

    Year of publication

  • Article dans une revue

    Adel Taleb, Philippe Rambaud, Samuel Diop, Raphaël Fauches, Joanna Tomasik, et al.. Spinal Muscular Atrophy Hypotonia Detection Using Computer Vision and Artificial Intelligence. Medicine Archives of Pediatrics & Adolescent – JAMA Pediatrics , 2024, ⟨10.1001/jamapediatrics.2024.0030⟩. ⟨hal-04567605⟩

    Year of publication

  • Proceedings/Recueil des communications

    Brian Ravenet. Actes des 21es Rencontres des Jeunes Chercheurs en Intelligence Artificielle : RJCIA 2023. Plate-Forme Intelligence Artificielle, Association Française pour l’Intelligence Artificielle, 2023. ⟨hal-04565426⟩

    CPUCognition Perception et Usages

    Year of publication

    Available in free access

  • Communication dans un congrès

    Adel Taleb, Philippe Rambaud, Samuel Diop, Awa Bakayoko, Audrey Benezit, et al.. Improve Pose Estimation Model Performance with Unlabeled Data. 2023 Congress in Computer Science, Computer Engineering, & Applied Computing (CSCE), Jul 2023, Las Vegas, France. pp.1316-1321, ⟨10.1109/CSCE60160.2023.00221⟩. ⟨hal-04566355⟩

    Year of publication

  • Article dans une revue

    Olivier Hudry, Ville Junnila, Antoine Lobstein. On Iiro Honkala’s contributions to identifying codes. Fundamenta Informaticae, In press. ⟨hal-04568130⟩

    GALaC

    Year of publication

  • Communication dans un congrès

    Alessandro Leite, Marc Schoenauer. Memetic Semantic Genetic Programming for Symbolic Regression. 26th EuroGP – Part of EvoStar 2023, Species Society, Apr 2023, Brno, Czech Republic. pp.198-212, ⟨10.1007/978-3-031-29573-7_13⟩. ⟨hal-04563511⟩

    AO

    Year of publication

    Available in free access

  • Article dans une revue

    V Boccara, Laurent van Belleghem, Marc-Eric Bobillier Chaumon, Yvon Haradji, Alexandre Morais, et al.. Introduction au dossier “Représenter l’activité”. Activités, 2024, 21-1, ⟨10.4000/activites.9728⟩. ⟨hal-04563940⟩

    CPUCognition Perception et Usages

    Year of publication

    Available in free access

  • Communication dans un congrès

    Maxime Fily, Guillaume Wisniewski, Séverine Guillaume, Gilles Adda, Alexis Michaud. Establishing degrees of closeness between audio recordings along different dimensions using large-scale cross-lingual models. Findings of the Association for Computational Linguistics: EACL 2024, Association for Computational Linguistics, Mar 2024, St. Julian’s, Malta. ⟨hal-04561819⟩

    STL

    Year of publication

    Available in free access

  • Article dans une revue

    Filip Novkoski, Jules Fillette, Chi-tuong Pham, Eric Falcon. Nonlinear dynamics of a hanging string with a freely pivoting attached mass. Physica D: Nonlinear Phenomena, 2024, 463, pp.134164. ⟨10.1016/j.physd.2024.134164⟩. ⟨hal-04560135⟩

    COMET

    Year of publication

    Available in free access

  • Pré-publication, Document de travail

    Ylène Aboulfath, Dimitri Watel, Marc-Antoine Weisser, Thierry Mautor, Dominique Barth. Maximizing minimum cycle bases intersection. 2024. ⟨hal-04559959⟩

    GALaC

    Year of publication

    Available in free access

  • N°spécial de revue/special issue

    Fatiha Saïs, Emmanuel Adam, Grégory Bonnet, Dominique Longin. PFIA 2023 : Plate-Forme Intelligence Artificielle. Bulletin de l’Association Française pour l’Intelligence Artificielle, 122, 2023, Association Française d’Intelligence Artificielle. ⟨hal-04559065⟩

    LaHDAK

    Year of publication

    Available in free access

  • Communication dans un congrès

    Hugo Boulanger, Nicolas Hiebel, Olivier Ferret, Karën Fort, Aurélie Névéol. Using Structured Health Information for Controlled Generation of Clinical Cases in French. The 6th Clinical Natural Language Processing Workshop At NAACL 2024 (ClinicalNLP 2024), Jun 2024, Mexico city, Mexico. ⟨hal-04558890⟩

    STL

    Year of publication

    Available in free access

  • Communication dans un congrès, Communication dans un congrès

    Jérémy Fix, Stéphane Vialle, Remi Hellequin, Claudine Mercier, Patrick Mercier, et al.. Feedback from a data center for education at CentraleSupélec engineering school. 2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), May 2022, LYON (Université Lyon 3), France. pp.330-337, ⟨10.1109/IPDPSW55747.2022.00065⟩. ⟨hal-04556247⟩

    ParSys

    Year of publication

  • Rapport

    Agnès Comte, Alexandra Grout, Anne Crance, Maxime Nebule, Romain Cassiaux, et al.. Guide de bonnes pratiques numérique responsable pour les organisations. Direction interministérielle du numérique (DINUM); MiNumEco, mission interministérielle numérique écoresponsable; Ministère de la Transition écologique et de la Cohésion des territoires; Institut du Numérique Responsable (INR); EcoInfo. 2023. ⟨hal-04556114⟩

    AMIArchitectures et modèles pour l'Interaction

    Year of publication

    Available in free access

  • Pré-publication, Document de travail

    Marion Ficher, Tom Bauer, Anne-Laure Ligozat. A comprehensive review of the end-of-life modeling in LCAs of digital equipment. 2024. ⟨hal-04555155⟩

    STL, STL

    Year of publication

    Available in free access

  • Rapport

    Cécile Cecconi, Françoise Leresche, Martine Voisin, Pierre Choffé, Jean Delahousse, et al.. Documentation du modèle DOREMUS (Version 1.1). Bibliothèque Nationale de France (Paris). 2024. ⟨hal-04554020⟩

    AVIZ

    Year of publication

    Available in free access

  • Communication dans un congrès

    Patrick Pamphile, Isabelle Bournaud, Celine Clavel. Identifier et comprendre les difficultés d’adaptation des primo entrantes à l’université : utilisation d’une méthode mixte quantitative-qualitative avec des méthodes statistiques d’apprentissage automatique. Diversité, Réussite[s] dans l’Enseignement Supérieur (2024), Nantes Université [Nantes Univ], Apr 2024, Nantes, France. ⟨hal-04557134⟩

    CPUCognition Perception et Usages

    Year of publication

    Available in free access

  • Communication dans un congrès

    Isabelle Bournaud, Magali Gallezot, Celine Clavel, Marie-Joëlle Ramage. Étonnements de primo-entrantes à l’université : nature et diversité. Diversité, Réussite[s] dans l’Enseignement Supérieur (2024), Nantes Université [Nantes Univ], Apr 2024, Nantes, France. ⟨hal-04557126⟩

    Year of publication

    Available in free access

  • Article dans une revue

    David Rei, Céline Clavel, Jean-Claude Martin, Brian Ravenet. Adapting goals and motivational messages on smartphones for motivation to walk. Smart Health, 2024, 32, pp.100482. ⟨10.1016/j.smhl.2024.100482⟩. ⟨hal-04556495⟩

  • Communication dans un congrès

    Christian Jacquemin, Georges Gagneré, Benoît Lahoz. Shedding light on shadow : Real-time interactive artworks based on cast shadows or silhouettes. MM ’11: ACM Multimedia Conference, Nov 2011, Scottsdale Arizona USA, United States. pp.173-182, ⟨10.1145/2072298.2072322⟩. ⟨hal-04553628⟩

    Year of publication

    Available in free access

  • Communication dans un congrès

    Nicolas Hiebel, Bertrand Remy, Bruno Guillaume, Olivier Ferret, Aurélie Névéol, et al.. Hostomytho: A GWAP for Synthetic Clinical Texts Evaluation and Annotation. Games and Natural Language Processing Workshop at LREC-COLING 2024, May 2024, Turin, Italy, May 2024, Turin (Italie), Italy. ⟨hal-04555052⟩

    STL

    Year of publication

    Available in free access

  • Communication dans un congrès

    Vida Dujmović, Robert Hickingbotham, Jędrzej Hodor, Gwenaël Joret, Hoang La, et al.. The Grid-Minor Theorem Revisited. SODA 2024 – 2024 Annual ACM-SIAM Symposium on Discrete Algorithms, Jan 2024, Westin Alexandria Old Town, United States. pp.1241-1245, ⟨10.1137/1.9781611977912.48⟩. ⟨hal-04553168⟩

    GALaC

    Year of publication

    Available in free access

  • Article dans une revue

    Marcin Briański, Jędrzej Hodor, Hoang La, Piotr Micek, Katzper Michno. Boolean Dimension of a Boolean Lattice. Order, 2024. ⟨hal-04553148⟩

    GALaC

    Year of publication

    Available in free access

  • Pré-publication, Document de travail

    Lech Duraj, Ross J. Kang, Hoang La, Jonathan Narboni, Filip Pokrývka, et al.. The $chi$-binding function of $d$-directional segment graphs. 2024. ⟨hal-04553176⟩

    GALaC

    Year of publication

    Available in free access

  • Thèse

    Jorgelindo da Veiga Moreira. Modélisation de la bascule métabolique chez les cellules eucaryotes : application à la production de citrate chez la levure Yarrowia lipolytica. Biotechnologie. Université Paris Saclay (COmUE), 2019. Français. ⟨NNT : 2019SACLX015⟩. ⟨tel-02320659⟩

    BioInfo

    Year of publication

    Available in free access

  • Thèse

    Hugues Mandon. Algorithmes pour la prédiction de stratégies de reprogrammation cellulaire dans les réseaux Booléens.. Algorithme et structure de données [cs.DS]. Université Paris Saclay (COmUE), 2019. Français. ⟨NNT : 2019SACLN060⟩. ⟨tel-02513383v2⟩

    BioInfo

    Year of publication

    Available in free access

  • Thèse

    Oralie Cattan. Systèmes de questions-réponses interactifs à grande échelle. Informatique [cs]. Université Paris-Saclay (2020-..), 2022. Français. ⟨NNT : ⟩. ⟨tel-04551072⟩

    STL

    Year of publication

  • Article dans une revue

    Thibaut Couchoux, Tristan Jaouen, Christelle Melodelima-Gonindard, Pierre Baseilhac, Arthur Branchu, et al.. Performance of a Region of Interest–based Algorithm in Diagnosing International Society of Urological Pathology Grade Group ≥2 Prostate Cancer on the MRI-FIRST Database—CAD-FIRST Study. European Urology Oncology, 2024, S2588-9311 (24), pp.00056-7. ⟨10.1016/j.euo.2024.03.003⟩. ⟨hal-04549487⟩

    COMET

    Year of publication

    Available in free access

  • Thèse

    Camille Gobert. Projecting Computer Languages for a Protean Interaction. Human-Computer Interaction [cs.HC]. Université Paris-Saclay, 2024. English. ⟨NNT : 2024UPASG019⟩. ⟨tel-04551620⟩

    EX-SITU

    Year of publication

    Available in free access

  • Communication dans un congrès

    Bruno Guillaume, Kim Gerdes, Kirian Guiller, Sylvain Kahane, Yixuan Li. Joint Annotation of Morphology and Syntax in Dependency Treebanks. The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING), May 2024, Turino, Italy. ⟨hal-04550108⟩

    Year of publication

    Available in free access

  • HDR

    Paola Tubaro. Décrypter la société des plateformes : Organisations, marchés et réseaux dans l’économie numérique. Sociology. Institut d’Etudes Politiques de Paris, 2019. ⟨tel-04547405⟩

    AO

    Year of publication

    Available in free access

  • Pré-publication, Document de travail

    Chris I. Juric, Damir Juric. Fluid Dynamics Simulation on a GPU. 2024. ⟨hal-04545934⟩

    COMET

    Year of publication

    Available in free access

  • Rapport

    Alexander Kempf, Marc Taylor, Bernhard Kühn, Elliot Brown, Vanessa Trijoulet, et al.. SEAwise Report on consistency of existing targets and limits for indicators in an ecosystem context. Institut Agro – Rennes Angers; Technical University of Denmark. 2023, pp.245 P. ⟨hal-04495754⟩

    CPUCognition Perception et Usages

    Year of publication

    Available in free access

  • Thèse

    David Rei. Interactions Humain-Machine Adaptées à la Personnalité des Utilisateurs : Application de Motivation à l’Activité Physique. Informatique. Université Paris-Saclay, 2024. Français. ⟨NNT : 2024UPASG012⟩. ⟨tel-04538519v2⟩

    CPUCognition Perception et Usages

    Year of publication

    Available in free access

  • Pré-publication, Document de travail

    Pierre Barbault, Matthieu Kowalski, Charles Soussen. LEMUR: Latent EM Unsupervised Regression for Sparse Inverse Problems. 2024. ⟨hal-04542061⟩

    AO

    Year of publication

    Available in free access

  • Article dans une revue

    Sibo Cheng, César Quilodrán-Casas, Said Ouala, Alban Farchi, Che Liu, et al.. Machine learning with data assimilation and uncertainty quantification for dynamical systems: a review. IEEE/CAA Journal of Automatica Sinica, In press, 10 (6), pp.1361-1387. ⟨10.1109/JAS.2023.123537⟩. ⟨hal-04039094⟩

    DATAFLOT

    Year of publication

    Available in free access

  • Poster de conférence

    George Marchment, Marie Schmit, Clémence Sebe, Frédéric Lemoine, Hervé Ménager, et al.. Representing bioinformatics Nextflow workflows in RO-Crate : challenges and opportunities. Semantic Web Applications and Tools for Health Care and Life Sciences, Feb 2024, Leiden, Netherlands. Zenodo, 2024, ⟨10.5281/zenodo.10822156⟩. ⟨hal-04540040⟩

    Year of publication

    Available in free access

  • Article dans une revue

    Luma da Silva Miranda, João Antônio de Moraes, Albert Rilliard. Visual channel facilitates the comprehension of the intonation of Brazilian Portuguese wh-questions and wh-exclamations: evidence from congruent and incongruent stimuli. Language and Cognition, 2024, pp.1-21. ⟨10.1017/langcog.2024.16⟩. ⟨hal-04538371⟩

    STL

    Year of publication

    Available in free access

  • Pré-publication, Document de travail

    Mathilde Aguiar, Pierre Zweigenbaum, Nona Naderi. SEME at SemEval-2024 Task 2: Comparing Masked and Generative Language Models on Natural Language Inference for Clinical Trials. 2024. ⟨hal-04536273⟩

    STL

    Year of publication

    Available in free access

  • Communication dans un congrès

    Vincent Segonne, Aidan Mannion, Laura Cristina Alonzo Canul, Alexandre Audibert, Xingyu Liu, et al.. Jargon: A Suite of Language Models and Evaluation Tasks for French Specialized Domains. LREC-COLING 2024 – Joint International Conference on Computational Linguistics, Language Resources and Evaluation, May 2024, Turin, Italy. ⟨hal-04535557⟩

    Year of publication

    Available in free access

  • Communication dans un congrès

    Djegdjiga Amazouz, Martine-Adda Decker, Lori Lamel. Variation du voisement des occlusives orales en code-switching: analyses par ABX automatique et mesures acoustiques. Journées d’Études sur la Parole – JEP2022, Jun 2022, Noirmoutier, France. ⟨hal-03703081⟩

    STL

    Year of publication

    Available in free access

  • Communication dans un congrès

    Behnoosh Mohammadzadeh, Jules Françoise, Michèle Gouiffès, Baptiste Caramiaux. Studying Collaborative Interactive Machine Teaching in Image Classification. IUI ’24: 29th International Conference on Intelligent User Interfaces, Mar 2024, Greenville SC USA, United States. pp.195-208, ⟨10.1145/3640543.3645204⟩. ⟨hal-04535375⟩

    Year of publication

    Available in free access

  • Pré-publication, Document de travail

    Romain Egele, Felix Mohr, Tom Viering, Prasanna Balaprakash. The Unreasonable Effectiveness Of Early Discarding After One Epoch In Neural Network Hyperparameter Optimization. 2024. ⟨hal-04537565⟩

    AO

    Year of publication

    Available in free access

  • Pré-publication, Document de travail

    Mathilde Aguiar, Pierre Zweigenbaum, Nona Naderi. SEME at SemEval-2024 Task 2: Comparing Masked and Generative Language Models on Natural Language Inference for Clinical Trials. 2024. ⟨hal-04536600⟩

    STL

    Year of publication

  • Pré-publication, Document de travail

    Sylvain Chevallier, Igor Carrara, Bruno Aristimunha, Pierre Guetschel, Sara Sedlar, et al.. The largest EEG-based BCI reproducibility study for open science: the MOABB benchmark. 2024. ⟨hal-04537061⟩

    AO, ParSys

    Year of publication

    Available in free access

  • Communication dans un congrès

    Karën Fort, Laura Alonso Alemany, Luciana Benotti, Julien Bezançon, Claudia Borg, et al.. Your Stereotypical Mileage may Vary: Practical Challenges of Evaluating Biases in Multiple Languages and Cultural Contexts. The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, May 2024, Turin (Italie), Italy. ⟨hal-04537096⟩

    STL

    Year of publication

    Available in free access

  • Communication dans un congrès

    Armita Khajeh Nassiri, Nathalie Pernelle, Fatiha Saïs, Gianluca Quercini. RE-miner for data linking results for OAEI 2020. Ontology Matching Workshop at ISWC 2020, Nov 2020, Athens, France. ⟨hal-04537965⟩

    LaHDAK

    Year of publication

  • Communication dans un congrès

    Paul Lerner, Cyril Grouin. INCLURE: a Dataset and Toolkit for Inclusive French Translation. The 17th Workshop on Building and Using Comparable Corpora (BUCC @ LREC 2024), 2024, Turin, Italy. ⟨hal-04531938⟩

    STL

    Year of publication

    Available in free access

  • Proceedings/Recueil des communications

    Karën Fort, Aurélie Névéol. Ethics and NLP: 10 years after. Journée d’études ATALA “éthique et TALTraitement Automatique des langues : 10 ans après”, 2024. ⟨hal-04533870⟩

    STL

    Year of publication

    Available in free access

  • Thèse

    Bin Wang. Rainbow structures in properly edge-colored graphs and hypergraph systems. Combinatorics [math.CO]. Université Paris-Saclay; Shandong University (Jinan, Chine), 2024. English. ⟨NNT : 2024UPASG016⟩. ⟨tel-04534170⟩

    Year of publication

    Available in free access

  • Poster de conférence

    Filippo Gatti, Fanny Lehmann, Hugo Gabrielidis, Michaël Bertin, Didier Clouteau, et al.. Deep learning generative strategies to enhance 3D physics-based seismic wave propagation: from diffusive super-resolution to 3D Fourier Neural Operators.. European Geophysical Union General Assembly 2024, Apr 2024, Vienna, Austria. 2024, ⟨10.5194/egusphere-egu24-2443⟩. ⟨hal-04534286⟩

    ParSys

    Year of publication

  • Communication dans un congrès

    Paul Lerner, Olivier Ferret, Camille Guinaudeau. Cross-modal Retrieval for Knowledge-based Visual Question Answering. 46th European Conference on Information Retrieval (ECIR 2024), 2024, Glasgow, United Kingdom. ⟨hal-04384431⟩

    STL

    Year of publication

    Available in free access

  • Communication dans un congrès

    Hugo Gabrielidis, Filippo Gatti, Stéphane Vialle. Génération conditionnelle et inconditionnelle de signaux sismiques à l’aide de modèles de diffusion.. CSMA 2024 16ème Colloque National en Calcul des Structures, Association Calcul des Structures et Modélisation (CSMA), May 2024, Presqu’île de Giens (Var) Giens (Var), France. ⟨hal-04531795⟩

    ParSys

    Year of publication

    Available in free access

  • Communication dans un congrès

    Tomohiro Nishiyama, Lisa Raithel, Roland Roller, Pierre Zweigenbaum, Eiji Aramaki. Assessing Authenticity and Anonymity of Synthetic User-generated Content in the Medical Domain. Workshop on Computational Approaches to Language Data Pseudonymization (CALD-pseudo), Mar 2024, St. Julian’s, Malta. pp.8-17. ⟨hal-04528240⟩

    STL

    Year of publication

    Available in free access

  • Communication dans un congrès

    Djegdjiga Amazouz, Martine Adda-Decker, Lori Lamel, Jean-Luc Gauvain. EXPLORING CONSONANTAL VARIATION IN FRENCH-ARABIC CODE SWITCHING SPEECH: THE CASE OF GEMINATION. Proceedings of the 19th International Congress of Phonetic Sciences, Melbourne, Australia 2019, Sasha Calhoun, Paola Escudero, Marija Tabain & Paul Warren (eds.), Aug 2019, Melbourne, Australia. ⟨hal-04522640⟩

    Year of publication

    Available in free access

  • Thèse

    Alban Petit. Structured prediction methods for semantic parsing. Computation and Language [cs.CL]. Université Paris-Saclay, 2024. English. ⟨NNT : 2024UPASG002⟩. ⟨tel-04527227⟩

    Year of publication

    Available in free access

  • Communication dans un congrès

    Anne-Flore Cabouat, Tingying He, Florent Cabric, Tobias Isenberg, Petra Isenberg. Position paper: A case to study the relationship between data visualization readability and visualization literacy. CHI 2024 – Workshop Toward a More Comprehensive Understanding of Visualization Literacy, May 2024, O’ahu (Honolulu), United States. ⟨hal-04523790v2⟩

    AVIZ

    Year of publication

    Available in free access

  • Communication dans un congrès

    Nadège Alavoine, Gaëlle Laperriere, Christophe Servan, Sahar Ghannay, Sophie Rosset. New Semantic Task for the French Spoken Language Understanding MEDIA Benchmark. The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), May 2024, Torino, Italy. ⟨hal-04523286⟩

    STL

    Year of publication

    Available in free access

  • Communication dans un congrès

    Nesrine Bannour, Christophe Servan, Aurélie Névéol, Xavier Tannier. A Benchmark Evaluation of Clinical Named Entity Recognition in French. The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), May 2024, Torino, Italy. ⟨hal-04523267⟩

    STL

    Year of publication

    Available in free access

  • Communication dans un congrès

    Alexis Kauffmann, Arnaud Guével, Colin de La Higuera, Cendrine Mercier, David Leray. Participation à la table ronde : Pourquoi soutenir les communs numériques dans l’éducation ?. Journées des Libertés Numériques, Mar 2024, Nantes (France), France. https://mediaserver.univ-nantes.fr/videos/pourquoi-soutenir-les-communs-numeriques-dans-leducation/. ⟨hal-04518341⟩

    Year of publication

  • Communication dans un congrès

    Christophe Servan, Sahar Ghannay, Sophie Rosset. mALBERT: Is a Compact Multilingual BERT Model Still Worth It?. The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, May 2024, Torino, Italy. ⟨hal-04520797⟩

    STL

    Year of publication

    Available in free access

  • Communication dans un congrès

    L. Kahouadji, Laurent Martin Witkowski, J.S. Walker. Effet de la rotation sur les instabilités thermocapillaires dans un pont liquide chauffé latéralement. CFM 2007 – 18ème Congrès Français de Mécanique, Aug 2007, Grenoble, France. ⟨hal-03361758⟩

    Year of publication

    Available in free access

  • Pré-publication, Document de travail

    Marc Baboulin, Simplice Donfack, Oguz Kaya, Theo Mary, Matthieu Robeyns. Mixed precision randomized low-rank approximation with GPU tensor cores. 2024. ⟨hal-04520893⟩

    ParSys

    Year of publication

    Available in free access

  • Poster de conférence

    Hui Yang, Mostepha Redouane Khouadjia, Nacéra Bennacer Seghouani, Yue Ma, Serge Delmas. Explainable Anomaly Detection for Context Semantic Awareness. Workshop HyCHA (Hybridation Connaissances, Humain et Apprentissage Statistique), Mar 2024, Gif sur Yvette, France. ⟨hal-04521991⟩

    LaHDAK, LaHDAK

    Year of publication

    Available in free access

  • Communication dans un congrès

    Aaron Boussidan, Fanny Ducel, Aurélie Névéol, Karën Fort. What ChatGPT tells us about ourselves. Journée d’étude Éthique et TALTraitement Automatique des langues 2024, Apr 2024, Nancy, France. ⟨hal-04521121⟩

    STL

    Year of publication

    Available in free access

  • Proceedings/Recueil des communications

    Pierre Liardet, Pierre Collet, Cyril Fonlupt, Evelyne Lutton, Marc Schoenauer. Artificial Evolution: 6th International Conference, Evolution Artificielle, EA 2003, Marseilles, France, October 27-30, 2003, Revised Selected Papers. 6th International Conference, Evolution Artificielle, EA 2003, 2936, Springer, pp.1-409, 2004, LNCS 2936, 3540215239. ⟨10.1007/b96080⟩. ⟨inria-00000846⟩

    AO

    Year of publication

    Available in free access

  • Article dans une revue

    Anne Sergent, Patrick Le Quéré. Long time evolution of large-scale patterns in a rectangular Rayleigh-Bénard cell. Journal of Physics: Conference Series, 2011, 318 (8), pp.082010. ⟨10.1088/1742-6596/318/8/082010⟩. ⟨hal-04519827⟩

    COMET

    Year of publication

    Available in free access

  • Communication dans un congrès

    Thierry Hamon, Natalia Grabar. Automatic Prediction of Semantic Labels for French Medical Terms. Medical Informatics Europe conference (MIE2022), May 2022, Nice, France. pp.868-869, ⟨10.3233/SHTI220610⟩. ⟨hal-04519905⟩

    STL

    Year of publication

    Available in free access

  • Communication dans un congrès

    Pierre Lepagnol, Thomas Gerald, Sahar Ghannay, Christophe Servan, Sophie Rosset. Small Language Models are Good Too: An Empirical Study of Zero-Shot Classification. LREC-COLING 2024, May 2024, TURIN, Italy. ⟨hal-04519930v2⟩

    ILES, ILES, STL

    Year of publication

    Available in free access

  • Communication dans un congrès

    Michèle Sebag, Marc Schoenauer, Caroline Ravisé. An induction-based control for genetic algorithms. Evolution Artificielle ’95, 1996, Brest, France. pp.100-119, ⟨10.1007/3-540-59286-5_85⟩. ⟨hal-00116438⟩

    Year of publication

    Available in free access

  • Article dans une revue

    Mohamed Yassine Tsalamlal, Michel-Ange Amorim, Jean-Claude Martin, Mehdi Ammi. Combining Facial Expression and Touch for Perceiving Emotional Valence. IEEE Transactions on Affective Computing, 2018, 9 (4), pp.437-449. ⟨10.1109/TAFFC.2016.2631469⟩. ⟨hal-04518968⟩

    Year of publication

  • Thèse

    Loris Felardos. Data-free Generation of Molecular Configurations with Normalizing Flows. Machine Learning [cs.LG]. Université Grenoble Alpes [2020-..], 2022. English. ⟨NNT : 2022GRALM026⟩. ⟨tel-04010123⟩

    Year of publication

    Available in free access

  • Communication dans un congrès

    Mérième Bouhandi, Emmanuel Morin, Thierry Hamon. Graph Neural Networks for Adapting Off-the-shelf General Domain Language Models to Low-Resource Specialised Domains. 2nd Workshop on Deep Learning on Graphs for Natural Language Processing (DLG4NLP 2022), ACL, Jul 2022, Seattle, Washington, United States. pp.36-42, ⟨10.18653/v1/2022.dlg4nlp-1.5⟩. ⟨hal-04517190⟩

    STL

    Year of publication

    Available in free access

  • Communication dans un congrès

    Maroua Bahri, Silviu Maniu, Albert Bifet. A Sketch-Based Naive Bayes Algorithms for Evolving Data Streams. 2018 IEEE International Conference on Big Data (Big Data), Dec 2018, Seattle, United States. pp.604-613, ⟨10.1109/BigData.2018.8622178⟩. ⟨hal-04507533⟩

    LaHDAK

    Year of publication

    Available in free access

  • Poster de conférence

    Elise Lincker, Léa Pacini, Olivier Pons, Camille Guinaudeau, Jérôme Dupire, et al.. MALIN : MAnuels scoLaires INclusifs : Accessibilité numérique des manuels scolaires. Colloque Handiversité 2023 – L’innovation pour le partage, Apr 2023, Gif-sur-Yvette, France. ⟨hal-04410349⟩

    STL

    Year of publication

    Available in free access

  • Article dans une revue

    Gustave Cortal. Automatisation du codage des personnages et de leurs émotions dans les récits de rêves avec des modèles de langue. Revue TALTraitement Automatique des langues : traitement automatique des langues, 2024, 65 (1), pp.11-35. ⟨hal-04512857⟩

    Year of publication

    Available in free access

  • Pré-publication, Document de travail

    Gustave Cortal. Sequence-to-Sequence Language Models for Character and Emotion Detection in Dream Narratives. 2024. ⟨hal-04512803⟩

    Year of publication

    Available in free access

  • Thèse

    Lisa Raithel. Cross-lingual Information Extraction for the Assessment and Prevention of Adverse Drug Reactions. Document and Text Processing. Université Paris-Saclay; Technische Universität (Berlin), 2024. English. ⟨NNT : 2024UPASG011⟩. ⟨tel-04513068⟩

    Year of publication

    Available in free access

  • Article dans une revue

    Angèle Gayet-Ageron, Khaoula Ben Messaoud, Mark Oliver Richards, Cyril Jaksic, Julien Gobeill, et al.. Assessment of gender and geographical bias in the editorial decision-making process of biomedical journals: A Case-Control study.. Medrxiv : the Preprint Server For Health Sciences, 2024, ⟨10.1101/2024.03.15.24304220⟩. ⟨hal-04510221⟩

    STL

    Year of publication

    Available in free access