STL

Human Language Science and Technology

Coordination: Aurélie NEVEOL

The Department of Language Sciences and Technologies studies fundamental questions relating to linguistic systems by exploiting large corpora collected, annotated and enriched in an unsupervised or semi-supervised way by statistical learning models adapted to the linguistic material.

These models make it possible to study how languages function, their variations (phonetic-phonological, morphological-lexical, syntactic and semantic), both synchronic and diachronic, diaphasic and diatopic, and to raise questions about their acquisition as mother tongues or second languages. Finally, the department is developing major applications in language processing: speech recognition, automatic translation, information retrieval, conversational agents, etc. … which are increasingly important for society (safeguarding endangered languages, providing tools for people with disabilities, helping to process information and medical knowledge) and for ethics.

This approach to language and languages covers a broad spectrum, from the most fundamental to the most applied research, in a wide variety of media (newspapers, social media, video, telephone, . . .) and all modalities (written, spoken and signed).

This research is highly multidisciplinary, bringing together diverse communities from the fields of computer science, engineering and the humanities.

Recent Publications

Communication dans un congrès

Ana Manzano Rodríguez, Camille Guinaudeau, Shin Ichi Satoh. Uncovering Gender Biases in Gender Identification Models for Japanese Data Analysis. Workshop on Demographic Diversity in Computer Vision @ CVPR 2025, Jun 2025, Nashville (Tennessee), United States. ⟨hal-05154054⟩

STL

Year of publication 2025

Available in free access

HAL publication
Thèse

Jiahui Hu. Granular Insights into Financial Discourse : Fine-Grained Opinion Analysis of Expert Texts. Document and Text Processing. Université Paris-Saclay, 2023. English. ⟨NNT : 2023UPASG110⟩. ⟨tel-05153905⟩

AO, STL

Year of publication 2023

Available in free access

HAL publication
Article dans une revue

Philippe Boula de Mareüil, Marc Evrard, Alexandre François, Antonio Romano. Computer modelling of innovations relative to Latin in contemporary Romance dialects. Isogloss. Open Journal of Romance Linguistics, 2025, 11 (3), pp.1 – 31. ⟨10.5565/rev/isogloss.423⟩. ⟨hal-05144863⟩

STL

Year of publication 2025

Available in free access

HAL publication
Article dans une revue

Anne Baillot, Anne-Laure Ligozat. Introduction. Sobriété numérique. Humanités numériques, 2025, 11, ⟨10.4000/1498x⟩. ⟨hal-05143071⟩

STL

Year of publication 2025

HAL publication
Communication dans un congrès

Pierre Lepagnol, Sahar Ghannay, Thomas Gerald, Christophe Servan, Sophie Rosset. Leveraging Information Retrieval to Enhance Spoken Language Understanding Prompts in Few-Shot Learning. Interpseech 2025, Aug 2025, Rotterdam, Netherlands. ⟨hal-05095796⟩

STL, STL

Year of publication 2025

Available in free access

HAL publication
Article dans une revue

Agata Savary. NLP-based Study of Universals of Linguistic Idiosyncrasy. Dagstuhl Reports, 2023, 13 (5), pp.64-67. ⟨hal-04323075⟩

ILES, STL

Year of publication 2023

HAL publication
Thèse

Mathieu Laï-King. Qualité des articles de recherche et modèles de langue neuronaux : applications au domaine biomédical. Intelligence artificielle [cs.AI]. Université Paris-Saclay, 2025. Français. ⟨NNT : 2025UPASG031⟩. ⟨tel-05079724⟩

STL

Year of publication 2025

Available in free access

HAL publication
Pré-publication, Document de travail

Clément Morand, Anne-Laure Ligozat, Aurélie Névéol. Characterizing Goals and Impacts of Digitalization: The Case of Promises in French Healthcare Policies. 2025. ⟨hal-05066176⟩

STL

Year of publication 2025

Available in free access

HAL publication
Communication dans un congrès

Luc Mottin, Julien Gobeill, Jeevanthi Liyana Pathirana, Nona Naderi, Anaïs Mottaz, et al.. Manuscript Classification to Support the Analysis of Biases in Publication Opportunities. The 35th Medical Informatics Europe Conference, May 2025, Glagow, United Kingdom. ⟨10.3233/SHTI250475⟩. ⟨hal-05070636⟩

STL

Year of publication 2025

HAL publication
Rapport

Karin Dassas, Cyrille Bonamy, Bruno Bzeznik, Romaric David, Emmanuelle Frenoux, et al.. Estimer l’impact carbone des activités numériques de l’Observatoire de Paris. EcoInfo. 2025, pp.1-47. ⟨hal-05068666⟩

STL

Year of publication 2025

Available in free access

HAL publication
Article dans une revue

Nicolas Hiebel, Olivier Ferret, Karën Fort, Aurélie Névéol. Clinical text generation: Are we there yet?. Annual Review of Biomedical Data Science, 2025, 8, ⟨10.1146/annurev-biodatasci-103123-095202⟩. ⟨hal-05055957⟩

STL

Year of publication 2025

Available in free access

HAL publication
Article dans une revue

Arezoo Saedi, Afsaneh Fatemi, Mohammad Ali Nematbakhsh, Sophie Rosset, Anne Vilnat. Entity search based on consumer preferences leveraging user reviews. Expert Systems with Applications, 2025, 275, pp.126990. ⟨10.1016/j.eswa.2025.126990⟩. ⟨hal-05047109⟩

STL

Year of publication 2025

Available in free access

HAL publication
Communication dans un congrès

Foucauld Estignard, Sahar Ghannay, Julien Girard-Satabin, Nicolas Hiebel, Aurélie Névéol. Evaluating the Confidentiality of Synthetic Clinical Texts Generated by Language Models. 23rd International Conference on Artificial Intelligence in Medicine (AIME), Jun 2025, Pavie, Italy. ⟨hal-05046326v2⟩

STL

Year of publication 2025

Available in free access

HAL publication
Communication dans un congrès

Lisa Raithel, Philippe Thomas, Bhuvanesh Verma, Roland Roller, Hui-Syuan Yeh, et al.. Overview of #SMM4H 2024 – Task 2: Cross-Lingual Few-Shot Relation Extraction for Pharmacovigilance in French, German, and Japanese. The 9th Social Media Mining for Health Research and Applications (SMM4H 2024) Workshop and Shared Tasks, Association for Computational Linguistics, Aug 2024, Bangkok, Thailand. pp.170-182. ⟨hal-04781015⟩

STL

Year of publication 2024

Available in free access

HAL publication
Pré-publication, Document de travail

Mathilde Aguiar, Pierre Zweigenbaum, Nona Naderi. Am I eligible? Natural Language Inference for Clinical Trial Patient Recruitment: the Patient’s Point of View. 2025. ⟨hal-04992084⟩

STL

Year of publication 2025

Available in free access

HAL publication
Chapitre d'ouvrage

Mathieu Constant, Marie Candito, Yannick Parmentier, Carlos Ramisch, Agata Savary. Construction, exploitation et exploration de ressources linguistiques pour le traitement automatique des expressions polylexicales en français : le projet PARSEME-FR. Lidia Becker; Julia Kuhn; Christina Ossenkop; Claudia Polzin-Haumann; Elton Prifti. Digitale romanistische Sprachwissenschaft: Stand und Perspektiven, Narr Francke Attempto Verlag GmbH + Co. KG, pp.219-250, 2023, Romanistisches Kolloquium, 978-3-8233-8506-6. ⟨hal-04995189⟩

ILES, STL

Year of publication 2023

HAL publication
Thèse

Rémi Uro. Détection et caractérisation des interruptions dans les interactions orales pour la description du comportement des femmes et des hommes dans les contenus audiovisuels. Informatique et langage [cs.CL]. Université Paris-Saclay, 2024. Français. ⟨NNT : 2024UPASG055⟩. ⟨tel-04994439⟩

STL, STL

Year of publication 2024

Available in free access

HAL publication
Communication dans un congrès

Amel Fraisse, Patrick Paroubek, Ramit Goyal, Nassreddine Znaidi. Measuring Multilingualism in Online Public Access Catalogs. The ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL), Dec 2024, Hong Kong, China. ⟨10.1145/3677389.3702544⟩. ⟨hal-04986773⟩

ILES, STL

Year of publication 2024

HAL publication
Communication dans un congrès

Manon Scholivet, Agata Savary, Louis Estève, Marie Candito, Carlos Ramisch. SELEXINI – a large and diverse automatically parsed corpus of French. Building and Using Comparable Corpora (BUCC), Jan 2025, Abu DHABI, United Arab Emirates. ⟨hal-04978746⟩

ILES, STL

Year of publication 2025

Available in free access

HAL publication
Thèse

Hui-Syuan Yeh. Prompt-based Relation Extraction for Pharmacovigilance. Computation and Language [cs.CL]. Université Paris-Saclay, 2024. English. ⟨NNT : 2024UPASG097⟩. ⟨tel-04968043⟩

STL, STL

Year of publication 2024

Available in free access

HAL publication
Rapport

Sylvain Bouveret, Aurélie Bugeau, Frenoux Emmanuelle, Julien Lefevre, Laurent Lefèvre, et al.. Quiz sur les impacts environnementaux du numérique. EcoInfo. 2025, pp.1-5. ⟨hal-04960328v2⟩

STL

Year of publication 2025

Available in free access

HAL publication
Thèse

Camille Challant. Représentation formelle avec AZee et contraintes grammaticales pour la langue des signes française. Théorie et langage formel [cs.FL]. Université Paris-Saclay, 2024. Français. ⟨NNT : 2024UPASG086⟩. ⟨tel-04957486⟩

STL, STL

Year of publication 2024

Available in free access

HAL publication
Article dans une revue

Zheng Zhang, Brian Denton, Xiaolan Xie. Branch and Price for Chance-Constrained Bin Packing. INFORMS Journal on Computing, 2020, 32 (3), pp.547-564. ⟨10.1287/ijoc.2019.0894⟩. ⟨hal-04941861⟩

ILES, STL

Year of publication 2020

HAL publication
Communication dans un congrès

Simon Devauchelle, David Doukhan, Lucas Ondel Yang, Benjamin Élie, Albert Rilliard. Estimation automatique de caractéristiques acoustiques pour l’étude diachronique du français oral dans les médias. Atelier DAHLIA: DigitAl Humanities and cuLtural herItAge: data and knowledge management and analysis, Claudia Marinica; Fabrice Guillet; Florent Laroche, Jan 2025, Strasbourg, France. ⟨hal-04938377⟩

STL, STL

Year of publication 2025

Available in free access

HAL publication
Article dans une revue

Rémi Uro, David Doukhan. Pendant le confinement, le temps de parole des femmes a baissé à la télévision et à la radio. La revue des médias, 2020. ⟨hal-04906221⟩

STL, TLP

Year of publication 2020

HAL publication
Communication dans un congrès

Fanny Ducel, Nicolas Hiebel, Olivier Ferret, Karën Fort, Aurélie Névéol. “Women do not have heart attacks!” Gender Biases in Automatically Generated Clinical Cases in French. NAACL 2025 – Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics, Apr 2025, Albuquerque, United States. ⟨hal-04938811⟩

STL

Year of publication 2025

Available in free access

HAL publication
Article dans une revue

Clément Bernard, Guillaume Postic, Sahar Ghannay, Fariza Tahi. RNA-TorsionBERT: leveraging language models for RNA 3D torsion angles prediction. Bioinformatics, 2025, 41 (1), pp.btaf004. ⟨10.1093/bioinformatics/btaf004⟩. ⟨hal-04911519⟩

STL

Year of publication 2025

Available in free access

HAL publication
Article dans une revue

Marion Ficher, Tom Bauer, Anne-Laure Ligozat. A comprehensive review of the end-of-life modeling in LCAs of digital equipment. International Journal of Life Cycle Assessment, 2024, 30 (1), pp.20-42. ⟨10.1007/s11367-024-02367-x⟩. ⟨hal-04924691⟩

STL

Year of publication 2024

Available in free access

HAL publication
Thèse

Atilla Kaan Alkan. Natural Language Processing for Analyzing Messages of Astrophysical Observations. Artificial Intelligence [cs.AI]. Université Paris-Saclay, 2024. English. ⟨NNT : 2024UPASG114⟩. ⟨tel-04928511⟩

STL

Year of publication 2024

Available in free access

HAL publication
Pré-publication, Document de travail

Clément Bernard, Guillaume Postic, Sahar Ghannay, Fariza Tahi. Has AlphaFold3 achieved success for RNAs?. 2025. ⟨hal-04911522⟩

STL

Year of publication 2025

Available in free access

HAL publication
Thèse

Léa-Marie Lam-Yee-Mui. Modélisations pour la reconnaissance de la parole à données contraintes. Traitement du signal et de l’image [eess.SP]. Université Paris-Saclay, 2024. Français. ⟨NNT : 2024UPASG075⟩. ⟨tel-04918814⟩

STL

Year of publication 2024

Available in free access

HAL publication
Article dans une revue

Clément Bernard, Guillaume Postic, Sahar Ghannay, Fariza Tahi. Has AlphaFold 3 achieved success for RNA?. Acta crystallographica Section D : Structural biology [1993-..], 2025, 81 (2), pp.49–62. ⟨10.1107/S2059798325000592⟩. ⟨hal-04919467⟩

STL

Year of publication 2025

HAL publication
Chapitre d'ouvrage

Philippe Boula de Mareüil, Plínio A. Barbosa. Picos melódicos pretônicos em final de enunciado no português brasileiro: um estudo quantitativo. Dermeval da Hora; Ángela Helmer. Interseções Linguísticas: Estudos Diversos, Líquido Editorial, pp.71-85, 2023, ALFAL, 9786599924804. ⟨hal-04893646⟩

STL

Year of publication 2023

Available in free access

HAL publication
Pré-publication, Document de travail

Douglas Teodoro, Nona Naderi, Anthony Yazdani, Boya Zhang, Alban Bornet. A Scoping Review of Artificial Intelligence Applications in Clinical Trial Risk Assessment. 2025. ⟨hal-04913991⟩

STL

Year of publication 2025

HAL publication
Pré-publication, Document de travail

Omar Adjali, Olivier Ferret, Sahar Ghannay, Hervé Le Borgne. Entity-aware cross-modal pretraining for Knowledge-Based Visual Question Answering. 2024. ⟨cea-04910767⟩

STL

Year of publication 2024

Available in free access

HAL publication
Thèse

Paritosh Sharma. Sign Language synthesis by a decreasing granularity system from AZee. Computation and Language [cs.CL]. Université Paris-Saclay, 2024. English. ⟨NNT : 2024UPASG092⟩. ⟨tel-04908078⟩

STL

Year of publication 2024

Available in free access

HAL publication
Communication dans un congrès

Laetitia Biscarrat, David Doukhan, Cyril Grouin. De Loft Story aux Marseillais à Dubaï : apport des méthodes d’analyse automatique pour la description des évolutions du dispositif télévisuel. Colloque ”La téléréalité, entre média, événement et société”, part of 89e Congrès de l’Association canadienne-française pour l’avancement des sciences (ACFAS), Association canadienne-française pour l’avancement des sciences (ACFAS), 2022, Montreal, Canada. ⟨hal-04906923⟩

STL

Year of publication 2022

HAL publication
Communication dans un congrès

Laetitia Biscarrat, David Doukhan, Cyril Grouin. De Loft Story aux Marseillais à Dubaï : 20 ans de télé-réalité, 20 ans de sexisme ? Apport des méthodes d’analyse automatique pour une approche comparative. Première journée d’études de l’Arcom, ARCOM, Nov 2022, Paris, France. ⟨hal-04905959⟩

STL, STL

Year of publication 2022

Available in free access

HAL publication
Communication dans un congrès

Rémi Uro, Marie Tahon, David Doukhan, Albert Rilliard. Comprendre les phénomènes permettant la gestion des tours de parole dans les contenus de médias audiovisuels. Journée commune AFIA-TLH / AFCP – “Extraction de connaissances interprétables pour l’étude de la communication parlée”, Corinne Fredouille; Maëva Garnier; Olivier Perrotin; Marie Tahon, Dec 2023, Avignon, France. ⟨hal-04906679⟩

STL, TLP

Year of publication 2023

HAL publication
Autre publication scientifique

Louis Estève, Kaja Dobrovoljc. A new pipeline for measuring diversity across various linguistic levels. 2025. ⟨hal-04886792⟩

STL

Year of publication 2025

Available in free access

HAL publication
Communication dans un congrès

Leticia Rebollo Couto, Albert Rilliard. Variação Pragmática e Diminutivização: intensificação e atenuação de atos expressivos e diretivos para a dublagem de animação em português, espanhol e francês. IV Colloque International VariaR 2024, Université Paul-Valéry Montpellier 3, Jun 2024, Montpellier, France. pp.43-44, ⟨10.3726/978-3-0351-0740-1⟩. ⟨hal-04874595⟩

STL

Year of publication 2024

Available in free access

HAL publication
Thèse

Sofiya Kobylyanskaya. Towards multimodal assessment of L2 level : speech and eye tracking features in a cross-cultural setting. Computation and Language [cs.CL]. Université Paris-Saclay, 2024. English. ⟨NNT : 2024UPASG111⟩. ⟨tel-04900961⟩

STL

Year of publication 2024

Available in free access

HAL publication
Poster de conférence

Leticia Rebollo Couto, Albert Rilliard. Variación pragmática y expresividad negativa: análisis multimodal en datos de doblaje. LingCor2024: Workshop on Spoken Corpus Linguistics, Jul 2024, Vienna, Austria. . ⟨hal-04874470⟩

STL

Year of publication 2024

Available in free access

HAL publication
Communication dans un congrès

Clémentine Bleuze, Fanny Ducel, Karën Fort, Maxime Amblard. Vers la création d’une super-intelligence » : un corpus pour étudier les revendications des articles de TALTraitement Automatique des langues. Journées de lancement LIFT 2, Nov 2024, Orléans, France. ⟨hal-04880335⟩

STL

Year of publication 2024

Available in free access

HAL publication
Communication dans un congrès

Ayoub Hammal, Benno Uthayasooriyar, Caio Corro. Few-Shot Domain Adaptation for Named-Entity Recognition via Joint Constrained k-Means and Subspace Selection. COLING 2025 – 31st International Conference on Computational Linguistics, Jan 2025, Abu Dhabi, United Arab Emirates. pp.1-15. ⟨hal-04877776⟩

STL

Year of publication 2025

Available in free access

HAL publication
Communication dans un congrès

Simon Devauchelle, Albert Rilliard, David Doukhan, Lucas Ondel Yang. Describing voice in French media archives: age and gender effects on pitch and articulation characteristics. XX Convegno Nazionale AISV, LFSAG (Laboratorio di Fonetica Sperimentale “Arturo Genre”) Dipartimento di Lingue e Letterature Straniere e Culture Moderne Università degli Studi di Torino, Feb 2024, Turin, Italy. ⟨hal-04874662⟩

STL

Year of publication 2024

HAL publication
Communication dans un congrès

Donna Erickson, João Antônio De Moraes, Albert Rilliard. Dimensões das atitudes prosódicas entre culturas. V Seminário Internacional de Fonologia, Universidade Federal do Rio de Janeiro, Nov 2024, Rio de Janeiro, Brazil. ⟨hal-04874627⟩

STL

Year of publication 2024

HAL publication
Communication dans un congrès

Khanh-An C Quan, Camille Guinaudeau, Shin’Ichi Satoh. Evaluating VQA Models’ Consistency in the Scientific Domain. Multimedia Modelling 2025, Jan 2025, Nara, Japan. ⟨hal-04860239⟩

STL

Year of publication 2025

Available in free access

HAL publication
Communication dans un congrès

Saumya Yadav, Elise Lincker, Caroline Huron, Stéphanie Martin, Camille Guinaudeau, et al.. Towards Inclusive Education: Multimodal Classification of Textbook Images for Accessibility. Multimedia Modelling 2025, Jan 2025, Nara, Japan. ⟨hal-04860245⟩

STL

Year of publication 2025

Available in free access

HAL publication
Communication dans un congrès

Delphine Bernhard, Myriam Bras, Anne-Laure Ligozat, Aleksandra Miletic, Jean Sibille, et al.. L’avenir numérique des langues minoritaires : bilan du projet RESTAURE pour l’alsacien, l’occitan et le picard. Colloque « Langues minoritaires » : quels acteurs pour quel avenir ?, Groupe d’Etudes sur le Plurilinguisme européen (EA1339 LiLPa), Nov 2019, Strasbourg, France. ⟨hal-04864670⟩

ILES, ILES, STL

Year of publication 2019

HAL publication
Article dans une revue

Cyril Grouin, Natalia Grabar. Year 2023 in Biomedical Natural Language Processing: A Tribute to Large Language Models and Generative AI. IMIA Yearbook of Medical Informatics, 2024, pp.241-248. ⟨10.1055/s-0044-1800751⟩. ⟨hal-04865083⟩

STL, STL

Year of publication 2024

Available in free access

HAL publication
Communication dans un congrès

Natalia Grabar, Thierry Hamon. Study of the propaganda techniques occurring in Russian newspaper titles in 2022. METAPOL, université de Liège, Nov 2024, Liège, Belgium. ⟨hal-04865074⟩

STL

Year of publication 2024

HAL publication
Article dans une revue

Angèle Gayet-Ageron, Khaoula Ben Messaoud, Mark Richards, Cyril Jaksic, Julien Gobeill, et al.. Gender and geographical bias in the editorial decision-making process of biomedical journals: a case-control study. BMJ Evidence-Based Medicine, 2024, pp.bmjebm-2024-113083. ⟨10.1136/bmjebm-2024-113083⟩. ⟨hal-04865134⟩

STL

Year of publication 2024

HAL publication
Communication dans un congrès

Omar Adjali, Olivier Ferret, Sahar Ghannay, Hervé Le Borgne. Multi-Level Information Retrieval Augmented Generation for Knowledge-based Visual Question Answering. The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024), Nov 2024, Miami, United States. pp.16499-16513, ⟨10.18653/v1/2024.emnlp-main.922⟩. ⟨hal-04852275⟩

STL

Year of publication 2024

Available in free access

HAL publication
Pré-publication, Document de travail

Aurélie Bugeau, Anne-Laure Ligozat. L’informatique en temps de crises environnementales : comment adapter la recherche et l’enseignement ?. 2024. ⟨hal-04850517⟩

STL

Year of publication 2024

Available in free access

HAL publication
Article dans une revue

Donna Erickson, Albert Rilliard, Ela Thurgood, João Antônio de Moraes, Takaaki Shochi. Acoustic and perceptual profiles of american english social affective expressions. Journal of Speech Sciences, 2024, 13, pp.e024004. ⟨10.20396/joss.v13i00.20015⟩. ⟨hal-04850040⟩

STL

Year of publication 2024

Available in free access

HAL publication
Pré-publication, Document de travail

Clément Morand, Anne-Laure Ligozat, Aurélie Névéol. Does Efficiency Lead to Green Machine Learning Model Training? Analyzing Historical Trends in Impacts from Hardware, Algorithmic and Carbon Optimizations. 2025. ⟨hal-04839926v4⟩

STL

Year of publication 2025

Available in free access

HAL publication
Article dans une revue

Lucie Gianola. Traitement automatique des langues et linguistique de corpus pour la reconnaissance d’entités en analyse criminelle. Revue internationale de criminologie et de police technique et scientifique, 2021, LXXIV (3), pp.363-382. ⟨hal-04833123⟩

ILES, ILES, STL

Year of publication 2021

Available in free access

HAL publication
Poster de conférence

Mathilde Aguiar, Ying Lai, Pierre Zweigenbaum, Nona Naderi. Constituting a dataset for applying Natural Language Inference to Chinese Clinical Trials: possible approaches and challenges. Junior Conference on Data Sciences and Engineering, Sep 2024, Gif-sur-Yvette, France. ⟨hal-04837721⟩

STL

Year of publication 2024

Available in free access

HAL publication
Communication dans un congrès

Hansjörg Mixdorff, Albert Rilliard, Navneet Nayan. Perceptual Evaluation of Attitudinal Expressions. 5th International Symposium on Applied Phonetics (ISAPh 2024), Pärtel Lippus, Sep 2024, Tartu, Estonia. pp.60-64, ⟨10.21437/ISAPh.2024-12⟩. ⟨hal-04823812⟩

STL

Year of publication 2024

Available in free access

HAL publication
Pré-publication, Document de travail

Ilia Kuznetsov, Osama Mohammed Afzal, Koen Dercksen, Nils Dycke, Alexander Goldberg, et al.. What Can Natural Language Processing Do for Peer Review?. 2024. ⟨hal-04797652⟩

STL

Year of publication 2024

Available in free access

HAL publication
Article dans une revue

Fanny Ducel, Aurélie Névéol, Karën Fort. “You’ll be a nurse, my son!” Automatically Assessing Gender Biases in Autoregressive Language Models in French and Italian. Language Resources and Evaluation, 2024, ⟨10.1007/s10579-024-09780-6⟩. ⟨hal-04803403⟩

STL

Year of publication 2024

Available in free access

HAL publication
Communication dans un congrès

Lisa Raithel, Hui-Syuan Yeh, Shuntaro Yada, Cyril Grouin, Thomas Lavergne, et al.. A Dataset for Pharmacovigilance in German, French, and Japanese: Annotating Adverse Drug Reactions across Languages. The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), May 2024, Turin, Italy. pp.395-414. ⟨hal-04779777⟩

STL

Year of publication 2024

Available in free access

HAL publication
Communication dans un congrès

Dongfang Xu, Guillermo Lopez-Garcia, Lisa Raithel, Roland Roller, Philippe Thomas, et al.. Overview of the 9th Social Media Mining for Health Applications (#SMM4H) Shared Tasks at ACL 2024 – Large Language Models and Generalizability for Social Media NLP. The 9th Social Media Mining for Health Research and Applications (SMM4H 2024) Workshop and Shared Tasks, Association for Computational Linguistics, Aug 2024, Bangkok, Thailand. pp.183-195. ⟨hal-04781745⟩

STL

Year of publication 2024

Available in free access

HAL publication
Proceedings/Recueil des communications

Pierre Zweigenbaum, Serge Sharoff, Reinhard Rapp. The 17th Workshop on Building and Using Comparable Corpora (BUCC) @LREC-COLING-2024. Workshop Proceedings. 17th Workshop on Building and Using Comparable Corpora (BUCC), 2024, 978-2-493814-31-9. ⟨hal-04779272⟩

STL

Year of publication 2024

Available in free access

HAL publication
Communication dans un congrès

Atilla Kaan Alkan, Felix Grezes, Cyril Grouin, Fabian Schüssler, Pierre Zweigenbaum. Enriching a Time-Domain Astrophysics Corpus with Named Entity, Coreference, and Astrophysical Relationship Annotations. The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), Apr 2024, Turin, Italy. pp.6177-6188. ⟨hal-04780619⟩

STL

Year of publication 2024

Available in free access

HAL publication
Communication dans un congrès

Virgile Barthet, Marie José Aroulanda, Laura Monceaux-Cachard, Christine Jacquin, Cyril Grouin, et al.. Équilibrer qualité et quantité : comparaison de stratégies d’annotation pour la reconnaissance d’entités nommées en cardiologie. Journée Santé et IA 2024, AFIA; L3I; La Rochelle Université, Jul 2024, La Rochelle, France. ⟨hal-04780743⟩

STL

Year of publication 2024

Available in free access

HAL publication
Article dans une revue

Clément Morand, Olivier Ridoux. CRI : A Competent Reader Imitator for detecting binomial names in an historical corpus. Lingvisticae investigationes : International Journal of Linguistics and Language, 2024, 47 (1), pp.30-67. ⟨10.1075/li.00107.mor⟩. ⟨hal-04764787⟩

STL

Year of publication 2024

Available in free access

HAL publication
Mémoire d'étudiant

Clément Morand. Evaluation of the environmental impacts of Natural Language Processing methods. Computer Science [cs]. 2023. ⟨dumas-04758937⟩

STL

Year of publication 2023

Available in free access

HAL publication
Communication dans un congrès

Fanny Ducel, Aurélie Névéol, Karën Fort. Desiderata for Actionable Bias Research. New Perspectives on Bias and Discrimination in Language Technology, Nov 2024, Amsterdam, Netherlands. ⟨hal-04755691⟩

STL

Year of publication 2024

Available in free access

HAL publication
Article dans une revue

Jamil Zaghir, Marco Naguib, Mina Bjelogrlic, Aurélie Névéol, Xavier Tannier, et al.. Prompt Engineering Paradigms for Medical Applications: Scoping Review. Journal of Medical Internet Research, 2024, 26, pp.e60501. ⟨10.2196/60501⟩. ⟨hal-04752782⟩

STL

Year of publication 2024

HAL publication
Communication dans un congrès

Mariana Neves, Cristian Grozea, Philippe Thomas, Roland Roller, Rachel Bawden, et al.. Findings of the WMT 2024 Biomedical Translation Shared Task: TestDéfinition courte Lorem ipsum Sets on Abstract Level. WMT24 – Ninth Conference on Machine Translation, Nov 2024, Miami, Florida, United States. pp.124-138, ⟨10.18653/v1/2024.wmt-1.6⟩. ⟨hal-04750560⟩

STL

Year of publication 2024

Available in free access

HAL publication
Thèse

Théo Deschamps-Berger. Social Emotion Recognition with multimodal deep learning architecture in emergency call centers. Computation and Language [cs.CL]. Université Paris-Saclay, 2024. English. ⟨NNT : 2024UPASG036⟩. ⟨tel-04750508⟩

STL, STL

Year of publication 2024

Available in free access

HAL publication
Article dans une revue

Najet Hadj Mohamed, Cherifa Ben Khelil, Agata Savary, Iskander Keskes, Jean Yves Antoine, et al.. PARSEME-AR: Arabic reference corpus for multiword expressions using PARSEME annotation guidelines. Language Resources and Evaluation, 2024, ⟨10.1007/s10579-024-09763-7⟩. ⟨hal-04738059⟩

STL

Year of publication 2024

Available in free access

HAL publication
Rapport

David Benaben, Françoise Berthoud, Gaël Guennebaud, Anne-Laure Ligozat, S. Valcke. Estimation de l’empreinte carbone d’une heure de calcul sur un cœur CPUCognition Perception et Usages ou sur un GPU. Labos 1point5. 2024. ⟨hal-04738556⟩

STL

Year of publication 2024

Available in free access

HAL publication
Communication dans un congrès

Théo Gigant, Camille Guinaudeau, Marc Decombas, Frédéric Dufaux. Mitigating the Impact of Reference Quality on Evaluation of Summarization Systems with Reference-Free Metrics. The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024), Nov 2024, Miami (FL), United States. pp.19355-19368, ⟨10.18653/v1/2024.emnlp-main.1078⟩. ⟨hal-04720645⟩

STL

Year of publication 2024

Available in free access

HAL publication
Communication dans un congrès

Emmanuella Martinod, Michael Filhol. Formal Representation of Interrogation in French Sign Language. LREC-COLING 2024 11th Workshop on the Representation and Processing of Sign Languages: Evaluation of Sign Language Resources, May 2024, Turin, Italy. pp.235-243. ⟨hal-04712681⟩

STL

Year of publication 2024

Available in free access

HAL publication
Communication dans un congrès

Michael Filhol, Thomas von Ascheberg. A software editor for the AZVD graphical Sign Language representation system. LREC-COLING 2024 11th Workshop on the Representation and Processing of Sign Languages: Evaluation of Sign Language Resources, May 2024, Turin, Italy. pp.77-85. ⟨hal-04712674⟩

STL

Year of publication 2024

Available in free access

HAL publication
Communication dans un congrès

Emmanuella Martinod, Michael Filhol. Examining interrogative marking in French Sign Language with the AZee approach. Clause-type marking in the visual modality, workshop at the Annual Conference of the German Linguistics Society, German Linguistics Society, Feb 2024, Bochum, Germany. ⟨hal-04709019⟩

STL

Year of publication 2024

Available in free access

HAL publication
Communication dans un congrès

Paritosh Sharma, Camille Challant, Michael Filhol. Facial Expressions for Sign Language Synthesis using FACSHuman and AZee. LREC-COLING 2024 11th Workshop on the Representation and Processing of Sign Languages: Evaluation of Sign Language Resources, May 2024, Turin, Italy. pp.354-360. ⟨hal-04709105⟩

STL

Year of publication 2024

Available in free access

HAL publication
Communication dans un congrès

Paritosh Sharma, Michael Filhol. Sign Language Synthesis using Pose Priors. MOCO ’24: 9th International Conference on Movement and Computing, May 2024, Utrecht Netherlands, France. pp.1-4, ⟨10.1145/3658852.3659080⟩. ⟨hal-04709203⟩

STL

Year of publication 2024

Available in free access

HAL publication
Article dans une revue

Pierre La Rocca, Gaël Guennebaud, Aurélie Bugeau, Anne-Laure Ligozat. Estimating The Carbon Footprint Of Digital Agriculture Deployment: A Parametric Bottom-Up Modelling Approach.. Journal of Industrial Ecology, In press, 28 (6), pp.1801-1815. ⟨10.1111/jiec.13568⟩. ⟨hal-04708774⟩

STL

Year of publication 2024

Available in free access

HAL publication
Article dans une revue

Fanny Ducel, Aurélie Névéol, Karën Fort. La recherche sur les biais dans les modèles de langue est biaisée : état de l’art en abyme. Revue TALTraitement Automatique des langues : traitement automatique des langues, 2024, 64 (3), pp.119-143. ⟨hal-04710191⟩

STL

Year of publication 2024

Available in free access

HAL publication
Communication dans un congrès, Communication dans un congrès

Carlos Cuevas Villarmin, Sarah Cohen-Boulakia, Nona Naderi. Reproducibility in Named Entity Recognition: A Case Study Analysis. 2024 IEEE 20th International Conference on e-Science (e-Science), Sep 2024, Osaka, Japan. ⟨10.1109/e-Science62913.2024.10678721⟩. ⟨hal-04706673⟩

BioInfo, BioInfo, STL

Year of publication 2024

Available in free access

HAL publication
Communication dans un congrès

Rémi Uro, Marie Tahon, David Doukhan, Antoine Laurent, Albert Rilliard. Detecting the terminality of speech-turn boundary for spoken interactions in French TV and Radio content. Interspeech 2024, Itshak Lapidot; Sharon Gannot, Sep 2024, Kos, Greece. pp.3560 – 3564, ⟨10.21437/interspeech.2024-1163⟩. ⟨hal-04694968⟩

STL

Year of publication 2024

Available in free access

HAL publication
Communication dans un congrès

Donna Erickson, Albert Rilliard, Malin Svensson Lundmark, Adelaide Silva, Leticia Rebollo Couto, et al.. Collecting Mandible Movement in Brazilian Portuguese. Interspeech 2024, Itshak Lapidot; Sharon Gannot, Sep 2024, Kos, Greece. pp.3145-3149, ⟨10.21437/interspeech.2024-1216⟩. ⟨hal-04694958⟩

STL

Year of publication 2024

Available in free access

HAL publication
Communication dans un congrès

Benjamin Elie, David Doukhan, Rémi Uro, Lucas Ondel Yang, Albert Rilliard, et al.. Articulatory Configurations across Genders and Periods in French Radio and TV archives. Interspeech 2024, Itshak Lapidot; Sharon Gannot, Sep 2024, Kos, Greece. pp.3085-3089, ⟨10.21437/interspeech.2024-1177⟩. ⟨hal-04694868⟩

STL

Year of publication 2024

Available in free access

HAL publication
Communication dans un congrès

Rémi Uro, Marie Tahon, Jane Wottawa, David Doukhan, Albert Rilliard, et al.. Annotation of Transition-Relevance Places and Interruptions for the Description of Turn-Taking in Conversations in French Media Content. The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), May 2024, Torino, Italy. pp.1225–1232. ⟨hal-04694997⟩

STL

Year of publication 2024

Available in free access

HAL publication
Communication dans un congrès, Communication dans un congrès

Luc Mottin, Nona Naderi, Anaïs Mottaz, Pierre-André Michel, Gerieke Been, et al.. Comparing Sequence-Based and Literature-Based Pathogenicity Scoring Methods for Human Variants. 34th Medical Informatics Europe Conference, Aug 2024, Athènes, Greece. ⟨10.3233/SHTI240747⟩. ⟨hal-04682928⟩

STL

Year of publication 2024

Available in free access

HAL publication
Communication dans un congrès

Annelies Braffort, Patrice Dalle. Sign language processing: models, representations, tools for video analysis, for signing avatars and for communication. 2nd International Society for Gesture Studies (ISGS 2005) conference: “Interacting bodies”, 2005, Lyon, France. ⟨hal-04678548⟩

ILES, ILES, STL

Year of publication 2005

HAL publication
Communication dans un congrès

Mathilde Aguiar, Pierre Zweigenbaum, Nona Naderi. Récentes avancées de l’inférence en langue naturelle pour les essais cliniques. Journée Santé et IA 2024, AFIA; L3I; La Rochelle Université, Jul 2024, La Rochelle, France. ⟨hal-04667736⟩

STL

Year of publication 2024

Available in free access

HAL publication
Article dans une revue

Leticia Rebollo Couto, Albert Rilliard. Variación pragmática, traducción audiovisual y estrategias conversacionales para el doblaje: léxico coloquial y palabras tabús. Cadernos de Tradução , 2024, Sex, Taboo, and Swearing: Forbidden Words in Audiovisual Translation, 44 (2), pp.1-28. ⟨10.5007/2175-7968.2024.e99158⟩. ⟨hal-04668979⟩

STL

Year of publication 2024

Available in free access

HAL publication
Poster de conférence

Sylvain Kahane, Claudel Pierre-Louis, Sandra Jagodzińska, Agata Savary. The first Haitian Creole treebank. Peer reviewed poster in the 2nd UniDive Workshop, Feb 2024, Naples, Italy. ⟨hal-04667550⟩

ILES, ILES, STL

Year of publication 2024

HAL publication
Communication dans un congrès

Agata Savary, Daniel Zeman, Verginica Barbu Mititelu, Anabela Barreiro, Olesea Caftanatov, et al.. UniDive: A COST Action on Universality, Diversity and Idiosyncrasy in Language Technology. 3rd Annual Meeting of the Special Interest Group on Under-resourced Languages @ LREC-COLING 2024, May 2024, Torino, Italy. pp.372-382. ⟨hal-04667545⟩

ILES, ILES, STL

Year of publication 2024

Available in free access

HAL publication
Communication dans un congrès

Najet Hadj Mohamed, Agata Savary, Cherifa Ben Khelil, Jean-Yves Antoine, Iskandar Keskes, et al.. Lexicons Gain the Upper Hand in Arabic MWE Identification. Joint Workshop on Multiword Expressions and Universal Dependencies (MWE-UD) @ LREC-COLING 2024, May 2024, Torino, Italy. pp.88-97. ⟨hal-04667546⟩

ILES, ILES, STL

Year of publication 2024

Available in free access

HAL publication
Autre publication scientifique

Louis Estève, Agata Savary, Thomas Lavergne. Entropy Behaviour upon Dataset Size Update. 2024. ⟨hal-04666672⟩

STL

Year of publication 2024

Available in free access

HAL publication
Communication dans un congrès

Bui Van-Tuan, Agata Savary. Cross-type French Multiword Expression Identification with Pre-trained Masked Language Models. The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), May 2024, Turin, Italy. pp.4198-4204. ⟨hal-04667119⟩

ASARD, ILES, ILES, STL

Year of publication 2024

Available in free access

HAL publication
Thèse

Natalia Kalashnikova. Towards detection of nudges in Human-Human and Human-Machine interactions. Computation and Language [cs.CL]. Université Paris-Saclay, 2024. English. ⟨NNT : 2024UPASG031⟩. ⟨tel-04663129⟩

STL, STL

Year of publication 2024

Available in free access

HAL publication
Communication dans un congrès

Louis Estève, Agata Savary, Thomas Lavergne. Vector Spaces for Quantifying Disparity of Multiword Expressions in Annotated Text. Association for Computational Linguistics – Student Research Workshop, Aug 2024, Bangkok, Thailand. pp.110-130, ⟨10.18653/v1/2024.acl-srw.20⟩. ⟨hal-04660179⟩

STL

Year of publication 2024

Available in free access

HAL publication
Article dans une revue

Annelies Braffort. L’héritage scientifique de Patrice Dalle : le traitement automatique des langues des signes au service de l’enseignement en LSF. La main de Thôt : théories, enjeux et pratiques de la traduction, 2024, 11. ⟨hal-04256752⟩

STL

Year of publication 2024

Available in free access

HAL publication

All Publications

Teams

Recent Publications

Ana Manzano Rodríguez, Camille Guinaudeau, Shin Ichi Satoh. Uncovering Gender Biases in Gender Identification Models for Japanese Data Analysis. Workshop on Demographic Diversity in Computer Vision @ CVPR 2025, Jun 2025, Nashville (Tennessee), United States. ⟨hal-05154054⟩

Jiahui Hu. Granular Insights into Financial Discourse : Fine-Grained Opinion Analysis of Expert Texts. Document and Text Processing. Université Paris-Saclay, 2023. English. ⟨NNT : 2023UPASG110⟩. ⟨tel-05153905⟩

Philippe Boula de Mareüil, Marc Evrard, Alexandre François, Antonio Romano. Computer modelling of innovations relative to Latin in contemporary Romance dialects. Isogloss. Open Journal of Romance Linguistics, 2025, 11 (3), pp.1 – 31. ⟨10.5565/rev/isogloss.423⟩. ⟨hal-05144863⟩

Anne Baillot, Anne-Laure Ligozat. Introduction. Sobriété numérique. Humanités numériques, 2025, 11, ⟨10.4000/1498x⟩. ⟨hal-05143071⟩

Pierre Lepagnol, Sahar Ghannay, Thomas Gerald, Christophe Servan, Sophie Rosset. Leveraging Information Retrieval to Enhance Spoken Language Understanding Prompts in Few-Shot Learning. Interpseech 2025, Aug 2025, Rotterdam, Netherlands. ⟨hal-05095796⟩

Agata Savary. NLP-based Study of Universals of Linguistic Idiosyncrasy. Dagstuhl Reports, 2023, 13 (5), pp.64-67. ⟨hal-04323075⟩

Mathieu Laï-King. Qualité des articles de recherche et modèles de langue neuronaux : applications au domaine biomédical. Intelligence artificielle [cs.AI]. Université Paris-Saclay, 2025. Français. ⟨NNT : 2025UPASG031⟩. ⟨tel-05079724⟩

Clément Morand, Anne-Laure Ligozat, Aurélie Névéol. Characterizing Goals and Impacts of Digitalization: The Case of Promises in French Healthcare Policies. 2025. ⟨hal-05066176⟩

Karin Dassas, Cyrille Bonamy, Bruno Bzeznik, Romaric David, Emmanuelle Frenoux, et al.. Estimer l’impact carbone des activités numériques de l’Observatoire de Paris. EcoInfo. 2025, pp.1-47. ⟨hal-05068666⟩

Nicolas Hiebel, Olivier Ferret, Karën Fort, Aurélie Névéol. Clinical text generation: Are we there yet?. Annual Review of Biomedical Data Science, 2025, 8, ⟨10.1146/annurev-biodatasci-103123-095202⟩. ⟨hal-05055957⟩

Arezoo Saedi, Afsaneh Fatemi, Mohammad Ali Nematbakhsh, Sophie Rosset, Anne Vilnat. Entity search based on consumer preferences leveraging user reviews. Expert Systems with Applications, 2025, 275, pp.126990. ⟨10.1016/j.eswa.2025.126990⟩. ⟨hal-05047109⟩

Mathilde Aguiar, Pierre Zweigenbaum, Nona Naderi. Am I eligible? Natural Language Inference for Clinical Trial Patient Recruitment: the Patient’s Point of View. 2025. ⟨hal-04992084⟩

Amel Fraisse, Patrick Paroubek, Ramit Goyal, Nassreddine Znaidi. Measuring Multilingualism in Online Public Access Catalogs. The ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL), Dec 2024, Hong Kong, China. ⟨10.1145/3677389.3702544⟩. ⟨hal-04986773⟩

Manon Scholivet, Agata Savary, Louis Estève, Marie Candito, Carlos Ramisch. SELEXINI – a large and diverse automatically parsed corpus of French. Building and Using Comparable Corpora (BUCC), Jan 2025, Abu DHABI, United Arab Emirates. ⟨hal-04978746⟩

Hui-Syuan Yeh. Prompt-based Relation Extraction for Pharmacovigilance. Computation and Language [cs.CL]. Université Paris-Saclay, 2024. English. ⟨NNT : 2024UPASG097⟩. ⟨tel-04968043⟩

Sylvain Bouveret, Aurélie Bugeau, Frenoux Emmanuelle, Julien Lefevre, Laurent Lefèvre, et al.. Quiz sur les impacts environnementaux du numérique. EcoInfo. 2025, pp.1-5. ⟨hal-04960328v2⟩

Camille Challant. Représentation formelle avec AZee et contraintes grammaticales pour la langue des signes française. Théorie et langage formel [cs.FL]. Université Paris-Saclay, 2024. Français. ⟨NNT : 2024UPASG086⟩. ⟨tel-04957486⟩

Zheng Zhang, Brian Denton, Xiaolan Xie. Branch and Price for Chance-Constrained Bin Packing. INFORMS Journal on Computing, 2020, 32 (3), pp.547-564. ⟨10.1287/ijoc.2019.0894⟩. ⟨hal-04941861⟩

Rémi Uro, David Doukhan. Pendant le confinement, le temps de parole des femmes a baissé à la télévision et à la radio. La revue des médias, 2020. ⟨hal-04906221⟩

Clément Bernard, Guillaume Postic, Sahar Ghannay, Fariza Tahi. RNA-TorsionBERT: leveraging language models for RNA 3D torsion angles prediction. Bioinformatics, 2025, 41 (1), pp.btaf004. ⟨10.1093/bioinformatics/btaf004⟩. ⟨hal-04911519⟩

Marion Ficher, Tom Bauer, Anne-Laure Ligozat. A comprehensive review of the end-of-life modeling in LCAs of digital equipment. International Journal of Life Cycle Assessment, 2024, 30 (1), pp.20-42. ⟨10.1007/s11367-024-02367-x⟩. ⟨hal-04924691⟩

Atilla Kaan Alkan. Natural Language Processing for Analyzing Messages of Astrophysical Observations. Artificial Intelligence [cs.AI]. Université Paris-Saclay, 2024. English. ⟨NNT : 2024UPASG114⟩. ⟨tel-04928511⟩

Clément Bernard, Guillaume Postic, Sahar Ghannay, Fariza Tahi. Has AlphaFold3 achieved success for RNAs?. 2025. ⟨hal-04911522⟩

Léa-Marie Lam-Yee-Mui. Modélisations pour la reconnaissance de la parole à données contraintes. Traitement du signal et de l’image [eess.SP]. Université Paris-Saclay, 2024. Français. ⟨NNT : 2024UPASG075⟩. ⟨tel-04918814⟩

Clément Bernard, Guillaume Postic, Sahar Ghannay, Fariza Tahi. Has AlphaFold 3 achieved success for RNA?. Acta crystallographica Section D : Structural biology [1993-..], 2025, 81 (2), pp.49–62. ⟨10.1107/S2059798325000592⟩. ⟨hal-04919467⟩

Douglas Teodoro, Nona Naderi, Anthony Yazdani, Boya Zhang, Alban Bornet. A Scoping Review of Artificial Intelligence Applications in Clinical Trial Risk Assessment. 2025. ⟨hal-04913991⟩

Omar Adjali, Olivier Ferret, Sahar Ghannay, Hervé Le Borgne. Entity-aware cross-modal pretraining for Knowledge-Based Visual Question Answering. 2024. ⟨cea-04910767⟩

Paritosh Sharma. Sign Language synthesis by a decreasing granularity system from AZee. Computation and Language [cs.CL]. Université Paris-Saclay, 2024. English. ⟨NNT : 2024UPASG092⟩. ⟨tel-04908078⟩

Louis Estève, Kaja Dobrovoljc. A new pipeline for measuring diversity across various linguistic levels. 2025. ⟨hal-04886792⟩

Sofiya Kobylyanskaya. Towards multimodal assessment of L2 level : speech and eye tracking features in a cross-cultural setting. Computation and Language [cs.CL]. Université Paris-Saclay, 2024. English. ⟨NNT : 2024UPASG111⟩. ⟨tel-04900961⟩

Leticia Rebollo Couto, Albert Rilliard. Variación pragmática y expresividad negativa: análisis multimodal en datos de doblaje. LingCor2024: Workshop on Spoken Corpus Linguistics, Jul 2024, Vienna, Austria. . ⟨hal-04874470⟩

Clémentine Bleuze, Fanny Ducel, Karën Fort, Maxime Amblard. Vers la création d’une super-intelligence » : un corpus pour étudier les revendications des articles de TALTraitement Automatique des langues. Journées de lancement LIFT 2, Nov 2024, Orléans, France. ⟨hal-04880335⟩

Donna Erickson, João Antônio De Moraes, Albert Rilliard. Dimensões das atitudes prosódicas entre culturas. V Seminário Internacional de Fonologia, Universidade Federal do Rio de Janeiro, Nov 2024, Rio de Janeiro, Brazil. ⟨hal-04874627⟩

Khanh-An C Quan, Camille Guinaudeau, Shin’Ichi Satoh. Evaluating VQA Models’ Consistency in the Scientific Domain. Multimedia Modelling 2025, Jan 2025, Nara, Japan. ⟨hal-04860239⟩

Saumya Yadav, Elise Lincker, Caroline Huron, Stéphanie Martin, Camille Guinaudeau, et al.. Towards Inclusive Education: Multimodal Classification of Textbook Images for Accessibility. Multimedia Modelling 2025, Jan 2025, Nara, Japan. ⟨hal-04860245⟩

Cyril Grouin, Natalia Grabar. Year 2023 in Biomedical Natural Language Processing: A Tribute to Large Language Models and Generative AI. IMIA Yearbook of Medical Informatics, 2024, pp.241-248. ⟨10.1055/s-0044-1800751⟩. ⟨hal-04865083⟩

Natalia Grabar, Thierry Hamon. Study of the propaganda techniques occurring in Russian newspaper titles in 2022. METAPOL, université de Liège, Nov 2024, Liège, Belgium. ⟨hal-04865074⟩

Aurélie Bugeau, Anne-Laure Ligozat. L’informatique en temps de crises environnementales : comment adapter la recherche et l’enseignement ?. 2024. ⟨hal-04850517⟩

Donna Erickson, Albert Rilliard, Ela Thurgood, João Antônio de Moraes, Takaaki Shochi. Acoustic and perceptual profiles of american english social affective expressions. Journal of Speech Sciences, 2024, 13, pp.e024004. ⟨10.20396/joss.v13i00.20015⟩. ⟨hal-04850040⟩

Clément Morand, Anne-Laure Ligozat, Aurélie Névéol. Does Efficiency Lead to Green Machine Learning Model Training? Analyzing Historical Trends in Impacts from Hardware, Algorithmic and Carbon Optimizations. 2025. ⟨hal-04839926v4⟩

Lucie Gianola. Traitement automatique des langues et linguistique de corpus pour la reconnaissance d’entités en analyse criminelle. Revue internationale de criminologie et de police technique et scientifique, 2021, LXXIV (3), pp.363-382. ⟨hal-04833123⟩

Mathilde Aguiar, Ying Lai, Pierre Zweigenbaum, Nona Naderi. Constituting a dataset for applying Natural Language Inference to Chinese Clinical Trials: possible approaches and challenges. Junior Conference on Data Sciences and Engineering, Sep 2024, Gif-sur-Yvette, France. ⟨hal-04837721⟩

Hansjörg Mixdorff, Albert Rilliard, Navneet Nayan. Perceptual Evaluation of Attitudinal Expressions. 5th International Symposium on Applied Phonetics (ISAPh 2024), Pärtel Lippus, Sep 2024, Tartu, Estonia. pp.60-64, ⟨10.21437/ISAPh.2024-12⟩. ⟨hal-04823812⟩

Ilia Kuznetsov, Osama Mohammed Afzal, Koen Dercksen, Nils Dycke, Alexander Goldberg, et al.. What Can Natural Language Processing Do for Peer Review?. 2024. ⟨hal-04797652⟩

Fanny Ducel, Aurélie Névéol, Karën Fort. “You’ll be a nurse, my son!” Automatically Assessing Gender Biases in Autoregressive Language Models in French and Italian. Language Resources and Evaluation, 2024, ⟨10.1007/s10579-024-09780-6⟩. ⟨hal-04803403⟩

Pierre Zweigenbaum, Serge Sharoff, Reinhard Rapp. The 17th Workshop on Building and Using Comparable Corpora (BUCC) @LREC-COLING-2024. Workshop Proceedings. 17th Workshop on Building and Using Comparable Corpora (BUCC), 2024, 978-2-493814-31-9. ⟨hal-04779272⟩

Clément Morand, Olivier Ridoux. CRI : A Competent Reader Imitator for detecting binomial names in an historical corpus. Lingvisticae investigationes : International Journal of Linguistics and Language, 2024, 47 (1), pp.30-67. ⟨10.1075/li.00107.mor⟩. ⟨hal-04764787⟩

Clément Morand. Evaluation of the environmental impacts of Natural Language Processing methods. Computer Science [cs]. 2023. ⟨dumas-04758937⟩

Fanny Ducel, Aurélie Névéol, Karën Fort. Desiderata for Actionable Bias Research. New Perspectives on Bias and Discrimination in Language Technology, Nov 2024, Amsterdam, Netherlands. ⟨hal-04755691⟩

Jamil Zaghir, Marco Naguib, Mina Bjelogrlic, Aurélie Névéol, Xavier Tannier, et al.. Prompt Engineering Paradigms for Medical Applications: Scoping Review. Journal of Medical Internet Research, 2024, 26, pp.e60501. ⟨10.2196/60501⟩. ⟨hal-04752782⟩

Théo Deschamps-Berger. Social Emotion Recognition with multimodal deep learning architecture in emergency call centers. Computation and Language [cs.CL]. Université Paris-Saclay, 2024. English. ⟨NNT : 2024UPASG036⟩. ⟨tel-04750508⟩

Najet Hadj Mohamed, Cherifa Ben Khelil, Agata Savary, Iskander Keskes, Jean Yves Antoine, et al.. PARSEME-AR: Arabic reference corpus for multiword expressions using PARSEME annotation guidelines. Language Resources and Evaluation, 2024, ⟨10.1007/s10579-024-09763-7⟩. ⟨hal-04738059⟩

David Benaben, Françoise Berthoud, Gaël Guennebaud, Anne-Laure Ligozat, S. Valcke. Estimation de l’empreinte carbone d’une heure de calcul sur un cœur CPUCognition Perception et Usages ou sur un GPU. Labos 1point5. 2024. ⟨hal-04738556⟩

Emmanuella Martinod, Michael Filhol. Formal Representation of Interrogation in French Sign Language. LREC-COLING 2024 11th Workshop on the Representation and Processing of Sign Languages: Evaluation of Sign Language Resources, May 2024, Turin, Italy. pp.235-243. ⟨hal-04712681⟩

Michael Filhol, Thomas von Ascheberg. A software editor for the AZVD graphical Sign Language representation system. LREC-COLING 2024 11th Workshop on the Representation and Processing of Sign Languages: Evaluation of Sign Language Resources, May 2024, Turin, Italy. pp.77-85. ⟨hal-04712674⟩

Paritosh Sharma, Michael Filhol. Sign Language Synthesis using Pose Priors. MOCO ’24: 9th International Conference on Movement and Computing, May 2024, Utrecht Netherlands, France. pp.1-4, ⟨10.1145/3658852.3659080⟩. ⟨hal-04709203⟩

Pierre La Rocca, Gaël Guennebaud, Aurélie Bugeau, Anne-Laure Ligozat. Estimating The Carbon Footprint Of Digital Agriculture Deployment: A Parametric Bottom-Up Modelling Approach.. Journal of Industrial Ecology, In press, 28 (6), pp.1801-1815. ⟨10.1111/jiec.13568⟩. ⟨hal-04708774⟩

Fanny Ducel, Aurélie Névéol, Karën Fort. La recherche sur les biais dans les modèles de langue est biaisée : état de l’art en abyme. Revue TALTraitement Automatique des langues : traitement automatique des langues, 2024, 64 (3), pp.119-143. ⟨hal-04710191⟩

Carlos Cuevas Villarmin, Sarah Cohen-Boulakia, Nona Naderi. Reproducibility in Named Entity Recognition: A Case Study Analysis. 2024 IEEE 20th International Conference on e-Science (e-Science), Sep 2024, Osaka, Japan. ⟨10.1109/e-Science62913.2024.10678721⟩. ⟨hal-04706673⟩

Luc Mottin, Nona Naderi, Anaïs Mottaz, Pierre-André Michel, Gerieke Been, et al.. Comparing Sequence-Based and Literature-Based Pathogenicity Scoring Methods for Human Variants. 34th Medical Informatics Europe Conference, Aug 2024, Athènes, Greece. ⟨10.3233/SHTI240747⟩. ⟨hal-04682928⟩

Annelies Braffort, Patrice Dalle. Sign language processing: models, representations, tools for video analysis, for signing avatars and for communication. 2nd International Society for Gesture Studies (ISGS 2005) conference: “Interacting bodies”, 2005, Lyon, France. ⟨hal-04678548⟩

Mathilde Aguiar, Pierre Zweigenbaum, Nona Naderi. Récentes avancées de l’inférence en langue naturelle pour les essais cliniques. Journée Santé et IA 2024, AFIA; L3I; La Rochelle Université, Jul 2024, La Rochelle, France. ⟨hal-04667736⟩

Sylvain Kahane, Claudel Pierre-Louis, Sandra Jagodzińska, Agata Savary. The first Haitian Creole treebank. Peer reviewed poster in the 2nd UniDive Workshop, Feb 2024, Naples, Italy. ⟨hal-04667550⟩

Louis Estève, Agata Savary, Thomas Lavergne. Entropy Behaviour upon Dataset Size Update. 2024. ⟨hal-04666672⟩

Natalia Kalashnikova. Towards detection of nudges in Human-Human and Human-Machine interactions. Computation and Language [cs.CL]. Université Paris-Saclay, 2024. English. ⟨NNT : 2024UPASG031⟩. ⟨tel-04663129⟩

Louis Estève, Agata Savary, Thomas Lavergne. Vector Spaces for Quantifying Disparity of Multiword Expressions in Annotated Text. Association for Computational Linguistics – Student Research Workshop, Aug 2024, Bangkok, Thailand. pp.110-130, ⟨10.18653/v1/2024.acl-srw.20⟩. ⟨hal-04660179⟩

Annelies Braffort. L’héritage scientifique de Patrice Dalle : le traitement automatique des langues des signes au service de l’enseignement en LSF. La main de Thôt : théories, enjeux et pratiques de la traduction, 2024, 11. ⟨hal-04256752⟩