STL

Language Sciences and Technologies

Coordination: Aurélie NEVEOL

The Department of Language Sciences and Technologies studies fundamental questions about linguistic systems by exploiting large corpora that are collected, annotated and enriched in an unsupervised or semi-supervised way using statistical learning models adapted to linguistic material.

These models make it possible to study how languages function and how they vary (phonetically and phonologically, morphologically and lexically, syntactically and semantically) along synchronic, diachronic, diaphasic and diatopic dimensions, and to raise questions about their acquisition as first or second languages. The department also develops major language processing applications, such as speech recognition, machine translation, information retrieval and conversational agents, which are increasingly important for society (safeguarding endangered languages, providing tools for people with disabilities, helping to process medical information and knowledge) and which raise ethical questions.

This approach to language and languages covers a broad spectrum, from the most fundamental to the most applied research, across a wide variety of media (newspapers, social media, video, telephone, etc.) and in all modalities (written, spoken and signed).

This research is highly multidisciplinary, bringing together diverse communities from the fields of computer science, engineering and the humanities.

Recent Publications

  • Conference poster

    Saumya Yadav, Élise Lincker, Caroline Huron, Martin Stéphanie, Camille Guinaudeau, et al.. Vers une pédagogie inclusive : une classification multimodale des illustrations de manuels scolaires pour des environnements d’apprentissage adaptés. JEP TALN RECITAL 2024, Jul 2024, Toulouse, France. ⟨hal-04613698⟩

    STL

    Available in free access

  • Journal special issue

    Pierre Zweigenbaum, Nicolas Maudet, Philippe Morignot, Laurent Vercouter. PFIA 2015. Bulletin de l’Association Française pour l’Intelligence Artificielle, 90, 2015, Association Française d’Intelligence Artificielle. ⟨hal-04595440⟩

    STL

    Available in free access

  • Conference paper

    Camille Challant, Michael Filhol. Extending AZee with Non-manual Gesture Rules for French Sign Language. 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), May 2024, Turin, Italy. pp.7007-7016. ⟨hal-04594830⟩

    STL

    Available in free access

  • Thesis

    Saulo Mendes Santos. How to deal with Discourse Markers : a prosodic, corpus-based, computational and experimental proposal. Computation and Language [cs.CL]. Université Paris-Saclay; Universidade federal de Minas Gerais, 2024. English. ⟨NNT : 2024UPASG013⟩. ⟨tel-04594427⟩

    STL

    Available in free access

  • Conference paper

    Julie Lascar, Michèle Gouiffès, Annelies Braffort, Claire Danet. Annotation of LSF subtitled videos without a pre-existing dictionary. LREC-COLING 2024 11th Workshop on the Representation and Processing of Sign Languages: Evaluation of Sign Language Resources, May 2024, Turin, Italy. pp.100-108. ⟨hal-04593866⟩

    AMI (Architectures et modèles pour l'Interaction), STL

    Available in free access

  • Conference paper

    Julie Halbout, Diandra Fabre, Yanis Ouakrim, Julie Lascar, Annelies Braffort, et al.. Matignon-LSF: a Large Corpus of Interpreted French Sign Language. LREC-COLING 2024 11th Workshop on the Representation and Processing of Sign Languages: Evaluation of Sign Language Resources, May 2024, Turin, Italy. pp.202-208. ⟨hal-04593865⟩

    AMI (Architectures et modèles pour l'Interaction), STL

    Available in free access

  • Conference paper

    Maxime Fily, Guillaume Wisniewski, Séverine Guillaume, Gilles Adda, Alexis Michaud. Mesure du niveau de proximité entre enregistrements audio et évaluation indirecte du niveau d’abstraction des représentations issues d’un grand modèle de langage. JEP TALN RECITAL 2024, Association Française de la Communication Parlée (AFCP), Jul 2024, Toulouse, France. ⟨hal-04583516⟩

    STL

  • Conference paper

    Clément Morand, Anne-Laure Ligozat, Aurélie Névéol. Bracing for impact: on-going digitalization of healthcare requires urgent characterization of impact on environment and beyond. Undone Computer Science, Guillaume Munch-Maccagnoni; Chantal Enguehard; Maël Pégny; Marc Anderson, Feb 2024, Nantes, France. ⟨hal-04579545⟩

    STL

    Available in free access

  • Preprint, working paper

    Leticia Rebollo Couto, Albert Rilliard. Variación pragmática, traducción audiovisual y estrategias conversacionales para el doblaje: léxico coloquial y palabras tabús – Anexos. 2024. ⟨hal-04578522⟩

    STL

  • Conference paper

    Rabab Alkhalifa, Hsuvas Borkakoty, Romain Deveaud, Alaa El-Ebshihy, Luis Espinosa-Anke, et al.. LongEval: Longitudinal Evaluation of Model Performance at CLEF 2024. Advances In Information Retrieval (ECIR 2024), Mar 2024, Glasgow, United Kingdom. pp.60-66, ⟨10.1007/978-3-031-56072-9_8⟩. ⟨hal-04577466⟩

    STL

  • Journal article

    Boya Zhang, Nona Naderi, Rahul Mishra, Douglas Teodoro. Online Health Search Via Multidimensional Information Quality Assessment Based on Deep Language Models: Algorithm Development and Validation. JMIR AI, 2024, 3, pp.e42630. ⟨10.2196/42630⟩. ⟨hal-04574791⟩

    STL

    Available in free access

  • Journal article

    Hossein Rouhizadeh, Irina Nikishina, Anthony Yazdani, Alban Bornet, Boya Zhang, et al.. A Dataset for Evaluating Contextualized Representation of Biomedical Concepts in Language Models. Scientific Data, 2024, 11 (1), pp.455. ⟨10.1038/s41597-024-03317-w⟩. ⟨hal-04574786⟩

    STL

    Available in free access

  • Conference paper

    Maxime Fily, Guillaume Wisniewski, Séverine Guillaume, Gilles Adda, Alexis Michaud. Establishing degrees of closeness between audio recordings along different dimensions using large-scale cross-lingual models. Findings of the Association for Computational Linguistics: EACL 2024, Association for Computational Linguistics, Mar 2024, St. Julian’s, Malta. ⟨hal-04561819⟩

    STL

    Available in free access

  • Conference paper

    Hugo Boulanger, Nicolas Hiebel, Olivier Ferret, Karën Fort, Aurélie Névéol. Using Structured Health Information for Controlled Generation of Clinical Cases in French. The 6th Clinical Natural Language Processing Workshop At NAACL 2024 (ClinicalNLP 2024), Jun 2024, Mexico City, Mexico. ⟨hal-04558890⟩

    STL

    Available in free access

  • Preprint, working paper

    Marion Ficher, Tom Bauer, Anne-Laure Ligozat. A comprehensive review of the end-of-life modeling in LCAs of digital equipment. 2024. ⟨hal-04555155⟩

    STL

    Available in free access

  • Conference paper

    Nicolas Hiebel, Bertrand Remy, Bruno Guillaume, Olivier Ferret, Aurélie Névéol, et al.. Hostomytho: A GWAP for Synthetic Clinical Texts Evaluation and Annotation. Games and Natural Language Processing Workshop at LREC-COLING 2024, May 2024, Turin, Italy. ⟨hal-04555052⟩

    STL

    Available in free access

  • Thesis

    Oralie Cattan. Systèmes de questions-réponses interactifs à grande échelle. Informatique [cs]. Université Paris-Saclay (2020-..), 2022. French. ⟨NNT : ⟩. ⟨tel-04551072⟩

    STL

  • Journal article

    Luma da Silva Miranda, João Antônio de Moraes, Albert Rilliard. Visual channel facilitates the comprehension of the intonation of Brazilian Portuguese wh-questions and wh-exclamations: evidence from congruent and incongruent stimuli. Language and Cognition, 2024, pp.1-21. ⟨10.1017/langcog.2024.16⟩. ⟨hal-04538371⟩

    STL

    Available in free access

  • Preprint, working paper

    Mathilde Aguiar, Pierre Zweigenbaum, Nona Naderi. SEME at SemEval-2024 Task 2: Comparing Masked and Generative Language Models on Natural Language Inference for Clinical Trials. 2024. ⟨hal-04536273⟩

    STL

    Available in free access

  • Conference paper

    Djegdjiga Amazouz, Martine Adda-Decker, Lori Lamel. Variation du voisement des occlusives orales en code-switching: analyses par ABX automatique et mesures acoustiques. Journées d’Études sur la Parole – JEP2022, Jun 2022, Noirmoutier, France. ⟨hal-03703081⟩

    STL

    Available in free access

  • Preprint, working paper

    Mathilde Aguiar, Pierre Zweigenbaum, Nona Naderi. SEME at SemEval-2024 Task 2: Comparing Masked and Generative Language Models on Natural Language Inference for Clinical Trials. 2024. ⟨hal-04536600⟩

    STL

  • Conference paper

    Karën Fort, Laura Alonso Alemany, Luciana Benotti, Julien Bezançon, Claudia Borg, et al.. Your Stereotypical Mileage may Vary: Practical Challenges of Evaluating Biases in Multiple Languages and Cultural Contexts. The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, May 2024, Turin, Italy. ⟨hal-04537096⟩

    STL

    Available in free access

  • Conference paper

    Paul Lerner, Cyril Grouin. INCLURE: a Dataset and Toolkit for Inclusive French Translation. The 17th Workshop on Building and Using Comparable Corpora (BUCC @ LREC 2024), 2024, Turin, Italy. ⟨hal-04531938⟩

    STL

    Available in free access

  • Proceedings

    Karën Fort, Aurélie Névéol. Ethics and NLP: 10 years after. Journée d’études ATALA “éthique et TAL : 10 ans après”, 2024. ⟨hal-04533870⟩

    STL

    Available in free access

  • Conference paper

    Paul Lerner, Olivier Ferret, Camille Guinaudeau. Cross-modal Retrieval for Knowledge-based Visual Question Answering. 46th European Conference on Information Retrieval (ECIR 2024), 2024, Glasgow, United Kingdom. ⟨hal-04384431⟩

    STL

    Available in free access

  • Conference paper

    Tomohiro Nishiyama, Lisa Raithel, Roland Roller, Pierre Zweigenbaum, Eiji Aramaki. Assessing Authenticity and Anonymity of Synthetic User-generated Content in the Medical Domain. Workshop on Computational Approaches to Language Data Pseudonymization (CALD-pseudo), Mar 2024, St. Julian’s, Malta. pp.8-17. ⟨hal-04528240⟩

    STL

    Available in free access

  • Conference paper

    Nadège Alavoine, Gaëlle Laperriere, Christophe Servan, Sahar Ghannay, Sophie Rosset. New Semantic Task for the French Spoken Language Understanding MEDIA Benchmark. The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), May 2024, Torino, Italy. ⟨hal-04523286⟩

    STL

    Available in free access

  • Conference paper

    Nesrine Bannour, Christophe Servan, Aurélie Névéol, Xavier Tannier. A Benchmark Evaluation of Clinical Named Entity Recognition in French. The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), May 2024, Torino, Italy. ⟨hal-04523267⟩

    STL

    Available in free access

  • Conference paper

    Christophe Servan, Sahar Ghannay, Sophie Rosset. mALBERT: Is a Compact Multilingual BERT Model Still Worth It?. The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, May 2024, Torino, Italy. ⟨hal-04520797⟩

    STL

    Available in free access

  • Conference paper

    Aaron Boussidan, Fanny Ducel, Aurélie Névéol, Karën Fort. What ChatGPT tells us about ourselves. Journée d’étude Éthique et TAL 2024, Apr 2024, Nancy, France. ⟨hal-04521121⟩

    STL

    Available in free access

  • Conference paper

    Thierry Hamon, Natalia Grabar. Automatic Prediction of Semantic Labels for French Medical Terms. Medical Informatics Europe conference (MIE2022), May 2022, Nice, France. pp.868-869, ⟨10.3233/SHTI220610⟩. ⟨hal-04519905⟩

    STL

    Available in free access

  • Conference paper

    Pierre Lepagnol, Thomas Gerald, Sahar Ghannay, Christophe Servan, Sophie Rosset. Small Language Models are Good Too: An Empirical Study of Zero-Shot Classification. LREC-COLING 2024, May 2024, Turin, Italy. ⟨hal-04519930v2⟩

    ILES, STL

    Available in free access

  • Conference paper

    Mérième Bouhandi, Emmanuel Morin, Thierry Hamon. Graph Neural Networks for Adapting Off-the-shelf General Domain Language Models to Low-Resource Specialised Domains. 2nd Workshop on Deep Learning on Graphs for Natural Language Processing (DLG4NLP 2022), ACL, Jul 2022, Seattle, Washington, United States. pp.36-42, ⟨10.18653/v1/2022.dlg4nlp-1.5⟩. ⟨hal-04517190⟩

    STL

    Available in free access

  • Conference poster

    Elise Lincker, Léa Pacini, Olivier Pons, Camille Guinaudeau, Jérôme Dupire, et al.. MALIN : MAnuels scoLaires INclusifs : Accessibilité numérique des manuels scolaires. Colloque Handiversité 2023 – L’innovation pour le partage, Apr 2023, Gif-sur-Yvette, France. ⟨hal-04410349⟩

    STL

    Available in free access

  • Journal article

    Angèle Gayet-Ageron, Khaoula Ben Messaoud, Mark Oliver Richards, Cyril Jaksic, Julien Gobeill, et al.. Assessment of gender and geographical bias in the editorial decision-making process of biomedical journals: A Case-Control study. Medrxiv : the Preprint Server For Health Sciences, 2024, ⟨10.1101/2024.03.15.24304220⟩. ⟨hal-04510221⟩

    STL

    Available in free access

  • Journal article

    Clement Bernard, Guillaume Postic, Sahar Ghannay, Fariza Tahi. RNAdvisor: a comprehensive benchmarking tool for the measure and prediction of RNA structural model quality. Briefings in Bioinformatics, 2024, 25 (2), pp.bbae064. ⟨10.1093/bib/bbae064⟩. ⟨hal-04508073⟩

    STL

    Available in free access

  • Preprint, working paper

    Anne-Laure Ligozat, Christophe Brun, Benjamin Demirdjian, Guillaume Gouget, Emilie Jardé, et al.. Setting Climate Targets: The Case of Higher Education and Research. 2024. ⟨hal-04505199⟩

    STL

    Available in free access

  • Conference paper

    Yanis Ouakrim, Hannah Bull, Michèle Gouiffès, Denis Beautemps, Thomas Hueber, et al.. Mediapi-RGB: Enabling Technological Breakthroughs in French Sign Language (LSF) Research through an Extensive Video-Text Corpus. VISAPP 2024 – 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, Feb 2024, Rome, Italy. ⟨10.5220/0012372600003660⟩. ⟨hal-04494094⟩

    AMI (Architectures et modèles pour l'Interaction), STL

    Available in free access

  • Conference paper

    Aurélie Bugeau, Anne-Laure Ligozat. Analysing ICT in prospective scenarios to help reveal undone computer science. Undone Computer Science conference, Feb 2024, Nantes, France. ⟨hal-04486589⟩

    STL

  • Journal article

    Julien Lefevre, Aurélie Bugeau, Jacques Combaz, Laurent Lefèvre, Anne-Laure Ligozat, et al.. Impacts environnementaux de l’IA : quels réels bénéfices ?. Collection numérique de l’AMUE, Agence de mutualisation des universités et établissements d’enseignement supérieur, 2023. ⟨hal-04486682⟩

    STL

    Available in free access

  • Book chapter

    Nicholas Asher, Pierre Zweigenbaum. Artificial Intelligence and Language. Pierre Marquis; Odile Papini; Henri Prade. A Guided Tour of Artificial Intelligence Research, III: Interfaces and Applications of Artificial Intelligence (chapter 4), Springer International Publishing, pp.117-145, 2020, 978-3-030-06169-2. ⟨10.1007/978-3-030-06170-8_4⟩. ⟨hal-04483086⟩

    ILES, STL

  • Proceedings

    Reinhard Rapp, Pierre Zweigenbaum, Serge Sharoff. Proceedings of the 13th Workshop on Building and Using Comparable Corpora. LREC 2020, 2020, 979-10-95546-42-9. ⟨hal-04482188⟩

    ILES, STL

  • Conference paper

    Rabab Alkhalifa, Iman Bilal, Hsuvas Borkakoty, Jose Camacho-Collados, Romain Deveaud, et al.. Overview of the CLEF-2023 LongEval Lab on Longitudinal Evaluation of Model Performance. CLEF 2023: Experimental IR Meets Multilinguality, Multimodality, and Interaction, Sep 2023, Thessaloniki, Greece. pp.440-458, ⟨10.1007/978-3-031-42448-9_28⟩. ⟨hal-04475726⟩

    ILES, STL

  • Conference paper

    Fatima Hamlaoui, Emmanuel-Moselly Makasso, Markus Müller, Jonas Engelmann, Gilles Adda, et al.. BULBasaa: A Bilingual Bàsàá-French Speech Corpus for the Evaluation of Language Documentation Tools. LREC 2018, European Language Resources Association (ELRA), May 2018, Miyazaki, Japan. ⟨hal-04466108⟩

    STL

    Available in free access

  • Conference paper

    Yuming Zhai, Gabriel Illouz, Anne Vilnat. Detecting Non-literal Translations by Fine-tuning Cross-lingual Pre-trained Language Models. 28th International Conference on Computational Linguistics (COLING), Dec 2020, Barcelona (online), Spain. pp.5944-5956, ⟨10.18653/v1/2020.coling-main.522⟩. ⟨hal-04468022⟩

    ILES, STL

    Available in free access

  • Journal article

    Surya Roca, Sophie Rosset, José García, Álvaro Alesanco. A Study on the Impacts of Slot Types and Training Data on Joint Natural Language Understanding in a Spanish Medication Management Assistant Scenario. Sensors, 2022, 22 (6), pp.2364. ⟨10.3390/s22062364⟩. ⟨hal-04465686⟩

    STL

    Available in free access

  • Conference paper

    Laura Spinu, Ioana Vasilescu, Lori Lamel, Jason Lilley. Voicing neutralization in Romanian fricatives across different speech styles. Interspeech, ISCA, Sep 2022, Incheon, South Korea. pp.1342-1346, ⟨10.21437/interspeech.2022-10716⟩. ⟨hal-04465920⟩

    STL, TLP

    Available in free access

  • Proceedings

    Christophe Servan, Anne Vilnat. Actes de CORIA-TALN 2023. Actes de la 30e Conférence sur le Traitement Automatique des Langues Naturelles (TALN) : volume 5 : démonstrations. CORIA – TALN 2023, 2023. ⟨hal-04462998⟩

    STL

    Available in free access

  • Proceedings

    Christophe Servan, Anne Vilnat. Actes de CORIA-TALN 2023. Actes de la 30e Conférence sur le Traitement Automatique des Langues Naturelles (TALN) : volume 4 : articles déjà soumis ou acceptés en conférence internationale. CORIA – TALN 2023, 2023. ⟨hal-04462975⟩

    STL

    Available in free access

  • Proceedings

    Christophe Servan, Anne Vilnat. Actes de CORIA-TALN 2023. Actes de la 30e Conférence sur le Traitement Automatique des Langues Naturelles (TALN) : volume 6 : projets. CORIA – TALN 2023, 2023. ⟨hal-04463005⟩

    STL

    Available in free access

  • Proceedings

    Christophe Servan, Anne Vilnat. Actes de CORIA-TALN 2023. Actes de la 30e Conférence sur le Traitement Automatique des Langues Naturelles (TALN) : volume 3 : prises de position en TAL. CORIA – TALN 2023, 2023. ⟨hal-04462921⟩

    STL

    Available in free access

  • Proceedings

    Christophe Servan, Anne Vilnat. Actes de la 30e Conférence sur le Traitement Automatique des Langues Naturelles (TALN) : volume 2 : travaux de recherche originaux – articles courts. CORIA – TALN 2023, 2023. ⟨hal-04462841⟩

    STL

    Available in free access

  • Proceedings

    Christophe Servan, Anne Vilnat. Actes de la 30e Conférence sur le Traitement Automatique des Langues Naturelles (TALN) : volume 1 : travaux de recherche originaux – articles longs. CORIA – TALN 2023, 2023. ⟨hal-04462825⟩

    STL

    Available in free access

  • Book

    Serge Sharoff, Reinhard Rapp, Pierre Zweigenbaum. Building and Using Comparable Corpora for Multilingual Natural Language Processing. Springer Cham, 2023, Synthesis Lectures on Human Language Technologies, Graeme Hirst, 978-3-031-31383-7. ⟨10.1007/978-3-031-31384-4⟩. ⟨hal-04470213⟩

    STL

  • Book chapter

    Serge Sharoff, Reinhard Rapp, Pierre Zweigenbaum. Basic Principles of Cross-Lingual Models. Building and Using Comparable Corpora for Multilingual Natural Language Processing, Springer International Publishing, pp.9-16, 2023, Synthesis Lectures on Human Language Technologies, ⟨10.1007/978-3-031-31384-4_2⟩. ⟨hal-04465490⟩

    STL

  • Conference paper

    Bernd Dudzik, Tiffany Matej Hrkalovic, Dennis Küster, David St-Onge, Felix Putze, et al.. The 5th Workshop on Modeling Socio-Emotional and Cognitive Processes from Multimodal Data in the Wild (MSECP-Wild). ICMI ’23: International Conference On Multimodal Interaction, Oct 2023, Paris, France. pp.828-829, ⟨10.1145/3577190.3616883⟩. ⟨hal-04465456⟩

    STL

    Available in free access

  • Conference paper

    John Mcdonald, Michael Filhol, Camille Challant. Geometric Modifications of Gestures in Sign Languages. International Society for Gesture Studies conference, Jul 2022, Chicago, United States. ⟨hal-03721241⟩

    STL

  • Conference paper

    Adam Lion-Bouton, Yağmur Öztürk, Agata Savary, Jean-Yves Antoine. Evaluating Diversity of Multiword Expressions in Annotated Text. International Committee on Computational Linguistics (COLING), International Committee on Computational Linguistics, Oct 2022, Gyeongju, South Korea. pp.3285-3295. ⟨hal-04468662⟩

    STL

    Available in free access

  • Journal article

    S Cardoso, X Aimé, V Meininger, D Grabli, Lf Melo Mora, et al.. A Modular Ontology for Modeling Service Provision in a Communication Network for Coordination of Care. Studies in Health Technology and Informatics, 2018, 247, pp.890-894. ⟨hal-02481869⟩

    STL

  • Conference paper

    Plinio Barbosa, Philippe Boula de Mareüil. Imitating Broadcast News Style: Commonalities and Differences Between French and Brazilian Professionals. International Conference on Computational Processing of the Portuguese Language (PROPOR 2018), Sep 2018, Canela, Brazil. pp.419-428, ⟨10.1007/978-3-319-99722-3_42⟩. ⟨hal-04466213⟩

    STL, TLP

  • Journal article

    Christopher Norman, Elizabeth Gargon, Mariska Leeflang, Aurélie Névéol, Paula Williamson. Evaluation of an automatic article selection method for timelier updates of the Comet Core Outcome Set database. Database – The journal of Biological Databases and Curation, 2019, 2019, ⟨10.1093/database/baz109⟩. ⟨hal-04466023⟩

    ILES, STL

    Available in free access

  • Conference paper

    Patricia Chiril, Farah Benamara, Véronique Moriceau, Kumar Abhishek. The binary trio at SemEval-2019 Task 5: Multitarget Hate Speech Detection in Tweets. 13th International Workshop on Semantic Evaluation (SemEval 2019), Jun 2019, Minneapolis, United States. pp.489-493, ⟨10.18653/v1/S19-2087⟩. ⟨hal-02951036⟩

    STL

    Available in free access

  • Preprint, working paper

    Marcely Zanon, Bolaji Yusuf, Lucas Ondel, Aline Villavicencio, Laurent Besacier. Unsupervised Word Segmentation from Discrete Speech Units in Low-Resource Settings. 2021. ⟨hal-03477951⟩

    STL

  • Conference paper

    Elena Knyazeva, Philippe Boula de Mareüil, Frédéric Vernier. Aesop’s Fable “The North Wind and the Sun” Used as a Rosetta Stone to Extract and Map Spoken Words in Under-resourced Languages. LREC 2022 – 13th Conference on Language Resources and Evaluation, ELRA, Jun 2022, Marseille, France. pp.2072-2079. ⟨hal-04465840⟩

    AVIZ, STL

    Available in free access

  • Conference paper

    Rachel Bawden, Marie-Amélie Botalla, Kim Gerdes, Sylvain Kahane. Correcting and Validating Syntactic Dependency in the Spoken French Treebank Rhapsodie. Proceedings of the 9th Language Resources and Evaluation Conference (LREC), 2014, Iceland. pp.1-6. ⟨halshs-01011059⟩

    STL

    Available in free access

  • Journal article

    Nesrine Fourati, C. Pelachaud. Perception of Emotions and Body Movement in the Emilya Database. IEEE Transactions on Affective Computing, 2016. ⟨hal-02287454⟩

    STL

  • Journal article

    Nesrine Fourati, Catherine Pelachaud. Perception of Emotions and Body Movement in the Emilya Database. IEEE Transactions on Affective Computing, 2018, 9 (1), pp.90-101. ⟨10.1109/TAFFC.2016.2591039⟩. ⟨hal-02382677⟩

    STL

  • Book chapter

    Joseph J Mariani, Zygmunt Vetulani. Preface. Zygmunt Vetulani, Patrick Paroubek, Marek Kubis. Human Language Technology. Challenges for Computer Science and Linguistics 7th Language and Technology Conference, LTC 2015, Poznań, Poland, November 27-29, 2015, Revised Selected Papers, Springer, 2018, Lecture Notes in Computer Science, 978-3-319-93782-3. ⟨10.1007/978-3-319-93782-3⟩. ⟨hal-04455017⟩

    STL

    Available in free access

  • Book chapter

    Kim Gerdes, Sylvain Kahane, Rachel Bawden, Julie Beliao, Éric Villemonte de La Clergerie, et al.. Annotation tools for syntax. Rhapsodie: A Prosodic and Syntactic Treebank for Spoken French, John Benjamins, 2019, ⟨10.1075/scl.89.08ger⟩. ⟨hal-02450311⟩

    STL

  • Conference paper

    Tristan Luiggi, Vincent Guigue, Laure Soulier, Siwar Jendoubi, Aurelien Baelde. Dynamic Named Entity Recognition. 38th ACM/SIGAPP Symposium on Applied Computing, Mar 2023, Tallinn, Estonia. pp.890-897, ⟨10.1145/3555776.3577603⟩. ⟨hal-04284318⟩

    STL

    Available in free access

  • Journal article

    Annelies Braffort. Compte-rendu de l’ouvrage “La langue des signes. Statuts linguistiques et institutionnels”, numéro de Langue française, n° 137, C. Cuxac (dir.). Le Français Moderne – Revue de linguistique Française, 2004, 2, pp.250-252. ⟨hal-04457633⟩

    STL

  • Book chapter

    Joseph J Mariani, Gilles Adda, Khalid Choukri, Irmgarda Kasinskaite Buddeberg, Hélène Mazo, et al.. Introduction by the Organizers of the thematic tracks on the Achievements and Challenges (Day 2 and Day 3). Proceedings of the Language Technology for All (LT4All) conference, 2020. ⟨hal-04455045⟩

    STL

    Available in free access

  • Conference paper

    Olivier Ridoux, Clément Morand. Extraction dans des textes anciens d’entités nommées de type binômes de la classification linnéenne du vivant : une étude de cas. Extraction et Gestion des Connaissances (EGC) 2023, 2023, Lyon, France. ⟨hal-04447919⟩

    STL

    Available in free access

  • Conference paper

    Anisia Popescu, Lori Lamel, Ioana Vasilescu. Typological classification of European Portuguese fricatives: a cross-language forced alignment and pronunciation variants study. 6th International Conference on Natural Language and Speech Processing (ICNLSP 2023), Dec 2023, Trento, Italy. pp.239-243. ⟨hal-04451618⟩

    STL

    Available in free access

  • Conference paper

    Anna Koroleva, Patrick Paroubek. Annotating Spin in Biomedical Scientific Publications: the case of Randomized Controlled Trials (RCTs). Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), ELRA, May 2018, Miyazaki, Japan. ⟨hal-04449090⟩

    STL

    Available in free access

  • Conference paper

    Anisia Popescu, Mathilde Hutin, Ioana Vasilescu, Lori Lamel, Martine Adda-Decker. Stop devoicing and place of articulation: a cross-linguistic study using large-scale corpora. 20th International Congress of Phonetic Sciences (ICPhS2023), Aug 2023, Prague, Czech Republic. pp.3186-3190. ⟨hal-04452900⟩

    STL

  • Thesis

    Shu Okabe. Modèles faiblement supervisés pour la documentation automatique des langues. Informatique et langage [cs.CL]. Université Paris-Saclay, 2023. French. ⟨NNT : 2023UPASG091⟩. ⟨tel-04453579⟩

    STL

    Available in free access

  • Conference paper

    Brigitte Garcia, Dominique Boutet, Annelies Braffort, Patrice Dalle. Sign Language (SL) in Graphical Form : Methodology, modellisation and representations for gestural communication. Sign Language (SL) in Graphical Form : Methodology, modellisation and representations for gestural communication, Jun 2005, Lyon, France. http://gesture-lyon2005.ens-lsh.fr/article.php3?id_article=230. ⟨halshs-00165911⟩

    ILES, STL

  • Book chapter

    Marc Evrard. Transformers in Automatic Speech Recognition. Human-Centered Artificial Intelligence, 13500, Springer International Publishing, pp.123-139, 2023, Lecture Notes in Computer Science, ⟨10.1007/978-3-031-24349-3_8⟩. ⟨hal-04259186⟩

    STL

  • Conference paper

    Emmett Strickland, Dana Aubakirova, Dorin Doncenco, Diego Torres, Marc Evrard. NaijaTTS: A pitch-controllable TTS model for Nigerian Pidgin. ISCA Speech Synthesis Workshop, Aug 2023, Grenoble, France. ⟨hal-04183972⟩

    STL

    Available in free access

  • Conference paper

    Anisia Popescu, Lori Lamel, Ioana Vasilescu. Using cross-language automatic speech recognition and pronunciation variants to investigate voicing in European Portuguese fricatives. Architectures and Mechanisms for Language Processing (AMLAP23), Aug 2023, San Sebastian, Spain. ⟨hal-04451669⟩

    STL

  • Conference paper

    Anisia Popescu, Ioana Chitoran. Laterals in simplex vs. complex syllable codas: a comparison of four languages. 13th International Seminar on Speech Production (ISSP2024), May 2024, Autrans (Grenoble), France. ⟨hal-04451665⟩

    STL

  • Conference paper

    Anisia Popescu, Lori Lamel, Ioana Vasilescu, Laurence Devillers. An investigation of syllable position /l/ allophony in L2 English learners using Word Error Rate as an index of phonetic proficiency. 13th International Seminar on Speech Production (ISSP2024), May 2024, Autrans, France. ⟨hal-04451662⟩

    STL

  • Journal article

    Anna Koroleva, Sanjay Kamath, Patrick Paroubek. Measuring semantic similarity of clinical trial outcomes using deep pre-trained language representations. Journal of Biomedical Informatics, 2019, 100, pp.100058. ⟨10.1016/j.yjbinx.2019.100058⟩. ⟨hal-04449449⟩

    LaHDAK, STL

    Available in free access

  • Conference paper

    Anna Koroleva, Patrick Paroubek. Demonstrating ConstruKT, a text annotation toolkit for generalized linguistic contructions applied to communication spin. Human Language Technologies as a Challenge for Computer Science and Linguistics – 2019, May 2019, Poznań, Poland. pp.19-20. ⟨hal-04449153⟩

    STL

  • Conference paper

    Patrick Paroubek, Anna Koroleva, Corentin Masson. Analysing clinical trial outcomes in trial registries : towards creating an ontology of clinical trial outcomes. TOTh 2019 – Terminologie & Ontologie : Théories et Applications, Jun 2019, Le Bourget du Lac, France. pp.309-319. ⟨hal-04449764⟩

    STL

  • Conference paper

    Anna Koroleva, Patrick Paroubek. Extracting relations between outcomes and significance levels in Randomized Controlled Trials (RCTs) publications. Proceedings of the 18th BioNLP Workshop and Shared Task, Aug 2019, Florence, Italy. pp.359-369, ⟨10.18653/v1/W19-5038⟩. ⟨hal-04449412⟩

    STL

    Available in free access

  • Journal article

    Anna Koroleva, Camila Olarte Parra, Patrick Paroubek. On improving the implementation of automatic updating of systematic reviews. JAMIA open, 2019, 2 (4), pp.400-401. ⟨10.1093/jamiaopen/ooz044⟩. ⟨hal-04449475⟩

    ILES, STL

    Available in free access

  • Conference paper

    Anna Koroleva, Sanjay Kamath, Patrick M. M. Bossuyt, Patrick Paroubek. DeSpin: a prototype system for detecting spin in biomedical publications. Proceedings of the BioNLP 2020 workshop, SIG-BIOMED, Jul 2020, online (Seattle), United States. pp.49-59, ⟨10.18653/v1/2020.bionlp-1.5⟩. ⟨hal-04449382⟩

    LaHDAK, STL

    Available in free access

  • Journal article

    Anna Koroleva, Patrick Paroubek. On the Contribution of Specific Entity Detection in Comparative Constructions to Automatic Spin Detection in Biomedical Scientific Publications. Lecture Notes in Computer Science, 2020, 12598, pp.304-317. ⟨10.1007/978-3-030-66527-2_22⟩. ⟨hal-04449857⟩

    STL

  • Journal article

    Gauthier Roussilhe, Anne-Laure Ligozat, Sophie Quinton. A long road ahead: a review of the state of knowledge of the environmental effects of digitization. Current Opinion in Environmental Sustainability, 2023, 62, pp.101296. ⟨10.1016/j.cosust.2023.101296⟩. ⟨hal-04448683⟩

    STL

    Available in free access

  • Conference paper

    Aman Berhe, Camille Guinaudeau, Claude Barras. Détection de scènes remarquables dans un contexte de séries TV. Conférence en Recherche d’Information et Applications, 2021, Grenoble, France. ⟨hal-04445565⟩

    STL

    Available in free access

  • Journal article

    Albert Rilliard, Christophe d’Alessandro, Marc Evrard. Paradigmatic variation of vowels in expressive speech: Acoustic description and dimensional analysis. Journal of the Acoustical Society of America, 2018, 143 (1), pp.109-122. ⟨10.1121/1.5018433⟩. ⟨hal-01914497⟩

    STL

    Available in free access

  • Journal article

    Mathieu Avanzi, Philippe Boula de Mareüil. Peut-on identifier perceptivement huit accents régionaux en français européen ? La réponse des sciences participatives. Glottopol : Revue de sociolinguistique en ligne, 2019, 31, pp.1-21. ⟨hal-03321605⟩

    STL

    Available in free access

  • Book chapter

    Philippe Boula de Mareüil, Valentina De Iacovo, Antonio Romano, Frédéric Vernier. Un atlante sonoro delle lingue di Francia e d’Italia: focus sulle parlate liguri. Fiorenzo Toso. Il patrimonio linguistico storico della Liguria. Attualità e futuro, Insedicesimo, pp.33-46, 2019. ⟨hal-04441432⟩

  • Journal article

    Yaru Wu, Martine Adda-Decker, Lori Lamel. Schwa Deletion in Word-Initial Syllables of Polysyllabic Words. Journal of Monolingual and Bilingual Speech, 2020, 2 (2), ⟨10.1558/jmbs.17311⟩. ⟨hal-04442984⟩

    STL, TLP

  • Conference poster

    Hélène Bonneau-Maynard. Quelles sont les bonnes pratiques pour que mon cours soit accessible au plus grand nombre ? Journée Initiatives Pédagogiques JIP 2020-2021, Feb 2021, Orsay, France. ⟨hal-04417697⟩

    STL, TLP

  • Proceedings

    Lori Lamel, Hynek Hermansky, Lukáš Burget, Odette Scharenborg, Petr Motlicek. Proceedings of the 22nd Annual Conference of the International Speech Communication Association: Interspeech 2021. Interspeech 2021, ISCA, 2021, ⟨10.21437/Interspeech.2021⟩. ⟨hal-04442989⟩

    STL

  • Proceedings

    Lori Lamel, Mark Hasegawa-Johnson, John H. L. Hansen, Kyogu Lee, Hanseok Ko, et al. Proceedings of the 23rd Annual Conference of the International Speech Communication Association: Interspeech 2022. Interspeech 2022, 2022, ⟨10.21437/Interspeech.2022⟩. ⟨hal-04442990⟩

    STL

    Available in free access