STL

Language Sciences and Technologies

Coordination: Aurélie Névéol

The Department of Language Sciences and Technologies studies fundamental questions relating to linguistic systems by exploiting large corpora collected, annotated and enriched in an unsupervised or semi-supervised way by statistical learning models adapted to the linguistic material.

These models make it possible to study how languages function and how they vary (phonetically and phonologically, morphologically and lexically, syntactically and semantically), whether synchronically or diachronically, diaphasically or diatopically, and to raise questions about their acquisition as first or second languages. Finally, the department develops major language-processing applications (speech recognition, machine translation, information retrieval, conversational agents, etc.), which are increasingly important for society (safeguarding endangered languages, providing tools for people with disabilities, helping to process medical information and knowledge) and for ethics.

This approach to language and languages covers a broad spectrum, from the most fundamental to the most applied research, across a wide variety of media (newspapers, social media, video, telephone, etc.) and all modalities (written, spoken and signed).

This research is highly multidisciplinary, bringing together diverse communities from the fields of computer science, engineering and the humanities.

Recent Publications

  • Journal article

    Luma da Silva Miranda, João Antônio de Moraes, Albert Rilliard. Visual channel facilitates the comprehension of the intonation of Brazilian Portuguese wh-questions and wh-exclamations: evidence from congruent and incongruent stimuli. Language and Cognition, 2024, pp.1-21. ⟨10.1017/langcog.2024.16⟩. ⟨hal-04538371⟩

    STL

    Open access

  • Preprint, working paper

    Mathilde Aguiar, Pierre Zweigenbaum, Nona Naderi. SEME at SemEval-2024 Task 2: Comparing Masked and Generative Language Models on Natural Language Inference for Clinical Trials. 2024. ⟨hal-04536273⟩

    STL

    Open access

  • Conference paper

    Djegdjiga Amazouz, Martine Adda-Decker, Lori Lamel. Variation du voisement des occlusives orales en code-switching: analyses par ABX automatique et mesures acoustiques. Journées d’Études sur la Parole – JEP2022, Jun 2022, Noirmoutier, France. ⟨hal-03703081⟩

    STL

    Open access

  • Preprint, working paper

    Mathilde Aguiar, Pierre Zweigenbaum, Nona Naderi. SEME at SemEval-2024 Task 2: Comparing Masked and Generative Language Models on Natural Language Inference for Clinical Trials. 2024. ⟨hal-04536600⟩

    STL

  • Conference paper

    Karën Fort, Laura Alonso Alemany, Luciana Benotti, Julien Bezançon, Claudia Borg, et al. Your Stereotypical Mileage may Vary: Practical Challenges of Evaluating Biases in Multiple Languages and Cultural Contexts. The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, May 2024, Turin, Italy. ⟨hal-04537096⟩

    STL

    Open access

  • Conference paper

    Paul Lerner, Cyril Grouin. INCLURE: a Dataset and Toolkit for Inclusive French Translation. The 17th Workshop on Building and Using Comparable Corpora (BUCC @ LREC 2024), 2024, Turin, Italy. ⟨hal-04531938⟩

    STL

    Open access

  • Proceedings

    Karën Fort, Aurélie Névéol. Ethics and NLP: 10 years after. Journée d’études ATALA “éthique et TAL : 10 ans après”, 2024. ⟨hal-04533870⟩

    STL

    Open access

  • Conference paper

    Paul Lerner, Olivier Ferret, Camille Guinaudeau. Cross-modal Retrieval for Knowledge-based Visual Question Answering. 46th European Conference on Information Retrieval (ECIR 2024), 2024, Glasgow, United Kingdom. ⟨hal-04384431⟩

    STL

    Open access

  • Conference paper

    Tomohiro Nishiyama, Lisa Raithel, Roland Roller, Pierre Zweigenbaum, Eiji Aramaki. Assessing Authenticity and Anonymity of Synthetic User-generated Content in the Medical Domain. Workshop on Computational Approaches to Language Data Pseudonymization (CALD-pseudo), Mar 2024, St. Julian’s, Malta. pp.8-17. ⟨hal-04528240⟩

    STL

    Open access

  • Conference paper

    Nadège Alavoine, Gaëlle Laperriere, Christophe Servan, Sahar Ghannay, Sophie Rosset. New Semantic Task for the French Spoken Language Understanding MEDIA Benchmark. The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), May 2024, Torino, Italy. ⟨hal-04523286⟩

    STL

    Open access

  • Conference paper

    Nesrine Bannour, Christophe Servan, Aurélie Névéol, Xavier Tannier. A Benchmark Evaluation of Clinical Named Entity Recognition in French. The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), May 2024, Torino, Italy. ⟨hal-04523267⟩

    STL

    Open access

  • Conference paper

    Christophe Servan, Sahar Ghannay, Sophie Rosset. mALBERT: Is a Compact Multilingual BERT Model Still Worth It? The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, May 2024, Torino, Italy. ⟨hal-04520797⟩

    STL

    Open access

  • Conference paper

    Aaron Boussidan, Fanny Ducel, Aurélie Névéol, Karën Fort. What ChatGPT tells us about ourselves. Journée d’étude Éthique et TAL 2024, Apr 2024, Nancy, France. ⟨hal-04521121⟩

    STL

    Open access

  • Conference paper

    Thierry Hamon, Natalia Grabar. Automatic Prediction of Semantic Labels for French Medical Terms. Medical Informatics Europe conference (MIE2022), May 2022, Nice, France. pp.868-869, ⟨10.3233/SHTI220610⟩. ⟨hal-04519905⟩

    STL

    Open access

  • Conference paper

    Pierre Lepagnol, Thomas Gerald, Sahar Ghannay, Christophe Servan, Sophie Rosset. Small Language Models are Good Too: An Empirical Study of Zero-Shot Classification. LREC-COLING 2024, May 2024, Turin, Italy. ⟨hal-04519930v1⟩

    STL

    Open access

  • Conference paper

    Mérième Bouhandi, Emmanuel Morin, Thierry Hamon. Graph Neural Networks for Adapting Off-the-shelf General Domain Language Models to Low-Resource Specialised Domains. 2nd Workshop on Deep Learning on Graphs for Natural Language Processing (DLG4NLP 2022), ACL, Jul 2022, Seattle, Washington, United States. pp.36-42, ⟨10.18653/v1/2022.dlg4nlp-1.5⟩. ⟨hal-04517190⟩

    STL

    Open access

  • Conference poster

    Elise Lincker, Léa Pacini, Olivier Pons, Camille Guinaudeau, Jérôme Dupire, et al. MALIN : MAnuels scoLaires INclusifs : Accessibilité numérique des manuels scolaires. Colloque Handiversité 2023 – L’innovation pour le partage, Apr 2023, Gif-sur-Yvette, France. ⟨hal-04410349⟩

    STL

    Open access

  • Journal article

    Angèle Gayet-Ageron, Khaoula Ben Messaoud, Mark Oliver Richards, Cyril Jaksic, Julien Gobeill, et al. Assessment of gender and geographical bias in the editorial decision-making process of biomedical journals: A Case-Control study. Medrxiv : the Preprint Server For Health Sciences, 2024, ⟨10.1101/2024.03.15.24304220⟩. ⟨hal-04510221⟩

    STL

    Open access

  • Journal article

    Clement Bernard, Guillaume Postic, Sahar Ghannay, Fariza Tahi. RNAdvisor: a comprehensive benchmarking tool for the measure and prediction of RNA structural model quality. Briefings in Bioinformatics, 2024, 25 (2), pp.bbae064. ⟨10.1093/bib/bbae064⟩. ⟨hal-04508073⟩

    STL

    Open access

  • Journal article

    Anne-Laure Ligozat, Christophe Brun, Benjamin Demirdjian, Guillaume Gouget, Emilie Jardé, et al. Setting Climate Targets: The Case of Higher Education and Research. BioRxiv, 2024, ⟨10.1101/2024.03.11.584380⟩. ⟨hal-04505199⟩

    STL

    Open access

  • Conference paper

    Yanis Ouakrim, Hannah Bull, Michèle Gouiffès, Denis Beautemps, Thomas Hueber, et al. Mediapi-RGB: Enabling Technological Breakthroughs in French Sign Language (LSF) Research through an Extensive Video-Text Corpus. VISAPP 2024 – 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, Feb 2024, Rome, Italy. ⟨10.5220/0012372600003660⟩. ⟨hal-04494094⟩

    AMI (Architectures et modèles pour l'Interaction), STL

    Open access

  • Conference paper

    Aurélie Bugeau, Anne-Laure Ligozat. Analysing ICT in prospective scenarios to help reveal undone computer science. Undone Computer Science conference, Feb 2024, Nantes, France. ⟨hal-04486589⟩

    STL

  • Journal article

    Julien Lefevre, Aurélie Bugeau, Jacques Combaz, Laurent Lefèvre, Anne-Laure Ligozat, et al. Impacts environnementaux de l’IA : quels réels bénéfices ? Collection numérique de l’AMUE, Agence de mutualisation des universités et établissements d’enseignement supérieur, 2023. ⟨hal-04486682⟩

    STL

    Open access

  • Book chapter

    Nicholas Asher, Pierre Zweigenbaum. Artificial Intelligence and Language. Pierre Marquis; Odile Papini; Henri Prade. A Guided Tour of Artificial Intelligence Research, III: Interfaces and Applications of Artificial Intelligence (chapter 4), Springer International Publishing, pp.117-145, 2020, 978-3-030-06169-2. ⟨10.1007/978-3-030-06170-8_4⟩. ⟨hal-04483086⟩

    ILES, STL

  • Proceedings

    Reinhard Rapp, Pierre Zweigenbaum, Serge Sharoff. Proceedings of the 13th Workshop on Building and Using Comparable Corpora. LREC 2020, 2020, 979-10-95546-42-9. ⟨hal-04482188⟩

    ILES, STL

  • Conference paper

    Rabab Alkhalifa, Iman Bilal, Hsuvas Borkakoty, Jose Camacho-Collados, Romain Deveaud, et al. Overview of the CLEF-2023 LongEval Lab on Longitudinal Evaluation of Model Performance. CLEF 2023: Experimental IR Meets Multilinguality, Multimodality, and Interaction, Sep 2023, Thessaloniki, Greece. pp.440-458, ⟨10.1007/978-3-031-42448-9_28⟩. ⟨hal-04475726⟩

    ILES, STL

  • Conference paper

    Fatima Hamlaoui, Emmanuel-Moselly Makasso, Markus Müller, Jonas Engelmann, Gilles Adda, et al. BULBasaa: A Bilingual Bàsàá-French Speech Corpus for the Evaluation of Language Documentation Tools. LREC 2018, European Language Resources Association (ELRA), May 2018, Miyazaki, Japan. ⟨hal-04466108⟩

    STL

    Open access

  • Conference paper

    Yuming Zhai, Gabriel Illouz, Anne Vilnat. Detecting Non-literal Translations by Fine-tuning Cross-lingual Pre-trained Language Models. 28th International Conference on Computational Linguistics (COLING), Dec 2020, Barcelona (online), Spain. pp.5944-5956, ⟨10.18653/v1/2020.coling-main.522⟩. ⟨hal-04468022⟩

    ILES, STL

    Open access

  • Journal article

    Surya Roca, Sophie Rosset, José García, Álvaro Alesanco. A Study on the Impacts of Slot Types and Training Data on Joint Natural Language Understanding in a Spanish Medication Management Assistant Scenario. Sensors, 2022, 22 (6), pp.2364. ⟨10.3390/s22062364⟩. ⟨hal-04465686⟩

    STL

    Open access

  • Conference paper

    Laura Spinu, Ioana Vasilescu, Lori Lamel, Jason Lilley. Voicing neutralization in Romanian fricatives across different speech styles. Interspeech, ISCA, Sep 2022, Incheon, South Korea. pp.1342-1346, ⟨10.21437/interspeech.2022-10716⟩. ⟨hal-04465920⟩

    STL, TLP

    Open access

  • Proceedings

    Christophe Servan, Anne Vilnat. Actes de CORIA-TALN 2023. Actes de la 30e Conférence sur le Traitement Automatique des Langues Naturelles (TALN) : volume 5 : démonstrations. CORIA – TALN 2023, 2023. ⟨hal-04462998⟩

    STL

    Open access

  • Proceedings

    Christophe Servan, Anne Vilnat. Actes de CORIA-TALN 2023. Actes de la 30e Conférence sur le Traitement Automatique des Langues Naturelles (TALN) : volume 4 : articles déjà soumis ou acceptés en conférence internationale. CORIA – TALN 2023, 2023. ⟨hal-04462975⟩

    STL

    Open access

  • Proceedings

    Christophe Servan, Anne Vilnat. Actes de CORIA-TALN 2023. Actes de la 30e Conférence sur le Traitement Automatique des Langues Naturelles (TALN) : volume 6 : projets. CORIA – TALN 2023, 2023. ⟨hal-04463005⟩

    STL

    Open access

  • Proceedings

    Christophe Servan, Anne Vilnat. Actes de CORIA-TALN 2023. Actes de la 30e Conférence sur le Traitement Automatique des Langues Naturelles (TALN) : volume 3 : prises de position en TAL. CORIA – TALN 2023, 2023. ⟨hal-04462921⟩

    STL

    Open access

  • Proceedings

    Christophe Servan, Anne Vilnat. Actes de la 30e Conférence sur le Traitement Automatique des Langues Naturelles (TALN) : volume 2 : travaux de recherche originaux – articles courts. CORIA – TALN 2023, 2023. ⟨hal-04462841⟩

    STL

    Open access

  • Proceedings

    Christophe Servan, Anne Vilnat. Actes de la 30e Conférence sur le Traitement Automatique des Langues Naturelles (TALN) : volume 1 : travaux de recherche originaux – articles longs. CORIA – TALN 2023, 2023. ⟨hal-04462825⟩

    STL

    Open access

  • Book

    Serge Sharoff, Reinhard Rapp, Pierre Zweigenbaum. Building and Using Comparable Corpora for Multilingual Natural Language Processing. Springer Cham, 2023, Synthesis Lectures on Human Language Technologies, Graeme Hirst, 978-3-031-31383-7. ⟨10.1007/978-3-031-31384-4⟩. ⟨hal-04470213⟩

    STL

  • Book chapter

    Serge Sharoff, Reinhard Rapp, Pierre Zweigenbaum. Basic Principles of Cross-Lingual Models. Building and Using Comparable Corpora for Multilingual Natural Language Processing, Springer International Publishing, pp.9-16, 2023, Synthesis Lectures on Human Language Technologies, ⟨10.1007/978-3-031-31384-4_2⟩. ⟨hal-04465490⟩

    STL

  • Conference paper

    Bernd Dudzik, Tiffany Matej Hrkalovic, Dennis Küster, David St-Onge, Felix Putze, et al. The 5th Workshop on Modeling Socio-Emotional and Cognitive Processes from Multimodal Data in the Wild (MSECP-Wild). ICMI ’23: International Conference On Multimodal Interaction, Oct 2023, Paris, France. pp.828-829, ⟨10.1145/3577190.3616883⟩. ⟨hal-04465456⟩

    STL

    Open access

  • Conference paper

    John McDonald, Michael Filhol, Camille Challant. Geometric Modifications of Gestures in Sign Languages. International Society for Gesture Studies conference, Jul 2022, Chicago, United States. ⟨hal-03721241⟩

    STL

  • Conference paper

    Adam Lion-Bouton, Yağmur Öztürk, Agata Savary, Jean-Yves Antoine. Evaluating Diversity of Multiword Expressions in Annotated Text. International Conference on Computational Linguistics (COLING), International Committee on Computational Linguistics, Oct 2022, Gyeongju, South Korea. pp.3285-3295. ⟨hal-04468662⟩

    STL

    Open access

  • Journal article

    S Cardoso, X Aimé, V Meininger, D Grabli, Lf Melo Mora, et al. A Modular Ontology for Modeling Service Provision in a Communication Network for Coordination of Care. Studies in Health Technology and Informatics, 2018, 247, pp.890-894. ⟨hal-02481869⟩

    STL

  • Conference paper

    Plinio Barbosa, Philippe Boula de Mareüil. Imitating Broadcast News Style: Commonalities and Differences Between French and Brazilian Professionals. International Conference on Computational Processing of the Portuguese Language (PROPOR 2018), Sep 2018, Canela, Brazil. pp.419-428, ⟨10.1007/978-3-319-99722-3_42⟩. ⟨hal-04466213⟩

    STL, TLP

  • Journal article

    Christopher Norman, Elizabeth Gargon, Mariska Leeflang, Aurélie Névéol, Paula Williamson. Evaluation of an automatic article selection method for timelier updates of the Comet Core Outcome Set database. Database – The journal of Biological Databases and Curation, 2019, 2019, ⟨10.1093/database/baz109⟩. ⟨hal-04466023⟩

    ILES, STL

    Open access

  • Conference paper

    Patricia Chiril, Farah Benamara, Véronique Moriceau, Kumar Abhishek. The binary trio at SemEval-2019 Task 5: Multitarget Hate Speech Detection in Tweets. 13th International Workshop on Semantic Evaluation (SemEval 2019), Jun 2019, Minneapolis, United States. pp.489-493, ⟨10.18653/v1/S19-2087⟩. ⟨hal-02951036⟩

    STL

    Open access

  • Preprint, working paper

    Marcely Zanon, Bolaji Yusuf, Lucas Ondel, Aline Villavicencio, Laurent Besacier. Unsupervised Word Segmentation from Discrete Speech Units in Low-Resource Settings. 2021. ⟨hal-03477951⟩

    STL

  • Conference paper

    Elena Knyazeva, Philippe Boula de Mareüil, Frédéric Vernier. Aesop’s Fable “The North Wind and the Sun” Used as a Rosetta Stone to Extract and Map Spoken Words in Under-resourced Languages. LREC 2022 – 13th Conference on Language Resources and Evaluation, ELRA, Jun 2022, Marseille, France. pp.2072-2079. ⟨hal-04465840⟩

    AVIZ, STL

    Open access

  • Conference paper

    Rachel Bawden, Marie-Amélie Botalla, Kim Gerdes, Sylvain Kahane. Correcting and Validating Syntactic Dependency in the Spoken French Treebank Rhapsodie. Proceedings of the 9th Language Resources and Evaluation Conference (LREC), 2014, Iceland. pp.1-6. ⟨halshs-01011059⟩

    STL

    Open access

  • Journal article

    Nesrine Fourati, C. Pelachaud. Perception of Emotions and Body Movement in the Emilya Database. IEEE Transactions on Affective Computing, 2016. ⟨hal-02287454⟩

    STL

  • Journal article

    Nesrine Fourati, Catherine Pelachaud. Perception of Emotions and Body Movement in the Emilya Database. IEEE Transactions on Affective Computing, 2018, 9 (1), pp.90-101. ⟨10.1109/TAFFC.2016.2591039⟩. ⟨hal-02382677⟩

    STL

  • Book chapter

    Joseph J Mariani, Zygmunt Vetulani. Preface. Zygmunt Vetulani, Patrick Paroubek, Marek Kubis. Human Language Technology. Challenges for Computer Science and Linguistics 7th Language and Technology Conference, LTC 2015, Poznań, Poland, November 27-29, 2015, Revised Selected Papers, Springer, 2018, Lecture Notes in Computer Science, 978-3-319-93782-3. ⟨10.1007/978-3-319-93782-3⟩. ⟨hal-04455017⟩

    STL

    Open access

  • Book chapter

    Kim Gerdes, Sylvain Kahane, Rachel Bawden, Julie Beliao, Éric Villemonte de La Clergerie, et al. Annotation tools for syntax. Rhapsodie: A Prosodic and Syntactic Treebank for Spoken French, John Benjamins, 2019, ⟨10.1075/scl.89.08ger⟩. ⟨hal-02450311⟩

    STL

  • Conference paper

    Tristan Luiggi, Vincent Guigue, Laure Soulier, Siwar Jendoubi, Aurelien Baelde. Dynamic Named Entity Recognition. 38th ACM/SIGAPP Symposium on Applied Computing, Mar 2023, Tallinn, Estonia. pp.890-897, ⟨10.1145/3555776.3577603⟩. ⟨hal-04284318⟩

    STL

    Open access

  • Journal article

    Annelies Braffort. Compte-rendu de l’ouvrage “La langue des signes. Statuts linguistiques et institutionnels”, numéro de Langue française, n° 137, C. Cuxac (dir.). Le Français Moderne – Revue de linguistique Française, 2004, 2, pp.250-252. ⟨hal-04457633⟩

    STL

  • Book chapter

    Joseph J Mariani, Gilles Adda, Khalid Choukri, Irmgarda Kasinskaite Buddeberg, Hélène Mazo, et al. Introduction by the Organizers of the thematic tracks on the Achievements and Challenges (Day 2 and Day 3). Proceedings of the Language Technology for All (LT4All) conference, 2020. ⟨hal-04455045⟩

    STL

    Open access

  • Conference paper

    Olivier Ridoux, Clément Morand. Extraction dans des textes anciens d’entités nommées de type binômes de la classification linnéenne du vivant : une étude de cas. Extraction et Gestion des Connaissances (EGC) 2023, 2023, Lyon, France. ⟨hal-04447919⟩

    STL

    Open access

  • Conference paper

    Anisia Popescu, Lori Lamel, Ioana Vasilescu. Typological classification of European Portuguese fricatives: a cross-language forced alignment and pronunciation variants study. 6th International Conference on Natural Language and Speech Processing (ICNLSP 2023), Dec 2023, Trento, Italy. pp.239-243. ⟨hal-04451618⟩

    STL

    Open access

  • Conference paper

    Anna Koroleva, Patrick Paroubek. Annotating Spin in Biomedical Scientific Publications: the case of Randomized Controlled Trials (RCTs). Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), ELRA, May 2018, Miyazaki, Japan. ⟨hal-04449090⟩

    STL

    Open access

  • Conference paper

    Anisia Popescu, Mathilde Hutin, Ioana Vasilescu, Lori Lamel, Martine Adda-Decker. Stop devoicing and place of articulation: a cross-linguistic study using large-scale corpora. 20th International Congress of Phonetic Sciences (ICPhS2023), Aug 2023, Prague, Czech Republic. pp.3186-3190. ⟨hal-04452900⟩

    STL

  • Thesis

    Shu Okabe. Modèles faiblement supervisés pour la documentation automatique des langues. Computation and Language [cs.CL]. Université Paris-Saclay, 2023. In French. ⟨NNT : 2023UPASG091⟩. ⟨tel-04453579⟩

    STL

    Open access

  • Conference paper

    Brigitte Garcia, Dominique Boutet, Annelies Braffort, Patrice Dalle. Sign Language (SL) in Graphical Form : Methodology, modellisation and representations for gestural communication. Sign Language (SL) in Graphical Form : Methodology, modellisation and representations for gestural communication, Jun 2005, Lyon, France. http://gesture-lyon2005.ens-lsh.fr/article.php3?id_article=230. ⟨halshs-00165911⟩

    ILES, STL

  • Book chapter

    Marc Evrard. Transformers in Automatic Speech Recognition. Human-Centered Artificial Intelligence, 13500, Springer International Publishing, pp.123-139, 2023, Lecture Notes in Computer Science, ⟨10.1007/978-3-031-24349-3_8⟩. ⟨hal-04259186⟩

    STL

  • Conference paper

    Emmett Strickland, Dana Aubakirova, Dorin Doncenco, Diego Torres, Marc Evrard. NaijaTTS: A pitch-controllable TTS model for Nigerian Pidgin. ISCA Speech Synthesis Workshop, Aug 2023, Grenoble, France. ⟨hal-04183972⟩

    STL

    Open access

  • Conference paper

    Anisia Popescu, Lori Lamel, Ioana Vasilescu. Using cross-language automatic speech recognition and pronunciation variants to investigate voicing in European Portuguese fricatives. Architectures and Mechanisms for Language Processing (AMLAP23), Aug 2023, San Sebastian, Spain. ⟨hal-04451669⟩

    STL

  • Conference paper

    Anisia Popescu, Ioana Chitoran. Laterals in simplex vs. complex syllable codas: a comparison of four languages. 13th International Seminar on Speech Production (ISSP2024), May 2024, Autrans (Grenoble), France. ⟨hal-04451665⟩

    STL

  • Conference paper

    Anisia Popescu, Lori Lamel, Ioana Vasilescu, Laurence Devillers. An investigation of syllable position /l/ allophony in L2 English learners using Word Error Rate as an index of phonetic proficiency. 13th International Seminar on Speech Production (ISSP2024), May 2024, Autrans, France. ⟨hal-04451662⟩

    STL

  • Journal article

    Anna Koroleva, Sanjay Kamath, Patrick Paroubek. Measuring semantic similarity of clinical trial outcomes using deep pre-trained language representations. Journal of Biomedical Informatics, 2019, 100, pp.100058. ⟨10.1016/j.yjbinx.2019.100058⟩. ⟨hal-04449449⟩

    LaHDAK, STL

    Open access

  • Conference paper

    Anna Koroleva, Patrick Paroubek. Demonstrating ConstruKT, a text annotation toolkit for generalized linguistic constructions applied to communication spin. Human Language Technologies as a Challenge for Computer Science and Linguistics – 2019, May 2019, Poznań, Poland. pp.19-20. ⟨hal-04449153⟩

    STL

  • Conference paper

    Patrick Paroubek, Anna Koroleva, Corentin Masson. Analysing clinical trial outcomes in trial registries : towards creating an ontology of clinical trial outcomes. TOTh 2019 – Terminologie & Ontologie : Théories et Applications, Jun 2019, Le Bourget du Lac, France. pp.309-319. ⟨hal-04449764⟩

    STL

  • Conference paper

    Anna Koroleva, Patrick Paroubek. Extracting relations between outcomes and significance levels in Randomized Controlled Trials (RCTs) publications. Proceedings of the 18th BioNLP Workshop and Shared Task, Aug 2019, Florence, Italy. pp.359-369, ⟨10.18653/v1/W19-5038⟩. ⟨hal-04449412⟩

    STL

    Open access

  • Journal article

    Anna Koroleva, Camila Olarte Parra, Patrick Paroubek. On improving the implementation of automatic updating of systematic reviews. JAMIA open, 2019, 2 (4), pp.400-401. ⟨10.1093/jamiaopen/ooz044⟩. ⟨hal-04449475⟩

    ILES, STL

    Open access

  • Conference paper

    Anna Koroleva, Sanjay Kamath, Patrick M. M. Bossuyt, Patrick Paroubek. DeSpin: a prototype system for detecting spin in biomedical publications. Proceedings of the BioNLP 2020 workshop, SIG-BIOMED, Jul 2020, online (Seattle), United States. pp.49-59, ⟨10.18653/v1/2020.bionlp-1.5⟩. ⟨hal-04449382⟩

    LaHDAK, STL

    Open access

  • Journal article

    Anna Koroleva, Patrick Paroubek. On the Contribution of Specific Entity Detection in Comparative Constructions to Automatic Spin Detection in Biomedical Scientific Publications. Lecture Notes in Computer Science, 2020, Lecture Notes in Computer Science, 12598, pp.304-317. ⟨10.1007/978-3-030-66527-2_22⟩. ⟨hal-04449857⟩

    STL

  • Journal article

    Gauthier Roussilhe, Anne-Laure Ligozat, Sophie Quinton. A long road ahead: a review of the state of knowledge of the environmental effects of digitization. Current Opinion in Environmental Sustainability, 2023, 62, pp.101296. ⟨10.1016/j.cosust.2023.101296⟩. ⟨hal-04448683⟩

    STL

    Open access

  • Conference paper

    Aman Berhe, Camille Guinaudeau, Claude Barras. Détection de scènes remarquables dans un contexte de séries TV. Conférence en Recherche d’Information et Applications, 2021, Grenoble, France. ⟨hal-04445565⟩

    STL

    Open access

  • Journal article

    Albert Rilliard, Christophe d’Alessandro, Marc Evrard. Paradigmatic variation of vowels in expressive speech: Acoustic description and dimensional analysis. Journal of the Acoustical Society of America, 2018, 143 (1), pp.109-122. ⟨10.1121/1.5018433⟩. ⟨hal-01914497⟩

    STL

    Open access

  • Journal article

    Mathieu Avanzi, Philippe Boula de Mareüil. Peut-on identifier perceptivement huit accents régionaux en français européen ? La réponse des sciences participatives. Glottopol : Revue de sociolinguistique en ligne, 2019, 31, pp.1-21. ⟨hal-03321605⟩

    STL

    Open access

  • Book chapter

    Philippe Boula de Mareüil, Valentina De Iacovo, Antonio Romano, Frédéric Vernier. Un atlante sonoro delle lingue di Francia e d’Italia: focus sulle parlate liguri. Fiorenzo Toso. Il patrimonio linguistico storico della Liguria. Attualità e futuro, Insedicesimo, pp.33-46, 2019. ⟨hal-04441432⟩

  • Journal article

    Yaru Wu, Martine Adda-Decker, Lori Lamel. Schwa Deletion in Word-Initial Syllables of Polysyllabic Words. Journal of Monolingual and Bilingual Speech, 2020, 2 (2), ⟨10.1558/jmbs.17311⟩. ⟨hal-04442984⟩

    STL, TLP

  • Conference poster

    Hélène Bonneau-Maynard. Quelles sont les bonnes pratiques pour que mon cours soit accessible au plus grand nombre ? Journée Initiatives Pédagogiques JIP 2020-2021, Feb 2021, Orsay, France. 2021. ⟨hal-04417697⟩

    STL, TLP

  • Proceedings

    Lori Lamel, Hynek Hermansky, Lukáš Burget, Odette Scharenborg, Petr Motlicek. Proceedings of the 22nd Annual Conference of the International Speech Communication Association: Interspeech 2021. Interspeech 2021, ISCA, 2021, ⟨10.21437/Interspeech.2021⟩. ⟨hal-04442989⟩

    STL

  • Proceedings

    Lori Lamel, Mark Hasegawa-Johnson, John H. L. Hansen, Kyogu Lee, Hanseok Ko, et al. Proceedings of the 23rd Annual Conference of the International Speech Communication Association: Interspeech 2022. Interspeech 2022, 2022, ⟨10.21437/Interspeech.2022⟩. ⟨hal-04442990⟩

    STL

    Open access

  • Software

    Michael Filhol. AZee-eval. 2023. ⟨hal-04434212⟩

    STL

    Open access

  • Proceedings

    Patrick Paroubek, Zygmunt Vetulani. Human Language Technologies as a Challenge for Computer Science and Linguistics – 2023. 10th LANGUAGE AND TECHNOLOGY CONFERENCE: Human Language Technologies as a Challenge for Computer Science and Linguistics, Adam Mickiewicz University Press, 2023, 978-83-232-4177-5. ⟨10.14746/amup.9788323241775⟩. ⟨hal-04442486⟩

    STL

    Open access

  • Conference paper

    Camille Guinaudeau, Andreu Girbau Xalabarder. Textual Analysis for Video Memorability Prediction. The 13th MediaEval Multimedia Benchmark Workshop, Jan 2023, Bergen, Norway. ⟨hal-04091024⟩

    STL

    Available in free access

  • Conference paper

    Thomas Gerald, Sofiane Ettayeb, Ha Quang Le, Gabriel Illouz, Patrick Paroubek, et al. Sélectionner les “bons” passages pour créer les “bonnes” questions : Analyse et Évaluation d’un nouveau Corpus de Questions et Réponses pour l’Éducation. Extraction et Gestion des Connaissances, Jan 2023, Lyon (Université Lumière Lyon 2), France. pp.67-78. ⟨hal-04441447⟩

    STL

    Available in free access

  • Conference paper

    Thomas Gerald, Sofiane Ettayeb, Louis Tamames, Ha Quang Le, Patrick Paroubek, et al. A new approach to generate teacher-like questions guided by text spans extraction. 10th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, Apr 2023, Poznan, Poland. ⟨hal-04441406⟩

    STL

    Available in free access

  • Conference paper

    Elise Lincker, Camille Guinaudeau, Olivier Pons, Isabelle Barbet, Jérôme Dupire, et al. Classification automatique de données déséquilibrées et bruitées : application aux exercices de manuels scolaires. 18e Conférence en Recherche d’Information et Applications - 16e Rencontres Jeunes Chercheurs en RI - 30e Conférence sur le Traitement Automatique des Langues Naturelles - 25e Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues, Servan, Christophe; Vilnat, Anne, Jun 2023, Paris, France. pp.121-130. ⟨hal-04130220⟩

    STL

    Available in free access

  • Conference paper

    Elise Lincker, Olivier Pons, Camille Guinaudeau, Isabelle Barbet, Jérôme Dupire, et al. Layout- and Activity-based Textbook Modeling for Automatic PDF Textbook Extraction. Intelligent Textbooks 2023, Jul 2023, Tokyo, Japan. pp.37-53. ⟨hal-04184895⟩

    STL

    Available in free access

  • Conference paper

    Élise Lincker, Camille Guinaudeau, Olivier Pons, Jérôme Dupire, Céline Hudelot, et al. Noisy and Unbalanced Multimodal Document Classification: Textbook Exercises as a Use Case. 20th International Conference on Content-based Multimedia Indexing (CBMI 2023), Sep 2023, Orléans, France. ⟨10.1145/3617233.3617239⟩. ⟨hal-04221023⟩

    STL

    Available in free access

  • Conference paper

    Nicolas Hiebel, Olivier Ferret, Karën Fort, Aurélie Névéol. Où la frugalité rejoint l’éthique : utilisation de données synthétiques pour la reconnaissance d’entités cliniques. Journée d’étude sur le traitement automatique des langues frugal et la recherche d’information frugale, ATALA, Jan 2024, Paris, France. ⟨hal-04438229⟩

    STL

    Available in free access

  • Preprint, working paper

    Clément Bernard, Guillaume Postic, Sahar Ghannay, Fariza Tahi. State-of-the-RNArt: benchmarking current methods for RNA 3D structure prediction. 2024. ⟨hal-04437967⟩

    STL

    Available in free access

  • Preprint, working paper

    Clément Bernard, Guillaume Postic, Sahar Ghannay, Fariza Tahi. RNAdvisor: a comprehensive benchmarking tool for the measure and prediction of RNA structural model quality. 2024. ⟨hal-04437940⟩

    STL

    Available in free access

  • Preprint, working paper

    Sofiya Kobylyanskaya. Speech and eye tracking features for L2 acquisition: a multimodal experiment. 2022. ⟨hal-04428857⟩

    STL

    Available in free access

  • Other scientific publication

    Joseph J Mariani. 24h pour écouter parler. Conversations sur la langue française : INNOVER. 2023. ⟨hal-04430140⟩

    STL

    Available in free access

  • Preprint, working paper

    Rachel Bawden, Hatim Bourfoune, Bertrand Cabot, Nathan Cassereau, Pierre Cornette, et al. Les modèles Bloom pour le traitement automatique de la langue française. 2024. ⟨hal-04435371⟩

    STL

    Available in free access

  • Proceedings

    Nicoletta Calzolari, Frédéric Bechet, Philippe Blache, Khalid Choukri, Christopher Cieri, et al. Proceedings Language Resources and Evaluation Conference (LREC) 2020. Language Resources and Evaluation Conference (LREC) 2020, 2020, 9781713812500. ⟨hal-04415353⟩

    STL, TLP

    Available in free access

  • Proceedings

    Nicoletta Calzolari, Frédéric Bechet, Philippe Blache, Khalid Choukri, Christopher Cieri, et al. Language Resources and Evaluation Conference LREC 2022 Proceedings. Language Resources and Evaluation Conference (LREC) 2022, European Language Resources Association, 2022, 979-10-95546-72-6. ⟨hal-04413343⟩

    STL

    Available in free access

  • Conference poster

    Jean-Sylvain Liénard. Voice Strength Representation and Estimation from the Long Term Amplitude Spectrum. 32èmes Journées d’Étude sur la Parole, Jun 2018, Aix-en-Provence, France. ⟨hal-04424618⟩

    STL

    Available in free access

  • Proceedings

    Zygmunt Vetulani, Patrick Paroubek, Marek Kubis. Human Language Technology. Challenges for Computer Science and Linguistics. Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, Lecture Notes in Computer Science, 13212, Springer International Publishing, 2019, ISBN 978-83-65988-31-7. ⟨10.1007/978-3-031-05328-3⟩. ⟨hal-04430598⟩

    STL

    Available in free access