STL

Language Sciences and Technologies

Coordination: Aurélie NEVEOL

The Department of Language Sciences and Technologies studies fundamental questions relating to linguistic systems by exploiting large corpora collected, annotated and enriched in an unsupervised or semi-supervised way by statistical learning models adapted to the linguistic material.

These models make it possible to study how languages function, their variations (phonetic-phonological, morphological-lexical, syntactic and semantic), both synchronic and diachronic, diaphasic and diatopic, and to raise questions about their acquisition as mother tongues or second languages. Finally, the department is developing major applications in language processing: speech recognition, automatic translation, information retrieval, conversational agents, etc. … which are increasingly important for society (safeguarding endangered languages, providing tools for people with disabilities, helping to process information and medical knowledge) and for ethics.

This approach to language and languages covers a broad spectrum, from the most fundamental to the most applied research, in a wide variety of media (newspapers, social media, video, telephone, . . .) and all modalities (written, spoken and signed).

This research is highly multidisciplinary, bringing together diverse communities from the fields of computer science, engineering and the humanities.

Teams

Recent Publications

  • Communication dans un congrès

    Rabab Alkhalifa, Iman Bilal, Hsuvas Borkakoty, Jose Camacho-Collados, Romain Deveaud, et al.. Overview of the CLEF-2023 LongEval Lab on Longitudinal Evaluation of Model Performance. CLEF 2023: Experimental IR Meets Multilinguality, Multimodality, and Interaction, Sep 2023, Thessalonic, Greece. pp.440-458, ⟨10.1007/978-3-031-42448-9_28⟩. ⟨hal-04475726⟩

    ILES, STL

    Year of publication

  • Communication dans un congrès

    Fatima Hamlaoui, Emmanuel-Moselly Makasso, Markus Müller, Jonas Engelmann, Gilles Adda, et al.. BULBasaa: A Bilingual Bàsàá-French Speech Corpus for the Evaluation of Language Documentation Tools. LREC 2018, European Language Resources Association (ELRA), May 2018, Miyazaki, Japan. ⟨hal-04466108⟩

    STL

    Year of publication

    Available in free access

  • Communication dans un congrès

    Yuming Zhai, Gabriel Illouz, Anne Vilnat. Detecting Non-literal Translations by Fine-tuning Cross-lingual Pre-trained Language Models. 28th International Conference on Computational Linguistics (COLING), Dec 2020, Barcelona (on line), Spain. pp.5944-5956, ⟨10.18653/v1/2020.coling-main.522⟩. ⟨hal-04468022⟩

    ILES, STL, STL

    Year of publication

    Available in free access

  • Article dans une revue

    Surya Roca, Sophie Rosset, José García, Álvaro Alesanco. A Study on the Impacts of Slot Types and Training Data on Joint Natural Language Understanding in a Spanish Medication Management Assistant Scenario. Sensors, 2022, 22 (6), pp.2364. ⟨10.3390/s22062364⟩. ⟨hal-04465686⟩

    STL

    Year of publication

    Available in free access

  • Communication dans un congrès

    Laura Spinu, Ioana Vasilescu, Lori Lamel, Jason Lilley. Voicing neutralization in Romanian fricatives across different speech styles. Interspeech, ISCA, Sep 2022, Incheon, South Korea. pp.1342-1346, ⟨10.21437/interspeech.2022-10716⟩. ⟨hal-04465920⟩

    STL, TLP

    Year of publication

    Available in free access

  • Proceedings/Recueil des communications

    Christophe Servan, Anne Vilnat. Actes de CORIA-TALN 2023. Actes de la 30e Conférence sur le Traitement Automatique des Langues Naturelles (TALN) : volume 5 : démonstrations. CORIA – TALN 2023, 2023. ⟨hal-04462998⟩

    STL

    Year of publication

    Available in free access

  • Proceedings/Recueil des communications

    Christophe Servan, Anne Vilnat. Actes de CORIA-TALN 2023. Actes de la 30e Conférence sur le Traitement Automatique des Langues Naturelles (TALN) : volume 4 : articles déjà soumis ou acceptés en conférence internationale. CORIA – TALN 2023, 2023. ⟨hal-04462975⟩

    STL

    Year of publication

    Available in free access

  • Proceedings/Recueil des communications

    Christophe Servan, Anne Vilnat. Actes de CORIA-TALN 2023. Actes de la 30e Conférence sur le Traitement Automatique des Langues Naturelles (TALN) : volume 6 : projets. CORIA – TALN 2023, 2023. ⟨hal-04463005⟩

    STL

    Year of publication

    Available in free access

  • Proceedings/Recueil des communications

    Christophe Servan, Anne Vilnat. Actes de CORIA-TALN 2023. Actes de la 30e Conférence sur le Traitement Automatique des Langues Naturelles (TALN) : volume 3 : prises de position en TALTraitement Automatique des langues. CORIA – TALN 2023, 2023. ⟨hal-04462921⟩

    STL

    Year of publication

    Available in free access

  • Proceedings/Recueil des communications

    Christophe Servan, Anne Vilnat. Actes de la 30e Conférence sur le Traitement Automatique des Langues Naturelles (TALN) : volume 2 : travaux de recherche originaux – articles courts. CORIA – TALN 2023, 2023. ⟨hal-04462841⟩

    STL

    Year of publication

    Available in free access

  • Proceedings/Recueil des communications

    Christophe Servan, Anne Vilnat. Actes de la 30e Conférence sur le Traitement Automatique des Langues Naturelles (TALN) : volume 1 : travaux de recherche originaux – articles longs. CORIA – TALN 2023, 2023. ⟨hal-04462825⟩

    STL

    Year of publication

    Available in free access

  • Ouvrages

    Serge Sharoff, Reinhard Rapp, Pierre Zweigenbaum. Building and Using Comparable Corpora for Multilingual Natural Language Processing. Springer Cham, 2023, Synthesis Lectures on Human Language Technologies, Graeme Hirst, 978-3-031-31383-7. ⟨10.1007/978-3-031-31384-4⟩. ⟨hal-04470213⟩

    STL

    Year of publication

  • Chapitre d'ouvrage

    Serge Sharoff, Reinhard Rapp, Pierre Zweigenbaum. Basic Principles of Cross-Lingual Models. Building and Using Comparable Corpora for Multilingual Natural Language Processing, Springer International Publishing, pp.9-16, 2023, Synthesis Lectures on Human Language Technologies, ⟨10.1007/978-3-031-31384-4_2⟩. ⟨hal-04465490⟩

    STL

    Year of publication

  • Communication dans un congrès

    Bernd Dudzik, Tiffany Matej Hrkalovic, Dennis Küster, David St-Onge, Felix Putze, et al.. The 5th Workshop on Modeling Socio-Emotional and Cognitive Processes from Multimodal Data in the Wild (MSECP-Wild). ICMI ’23: International Conference On Multimodal Interaction, Oct 2023, Paris France, France. pp.828-829, ⟨10.1145/3577190.3616883⟩. ⟨hal-04465456⟩

    STL

    Year of publication

    Available in free access

  • Communication dans un congrès

    John Mcdonald, Michael Filhol, Camille Challant. Geometric Modifications of Gestures in Sign Languages. International Society for Gesture Studies conference, Jul 2022, Chicago, United States. ⟨hal-03721241⟩

    STL

    Year of publication

  • Communication dans un congrès

    Adam Lion-Bouton, Yağmur Öztürk, Agata Savary, Jean-Yves Antoine. Evaluating Diversity of Multiword Expressions in Annotated Text. International Committee on Computational Linguistics (COLING), International Committee on Computational Linguistics, Oct 2022, Gyeongju, South Korea. pp.3285-3295. ⟨hal-04468662⟩

    STL, STL

    Year of publication

    Available in free access

  • Article dans une revue

    S Cardoso, X Aimé, V Meininger, D Grabli, Lf Melo Mora, et al.. A Modular Ontology for Modeling Service Provision in a Communication Network for Coordination of Care.. Studies in Health Technology and Informatics, 2018, 247, pp.890-894. ⟨hal-02481869⟩

    STL

    Year of publication

  • Communication dans un congrès

    Plinio Barbosa, Philippe Boula de Mareüil. Imitating Broadcast News Style: Commonalities and Differences Between French and Brazilian Professionals. Book cover Book cover International Conference on Computational Processing of the Portuguese Language (PROPOR 2018), Sep 2018, Canela, Brazil. pp.419-428, ⟨10.1007/978-3-319-99722-3_42⟩. ⟨hal-04466213⟩

    STL, STL, TLP

    Year of publication

  • Article dans une revue

    Christopher Norman, Elizabeth Gargon, Mariska Leeflang, Aurélie Névéol, Paula Williamson. Evaluation of an automatic article selection method for timelier updates of the Comet Core Outcome Set database. Database – The journal of Biological Databases and Curation, 2019, 2019, ⟨10.1093/database/baz109⟩. ⟨hal-04466023⟩

    ILES, ILES, STL

    Year of publication

    Available in free access

  • Communication dans un congrès

    Patricia Chiril, Farah Benamara, Véronique Moriceau, Kumar Abhishek. The binary trio at SemEval-2019 Task 5: Multitarget Hate Speech Detection in Tweets. 13th International Workshop on Semantic Evaluation (SemEval 2019), Jun 2019, Minneapolis, United States. pp.489-493, ⟨10.18653/v1/S19-2087⟩. ⟨hal-02951036⟩

    STL

    Year of publication

    Available in free access

  • Pré-publication, Document de travail

    Marcely Zanon, Bolaji Yusuf, Lucas Ondel, Aline Villavicencio, Laurent Besacier. Unsupervised Word Segmentation from Discrete Speech Units in Low-Resource Settings. 2021. ⟨hal-03477951⟩

    STL

    Year of publication

  • Communication dans un congrès

    Elena Knyazeva, Philippe Boula de Mareüil, Frédéric Vernier. Aesop’s Fable “The North Wind and the Sun” Used as a Rosetta Stone to Extract and Map Spoken Words in Under-resourced Languages. LREC 2022 – 13th Conference on Language Resources and Evaluation, ELRA, Jun 2022, Marseille, France. pp.2072-2079. ⟨hal-04465840⟩

    AVIZ, STL, STL

    Year of publication

    Available in free access

  • Communication dans un congrès

    Rachel Bawden, Marie-Amélie Marieamelie.Botalla@gmail.Com Bottala, Kim Gerdes, Sylvain Kahane. Correcting and Validating Syntactic Dependency in the Spoken French Treebank Rhapsodie. Proceedings of the 9th Language Resources and Evaluation Conference (LREC), 2014, Iceland. pp.1-6. ⟨halshs-01011059⟩

    STL

    Year of publication

    Available in free access

  • Article dans une revue

    Nesrine Fourati, C. Pelachaud. Perception of Emotions and Body Movement in the Emilya Database. IEEE Transactions on Affective Computing, 2016. ⟨hal-02287454⟩

    STL

    Year of publication

  • Article dans une revue

    Nesrine Fourati, Catherine Pelachaud. Perception of Emotions and Body Movement in the Emilya Database. IEEE Transactions on Affective Computing, 2018, 9 (1), pp.90-101. ⟨10.1109/TAFFC.2016.2591039⟩. ⟨hal-02382677⟩

    STL

    Year of publication

  • Chapitre d'ouvrage

    Joseph J Mariani, Zygmunt Vetulani. Preface. Zygmunt Vetulani, Patrick Paroubek, Marek Kubis. Human Language Technology. Challenges for Computer Science and Linguistics 7th Language and Technology Conference, LTC 2015, Poznań, Poland, November 27-29, 2015, Revised Selected Papers, Springer, 2018, Lecture Notes in Computer Science, 978-3-319-93782-3. ⟨10.1007/978-3-319-93782-3⟩. ⟨hal-04455017⟩

    STL

    Year of publication

    Available in free access

  • Chapitre d'ouvrage

    Kim Gerdes, Sylvain Kahane, Rachel Bawden, Julie Beliao, Éric Villemonte de La Clergerie, et al.. Annotation tools for syntax. Rhapsodie: A Prosodic and Syntactic Treebank for Spoken French, John Benjamins, 2019, ⟨10.1075/scl.89.08ger⟩. ⟨hal-02450311⟩

    STL

    Year of publication

  • Communication dans un congrès

    Tristan Luiggi, Vincent Guigue, Laure Soulier, Siwar Jendoubi, Aurelien Baelde. Dynamic Named Entity Recognition. 38th ACM/SIGAPP Symposium on Applied Computing, Mar 2023, Tallinn, Estonia. pp.890-897, ⟨10.1145/3555776.3577603⟩. ⟨hal-04284318⟩

    STL

    Year of publication

    Available in free access

  • Article dans une revue

    Annelies Braffort. Compte-rendu de l’ouvrage “La langue des signes. Statuts linguistiques et institutionnels”, numéro de Langue française, n° 137, C. Cuxac (dir.). Le Français Moderne – Revue de linguistique Française, 2004, 2, pp.250-252. ⟨hal-04457633⟩

    STL

    Year of publication

  • Communication dans un congrès

    Anisia Popescu, Mathilde Hutin, Ioana Vasilescu, Lori Lamel, Martine Adda-Decker. STOP DEVOICING AND PLACE OF ARTICULATION: A CROSS-LINGUISTIC STUDY USING LARGE-SCALE CORPORA. 20th International Congress of Phonetic Sciences, Aug 2023, Prague (Czech Republic), Czech Republic. ⟨hal-04451524⟩

    STL

    Year of publication

    Available in free access

  • Chapitre d'ouvrage

    Joseph J Mariani, Gilles Adda, Khalid Choukri, Irmgarda Kasinskaite Buddeberg, Hélène Mazo, et al.. Introduction by the Organizers of the thematic tracks on the Achievements and Challenges (Day 2 and Day 3). Proceedings of the Language Technology for All (LT4All) conference, 2020. ⟨hal-04455045⟩

    STL

    Year of publication

    Available in free access

  • Communication dans un congrès

    Olivier Ridoux, Clément Morand. Extraction dans des textes anciens d’entités nommées de type binômes de la classification linnéenne du vivant : une étude de cas. Extraction et Gestion des Connaissances (EGC) 2023, 2023, Lyon, France. ⟨hal-04447919⟩

    STL

    Year of publication

    Available in free access

  • Communication dans un congrès

    Anisia Popescu, Lori Lamel, Ioana Vasilescu. Typological classification of European Portuguese fricatives: a cross-language forced alignment and pronunciation variants study. 6th International Conference on Natural Language and Speech Processing (ICNLSP 2023), Dec 2023, Trento, Italy. pp.239-243. ⟨hal-04451618⟩

    STL

    Year of publication

    Available in free access

  • Communication dans un congrès

    Yenan Sun, Laura Stigliano, Eszter Ronai, Amara Sankhagowit, Anisia Popescu, et al.. The role of contextual-pragmatic information on speech perception: An eye tracking study. 2018 Linguistic Society of America (LSA2018) Annual Meeting, Jan 2018, Salt Lake City, United States. ⟨hal-04453051⟩

    STL

    Year of publication

  • Communication dans un congrès

    Anna Koroleva, Patrick Paroubek. Annotating Spin in Biomedical Scientific Publications: the case of Randomized Controlled Trials (RCTs). Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), ELRA, May 2018, MIYAZAKI, Japan. ⟨hal-04449090⟩

    STL

    Year of publication

    Available in free access

  • Communication dans un congrès

    Anisia Popescu, Ioana Chitoran. Jugements sur le nombre de syllabes et coordination temporelle des gestes articulatoires. 32e Journées d’Études sur la Parole, Jun 2018, Aix-Marseille, France. ⟨hal-04453008⟩

    STL

    Year of publication

  • Communication dans un congrès

    Anisia Popescu, Ioana Chitoran. Syllable count judgments and temporal organization of articulatory gestures. 16th Conference on Laboratory Phonology, Jun 2018, Lisbon, Portugal. ⟨hal-04453032⟩

    STL

    Year of publication

  • Communication dans un congrès

    Anisia Popescu, Lisa Hintermeier, Stella Krüger, Aude Noiray. Does the acquisition of reading affect speech production?. Phonetics and Phonology in Europe (PaPE2019), Jun 2019, Lecce, Italy. ⟨hal-04452973⟩

    STL

    Year of publication

  • Communication dans un congrès

    Anisia Popescu, Aude Noiray. Reading proficiency and phonemic awareness as predictors of coarticulatory gradients in children. Boston University Conference on Language Development (BUCLD44), Nov 2019, Boston, United States. ⟨hal-04452968⟩

    STL

    Year of publication

  • Communication dans un congrès

    Elina Rubertus, Anisia Popescu. Development of coarticulation: comparing modalities in beginner readers. 12th Internation Seminar on Speech Production (ISSP2020), Dec 2020, Virtual conference, United States. ⟨hal-04452952⟩

    STL

    Year of publication

  • Communication dans un congrès

    Anisia Popescu, Aude Noiray. Coarticulatory organization in beginner readers: a multifactorial interaction approach. 12th Internation Seminar on Speech Production (ISSP2020), Dec 2020, Virtual conference, United States. ⟨hal-04452958⟩

    STL

    Year of publication

  • Communication dans un congrès

    Anisia Popescu, Aude Noiray. Does learning how to read affect the way you speak? Preliminary insight from German beginning readers. Internation Child Phonology Conference, Jun 2021, Virtual conference, Canada. ⟨hal-04452938⟩

    STL

    Year of publication

  • Communication dans un congrès

    Anisia Popescu, Aude Noiray. Does learning to read interact with speech patterns in consistent alphabetic systems? The case of German. Architectures and Mechanisms for Language Processing (AMLAP2021), Sep 2021, Paris, France. ⟨hal-04452918⟩

    STL

    Year of publication

  • Communication dans un congrès

    Anisia Popescu, Louis Goldstein, Mairym Llorens Montesérin, Shrikanth S Narayanan. A multi-sclice rtMRI analysis of horizontal tongue narrowing in English laterals. 18th Conference on Laboratory Phonology, Jun 2022, virtual, France. ⟨hal-04452909⟩

    STL

    Year of publication

  • Communication dans un congrès

    Anisia Popescu, Mathilde Hutin, Ioana Vasilescu, Lori Lamel, Martine Adda-Decker. Stop devoicing and palace of articulation: a cross-linguistic study using large-scale corpora. 20th International Congress of Phonetic Sciences (ICPhS2023), Aug 2023, Prague, Czech Republic. pp.3186 – 3190. ⟨hal-04452900⟩

    STL

    Year of publication

  • Communication dans un congrès

    Anisia Popescu, Lori Lamel, Ioana Vasilescu. Typological classification of European Portuguese fricatives: a cross-language forced alignment and pronunciation variants study. 6th International Conference on Natural Language and Speech Processing (ICNLSP 2023), Dec 2023, Trento (Italy), Italy. pp.239-243. ⟨hal-04452889⟩

    STL

    Year of publication

  • Thèse

    Shu Okabe. Modèles faiblement supervisés pour la documentation automatique des langues. Informatique et langage [cs.CL]. Université Paris-Saclay, 2023. Français. ⟨NNT : 2023UPASG091⟩. ⟨tel-04453579⟩

    STL

    Year of publication

    Available in free access

  • Communication dans un congrès

    Brigitte Garcia, Dominique Boutet, Annelies Braffort, Patrice Dalle. Sign Language (SL) in Graphical Form : Methodology, modellisation and representations for gestural communication. Sign Language (SL) in Graphical Form : Methodology, modellisation and representations for gestural communication, Jun 2005, Lyon, France. http://gesture-lyon2005.ens-lsh.fr/article.php3?id_article=230. ⟨halshs-00165911⟩

    ILES, STL

    Year of publication

  • Chapitre d'ouvrage

    Marc Evrard. Transformers in Automatic Speech Recognition. Human-Centered Artificial Intelligence, 13500, Springer International Publishing, pp.123-139, 2023, Lecture Notes in Computer Science, ⟨10.1007/978-3-031-24349-3_8⟩. ⟨hal-04259186⟩

    STL

    Year of publication

  • Communication dans un congrès

    Emmett Strickland, Dana Aubakirova, Dorin Doncenco, Diego Torres, Marc Evrard. NaijaTTS: A pitch-controllable TTS model for Nigerian Pidgin. ISCA Speech Synthesis Workshop, Aug 2023, Grenoble, France. ⟨hal-04183972⟩

    STL

    Year of publication

    Available in free access

  • Communication dans un congrès

    Anisia Popescu, Lori Lamel, Ioana Vasilescu. Using cross-language automatic speech recognition and pronunciation variants to investigate voicing in European Portuguese fricatives. Architectures and Mechanisms for Language Processing (AMLAP23), Aug 2023, San Sebastian, Spain. ⟨hal-04451669⟩

    STL

    Year of publication

  • Communication dans un congrès

    Anisia Popescu, Ioana Chitoran. Laterals in simplex vs. complex syllable codas: a comparison of four languages. 13th International Seminar on Speech Production (ISSP2024), May 2024, Autrans (Grenoble), France. ⟨hal-04451665⟩

    STL

    Year of publication

  • Communication dans un congrès

    Anisia Popescu, Lori Lamel, Ioana Vasilescu, Laurence Devillers. An investigation of syllable position /l/ allophony in L2 English learners using Word Error Rate as an index of phonetic proficiency. 13th International Seminar on Speech Production (ISSP2024), May 2024, Autrans, France. ⟨hal-04451662⟩

    STL

    Year of publication

  • Article dans une revue

    Anna Koroleva, Sanjay Kamath, Patrick Paroubek. Measuring semantic similarity of clinical trial outcomes using deep pre-trained language representations. Journal of Biomedical Informatics, 2019, 100, pp.100058. ⟨10.1016/j.yjbinx.2019.100058⟩. ⟨hal-04449449⟩

    LaHDAK, STL

    Year of publication

    Available in free access

  • Communication dans un congrès

    Anna Koroleva, Patrick Paroubek. Demonstrating ConstruKT, a text annotation toolkit for generalized linguistic contructions applied to communication spin. Human Language Technologies as a Challenge for Computer Science and Linguistics – 2019, May 2019, Poznañ, France. pp.19-20. ⟨hal-04449153⟩

    STL

    Year of publication

  • Communication dans un congrès

    Patrick Paroubek, Anna Koroleva, Corentin Masson. Analysing clinical trial outcomes in trial registries : towards creating an ontology of clinical trial outcomes. TOTh 2019 – Terminologie & Ontologie : Théories et Applications, Jun 2019, Le Bourget du Lac, France. pp.309-319. ⟨hal-04449764⟩

    STL

    Year of publication

  • Communication dans un congrès

    Anna Koroleva, Patrick Paroubek. Extracting relations between outcomes and significance levels in Randomized Controlled Trials (RCTs) publications. Proceedings of the 18th BioNLP Workshop and Shared Task, Aug 2019, Florence, France. pp.359-369, ⟨10.18653/v1/W19-5038⟩. ⟨hal-04449412⟩

    STL

    Year of publication

    Available in free access

  • Article dans une revue

    Anna Koroleva, Camila Olarte Parra, Patrick Paroubek. On improving the implementation of automatic updating of systematic reviews. JAMIA open, 2019, 2 (4), pp.400-401. ⟨10.1093/jamiaopen/ooz044⟩. ⟨hal-04449475⟩

    STL

    Year of publication

    Available in free access

  • Communication dans un congrès

    Anna Koroleva, Sanjay Kamath, Patrick M. M. Bossuyt, Patrick Paroubek. DeSpin: a prototype system for detecting spin in biomedical publications. roceedings of the BioNLP 2020 workshop, SIG-BIOMED, Jul 2020, online (Seattle), United States. pp.49-59, ⟨10.18653/v1/2020.bionlp-1.5⟩. ⟨hal-04449382⟩

    LaHDAK, STL

    Year of publication

    Available in free access

  • Article dans une revue

    Anna Koroleva, Patrick Paroubek. On the Contribution of Specific Entity Detection in Comparative Constructions to Automatic Spin Detection in Biomedical Scientific Publications. Lecture Notes in Computer Science, 2020, Lecture Notes in Computer Science, 12598, pp.304-317. ⟨10.1007/978-3-030-66527-2_22⟩. ⟨hal-04449857⟩

    STL

    Year of publication

  • Article dans une revue

    Gauthier Roussilhe, Anne-Laure Ligozat, Sophie Quinton. A long road ahead: a review of the state of knowledge of the environmental effects of digitization. Current Opinion in Environmental Sustainability, 2023, 62, pp.101296. ⟨10.1016/j.cosust.2023.101296⟩. ⟨hal-04448683⟩

    STL

    Year of publication

    Available in free access

  • Communication dans un congrès, Communication dans un congrès

    Aman Berhe, Camille Guinaudeau, Claude Barras. Détection de scènes remarquables dans un contexte de séries TV. Conférence en Recherche d’Information et Applications, 2021, Grenoble, France. ⟨hal-04445565⟩

    STL

    Year of publication

    Available in free access

  • Article dans une revue

    Albert Rilliard, Christophe d’Alessandro, Marc Evrard. Paradigmatic variation of vowels in expressive speech: Acoustic description and dimensional analysis. Journal of the Acoustical Society of America, 2018, 143 (1), pp.109-122. ⟨10.1121/1.5018433⟩. ⟨hal-01914497⟩

    STL

    Year of publication

    Available in free access

  • Article dans une revue

    Mathieu Avanzi, Philippe Boula de Mareüil. Peut-on identifier perceptivement huit accents régionaux en français européen ? La réponse des sciences participatives. Glottopol : Revue de sociolinguistique en ligne, 2019, 31, pp.1-21. ⟨hal-03321605⟩

    STL

    Year of publication

    Available in free access

  • Chapitre d'ouvrage

    Philippe Boula de Mareüil, Valentina De Iacovo, Antonio Romano, Frédéric Vernier. Un atlante sonoro delle lingue di Francia e d’Italia: focus sulle parlate liguri. Fiorenzo Toso. Il patrimonio linguistico storico della Liguria. Attualità e futuro, Insedicesimo, pp.33-46, 2019. ⟨hal-04441432⟩

  • Article dans une revue

    Yaru Wu, Martine Adda-Decker, Lori Lamel. Schwa Deletion in Word-Initial Syllables of Polysyllabic Words. Journal of Monolingual and Bilingual Speech, 2020, 2 (2), ⟨10.1558/jmbs.17311⟩. ⟨hal-04442984⟩

    STL, TLP

    Year of publication

  • Poster de conférence

    Hélène Bonneau-Maynard. Quelles sont les bonnes pratiques pour que mon cours soit accessible au plus grand nombre ?. journée Initiatives Pédagogiques JIP 2020-2021, Feb 2021, Orsay, France. 2021. ⟨hal-04417697⟩

    STL, TLP

    Year of publication

  • Proceedings/Recueil des communications

    Lori Lamel, Hynek Hermansky, Lukáš Burget, Odette Scharenborg, Petr Motlicek. Proceedings of the 22nd Annual Conference of the International Speech Communication Association: Interspeech 2021. Interspeech 2021, ISCA, 2021, ⟨10.21437/Interspeech.2021⟩. ⟨hal-04442989⟩

    STL

    Year of publication

  • Proceedings/Recueil des communications

    Lori Lamel, Mark Hasegawa-Johnson, John H. L. Hansen, Kyogu Lee, Hanseok Ko, et al.. Proceedings of the 23rd Annual Conference of the International Speech Communication Association: Interspeech 2022. interspeech 2022, 2022, ⟨10.21437/Interspeech.2022⟩. ⟨hal-04442990⟩

    STL

    Year of publication

    Available in free access

  • Logiciel

    Michael Filhol. AZee-eval. 2023. ⟨hal-04434212⟩

    STL

    Year of publication

    Available in free access

  • Proceedings/Recueil des communications

    Patrick Paroubek, Zygmunt Vetulani. Human Language Technologies as a Challenge for Computer Science and Linguistics – 2023. 10th LANGUAGE AND TECHNOLOGY CONFERENCE: Human Language Technologies as a Challenge for Computer Science and Linguistics, Adam Mickiewicz University Press, 2023, 978-83-232-4177-5. ⟨10.14746/amup.9788323241775⟩. ⟨hal-04442486⟩

    STL

    Year of publication

    Available in free access

  • Communication dans un congrès

    Camille Guinaudeau, Andreu Girbau Xalabarder. Textual Analysis for Video Memorability Prediction. the 13th MediaEval Multimedia Benchmark Workshop, Jan 2023, Bergen, Norway. ⟨hal-04091024⟩

    STL

    Year of publication

    Available in free access

  • Communication dans un congrès

    Thomas Gerald, Sofiane Ettayeb, Ha Quang Le, Gabriel Illouz, Patrick Paroubek, et al.. Sélectionner les “bons” passages pour créer les “bonnes” questions : Analyse et Évaluation d’un nouveau Corpus de Questions et Réponses pour l’Éducation. Extraction et Gestion des Connaissances, Jan 2023, Lyon (Université Lumière Lyon 2), France. pp.67-78. ⟨hal-04441447⟩

    STL

    Year of publication

    Available in free access

  • Communication dans un congrès

    Thomas Gerald, Sofiane Ettayeb, Louis Tamames, Ha Quang Le, Patrick Paroubek, et al.. A new approach to generate teacher-like questions guided by text spans extraction. 10th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, Apr 2023, Poznan, Poland. ⟨hal-04441406⟩

    STL

    Year of publication

    Available in free access

  • Communication dans un congrès

    Elise Lincker, Camille Guinaudeau, Olivier Pons, Isabelle Barbet, Jérôme Dupire, et al.. Classification automatique de données déséquilibrées et bruitées : application aux exercices de manuels scolaires. 18e Conférence en Recherche d’Information et Applications — 16e Rencontres Jeunes Chercheurs en RI — 30e Conférence sur le Traitement Automatique des Langues Naturelles — 25e Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues, Servan, Christophe; Vilnat, Anne, Jun 2023, Paris, France. pp.121-130. ⟨hal-04130220⟩

    STL

    Year of publication

    Available in free access

  • Communication dans un congrès

    Elise Lincker, Olivier Pons, Camille Guinaudeau, Isabelle Barbet, Jérôme Dupire, et al.. Layout- and Activity-based Textbook Modeling for Automatic PDF Textbook Extraction. Intelligent Textbooks 2023, Jul 2023, Tokyo, Japan. pp.37-53. ⟨hal-04184895⟩

    STL

    Year of publication

    Available in free access

  • Communication dans un congrès

    Élise Lincker, Camille Guinaudeau, Olivier Pons, Jérôme Dupire, Céline Hudelot, et al.. Noisy and Unbalanced Multimodal Document Classification: Textbook Exercises as a Use Case. 20th International Conference on Content-based Multimedia Indexing (CBMI 2023), Sep 2023, Orléans, France. ⟨10.1145/3617233.3617239⟩. ⟨hal-04221023⟩

    STL

    Year of publication

    Available in free access

  • Communication dans un congrès

    Nicolas Hiebel, Olivier Ferret, Karën Fort, Aurélie Névéol. Où la frugalité rejoint l’éthique : utilisation de données synthétiques pour la reconnaissance d’entités cliniques. Journée d’étude sur le traitement automatique des langues frugal et la recherche d’information frugale, ATALA, Jan 2024, Paris, France. ⟨hal-04438229⟩

    STL

    Year of publication

    Available in free access

  • Pré-publication, Document de travail

    Clément Bernard, Guillaume Postic, Sahar Ghannay, Fariza Tahi. State-of-the-RNArt: benchmarking current methods for RNA 3D structure prediction. 2024. ⟨hal-04437967⟩

    STL

    Year of publication

    Available in free access

  • Pré-publication, Document de travail

    Clément Bernard, Guillaume Postic, Sahar Ghannay, Fariza Tahi. RNAdvisor: a comprehensive benchmarking tool for the measure and prediction of RNA structural model quality. 2024. ⟨hal-04437940⟩

    STL

    Year of publication

    Available in free access

  • Pré-publication, Document de travail

    Sofiya Kobylyanskaya. Speech and eye tracking features for L2 acquisition: a multimodal experiment. 2022. ⟨hal-04428857⟩

    STL

    Year of publication

    Available in free access

  • Autre publication scientifique

    Joseph J Mariani. 24h pour écouter parler. Conversations sur la langue française : INNOVER. 2023. ⟨hal-04430140⟩

    STL

    Year of publication

    Available in free access

  • Pré-publication, Document de travail

    Rachel Bawden, Hatim Bourfoune, Bertrand Cabot, Nathan Cassereau, Pierre Cornette, et al.. Les modèles Bloom pour le traitement automatique de la langue française. 2024. ⟨hal-04435371⟩

    STL

    Year of publication

    Available in free access

  • Proceedings/Recueil des communications

    Nicoletta Calzolari, Frédéric Bechet, Philippe Blache, Khalid Choukri, Christopher Cieri, et al.. Proceedings Language Resources and Evaluation Conference (LREC) 2020. Language Resources and Evaluation Conference (LREC) 2020, 2020, 9781713812500. ⟨hal-04415353⟩

    STL, TLP

    Year of publication

    Available in free access

  • Proceedings/Recueil des communications

    Nicoletta Calzolari, Frédéric Bechet, Philippe Blache, Khalid Choukri, Christopher Cieri, et al.. Language Resources and Evaluation Conference LREC 2022 Proceedings. Language Resource and Evaluation Conference (LREC) 2022, European Language Resources Association, 2022, 979-10-95546-72-6. ⟨hal-04413343⟩

    STL

    Year of publication

    Available in free access

  • Poster de conférence

    Jean-Sylvain Liénard. Voice Strength Representation and Estimation from the Long Term Amplitude Spectrum. 32èmes Journées d’Étude sur la Parole, Jun 2018, Aix-en-Provence, France. . ⟨hal-04424618⟩

    STL

    Year of publication

    Available in free access

  • Proceedings/Recueil des communications

    Zygmunt Vetulani, Patrick Paroubek, Marek Kubis. Human Language Technology. Challenges for Computer Science and Linguistics. Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, Lecture Notes in Computer Science, 13212, Springer International Publishing; Springer International Publishing, 2019, Lecture Notes in Computer Science, ISBN 978-83-65988-31-7. ⟨10.1007/978-3-031-05328-3⟩. ⟨hal-04430598⟩

    STL

    Year of publication

    Available in free access

  • Communication dans un congrès

    Atilla Kaan Alkan, Cyril Grouin, Fabian Schüssler, Pierre Zweigenbaum. A Majority Voting Strategy of a SciBERT-based Ensemble Models for Detecting Entities in the Astrophysics Literature (Shared Task). First Workshop on Information Extraction from Scientific Publications, Association for Computational Linguistics, Nov 2022, Online, Taiwan. pp.131-139. ⟨hal-04425922⟩

    STL, STL

    Year of publication

    Available in free access

  • Communication dans un congrès

    Sofiya Kobylyanskaya, Ioana Vasilescu, Laurence Devillers, Olivier Augereau. Vers la compréhension des difficultés de lecture en L2 à travers des paramètres acoustiques et de mouvement des yeux. Environnements Informatiques pour l’Apprentissage Humain (EIAH), Jun 2023, Brest, France. ⟨hal-04388887⟩

    STL

    Year of publication

    Available in free access

  • Article dans une revue

    Laurence Devillers. Affective and social dimensions in human‐robot interactions. Interfaces numériques, 2013, 2 (1), pp.105 – 117. ⟨10.25965/interfaces-numeriques.1760⟩. ⟨hal-04421784⟩

    STL

    Year of publication

    Available in free access

  • Chapitre d'ouvrage

    Joseph J Mariani. Les Technologies Linguistiques – Regard d’Expert. RAPPORT 2018 sur l’état de la Francophonie numérique, 2018, ISBN 978-92-9028-436-9. ⟨hal-04413134⟩

    STL

    Year of publication

    Available in free access

  • N°spécial de revue/special issue

    Alexandre Allauzen, Hinrich Schütze. Apprentissage profond pour le traitement automatique des langues. Revue TALTraitement Automatique des langues : traitement automatique des langues, 59 (2), 2018. ⟨hal-04421499⟩

    STL, TLP

    Year of publication

    Available in free access

  • Communication dans un congrès

    Syrielle Montariol, Alexandre Allauzen. Empirical Study of Diachronic Word Embeddings for Scarce Data. Recent Advances in Natural Language Processing, Sep 2019, Varna, Bulgaria. pp.795-803, ⟨10.26615/978-954-452-056-4_092⟩. ⟨hal-04421484⟩

    STL

    Year of publication

    Available in free access

  • Article dans une revue

    Sanjay Kamath, Brigitte Grau, Yue Ma. Predicting and Integrating Expected Answer Types into a Simple Recurrent Neural Network Model for Answer Sentence Selection. Computación y sistemas, 2019, 23 (3), ⟨10.13053/CYS-23-3-3241⟩. ⟨hal-04423041⟩

    LaHDAK, STL

    Year of publication

  • Chapitre d'ouvrage

    Laurence Devillers. Emotional and Social Forms of Robots. Georges Chapouthier et Marie Christine Maurel. The Explosion of Life Forms: Living Beings and Morphology, John Wiley & Sons, pp.173-182, 2020, 978-1-78945-005-7. ⟨hal-04423332⟩

    STL

    Year of publication

  • Chapitre d'ouvrage

    Joseph J Mariani. Language technology for all: a challenge. UNESCO Report on Languages, In press. ⟨hal-04415222⟩

    STL

    Year of publication

    Available in free access

  • Communication dans un congrès

    Hugues Ali Mehenni, Sofiya Kobylyanskaya, Ioana Vasilescu, Laurence Devillers. Children as Candidates to Verbal Nudging in a Human-robot Experiment. ICMI ’20: INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, Oct 2020, Virtual Event Netherlands, Netherlands. pp.482-486, ⟨10.1145/3395035.3425224⟩. ⟨hal-04423128⟩

    STL

    Year of publication

  • Communication dans un congrès

    He Tianyu, Zheng Yuyu, Bai Jing, Chen Pan, Ma Yue, et al.. Analysis of emotional characteristics of Weibo “tree hole” users with different suicide risk. ISAIMS 2021: 2nd International Symposium on Artificial Intelligence for Medicine Sciences, Oct 2020, Beijing China, France. pp.562-567, ⟨10.1145/3500931.3501027⟩. ⟨hal-04423038⟩

    STL

    Year of publication

  • Thèse

    Antoine Neuraz. Compréhension du langage naturel pour le dossier patient informatisé : accès à l’information et extraction d’information. Bio-informatique [q-bio.QM]. Université Paris Cité, 2020. Français. ⟨NNT : 2020UNIP5201⟩. ⟨tel-04210975⟩

    ILES, STL

    Year of publication

    Available in free access

  • Communication dans un congrès

    Syrielle Montariol, Alexandre Allauzen, Asanobu Kitamoto. Variations in Word Usage for the Financial Domain. Second Workshop on Financial Technology and Natural Language Processing, Jan 2021, Virtual conference, France. ⟨hal-04421686⟩

    ILES, STL

    Year of publication

    Available in free access