Conference paper, Computer Science, Document and Text Processing
Kitten: a tool for normalizing HTML and extracting its textual content.
Mathieu-Henri Falco, Véronique Moriceau, Anne Vilnat. Kitten: a tool for normalizing HTML and extracting its textual content. LREC 2012 - 8th International Conference on Language Resources and Evaluation, May 2012, Istanbul, Turkey. ⟨hal-02951105⟩