Conference paper, Computer Science, Document and Text Processing

Kitten: a tool for normalizing HTML and extracting its textual content.

Mathieu-Henri Falco, Véronique Moriceau, Anne Vilnat. Kitten: a tool for normalizing HTML and extracting its textual content. 8th International Conference on Language Resources and Evaluation - LREC 2012, May 2012, Istanbul, Turkey. ⟨hal-02951105⟩
