Affichage des résultats 1 à 12 sur 33 au total
Vision par ordinateur et reconnaissance de formes : 1 à 12 sur 33 au total
-
Pré-publication, Document de travail
Omar Adjali, Olivier Ferret, Sahar Ghannay, Hervé Le Borgne. Entity-aware cross-modal pretraining for Knowledge-Based Visual Question Answering. 2024. ⟨cea-04910767⟩
-
Pré-publication, Document de travail
Mariia Zameshina, Mathurin Videau, Alessandro Leite, Marc Schoenauer, Laurent Najman, et al.. Agnostic latent diversity enhancement in generative modeling. 2024. ⟨hal-04661473v2⟩
-
Pré-publication, Document de travail
Vincent Blot, Anastasios N Angelopoulos, Michael I Jordan, Nicolas J-B Brunel. Automatically Adaptive Conformal Risk Control. 2024. ⟨hal-04616940v4⟩
-
Communication dans un congrès
Dana Aubakirova, Kim Gerdes, Lufei Liu. PatFig: Generating Short and Long Captions for Patent Figures. ICCV workshop: CLVL: 5th Workshop on Closing the Loop Between Vision and Language, Computer Vision Foundation, Oct 2023, Paris, France. pp.2843-2849, ⟨10.1109/ICCVW60793.2023.00305⟩. ⟨hal-04408316⟩
-
Article dans une revue
Adrien Arnaud, Michèle Gouiffès, Mehdi Ammi. On the Fly Plane Detection and Time Consistency for Indoor Building Wall Recognition Using a Tablet Equipped With a Depth Sensor. IEEE Access, 2018, 6, pp.17643 - 17652. ⟨10.1109/access.2018.2817838⟩. ⟨hal-04404876⟩
-
Communication dans un congrès
Omar Adjali, Romaric Besançon, Olivier Ferret, Hervé Le Borgne, Brigitte Grau. Multimodal entity linking for tweets. ECIR 2020 - 42nd European Conference on Information Retrieval Research, Apr 2020, Lisbonne (Online event), Portugal. pp.463-478, ⟨10.1007/978-3-030-45439-5⟩. ⟨hal-04315181⟩
-
Communication dans un congrès
Omar Adjali, Romaric Besançon, Olivier Ferret, Hervé Le Borgne, Brigitte Grau. Building a multimodal entity linking dataset from tweets. LREC 2020 - Language Resources and Evaluation Conference, May 2020, Marseille, France. pp.4885-4292. ⟨hal-04315504⟩
-
Pré-publication, Document de travail
Birhanu Hailu Belay, Isabelle Guyon, Tadele Mengiste, Bezawork Tilahun, Marcus Liwicki, et al.. HHD-Ethiopic A Historical Handwritten Dataset for Ethiopic OCR with Baseline Models and Human-level Performance. 2023. ⟨hal-04223188⟩
-
Communication dans un congrès
Ihsan Ullah, Dustin Carrión-Ojeda, Sergio Escalera, Isabelle Guyon, Mike Huisman, et al.. Meta-Album: Multi-domain Meta-Dataset for Few-Shot Image Classification. NeurIPS 2022 - 36th Conference on Neural Information Processing Systems - Track on Datasets and Benchmarks, NeurIPS, Nov 2022, New Orleans, United States. ⟨hal-03991982⟩
-
Communication dans un congrès
Hervé Bredin, Johann Poignant, Guillaume Fortier, Makarand Tapaswi, Viet-Bac Le, et al.. QCompere @ REPERE 2013. SLAM 2013 - First Workshop on Speech, Language and Audio for Multimedia, Aug 2013, Marseille, France. pp.49-54. ⟨hal-00949320⟩
-
Communication dans un congrès
Khalil Bergaoui, Yassine Naji, Aleksandr Setkov, Angelique Loesch, Michèle Gouiffès, et al.. Object-Centric And Memory-Guided Normality Reconstruction For Video Anomaly Detection. 2022 IEEE International Conference on Image Processing (ICIP), Oct 2022, Bordeaux, France. pp.2691-2695, ⟨10.1109/ICIP46576.2022.9897259⟩. ⟨hal-03880897⟩
-
Communication dans un congrès
Yassine Naji, Aleksandr Setkov, Angelique Loesch, Michèle Gouiffès, Romaric Audigier. Spatio-temporal predictive tasks for abnormal event detection in videos. 2022 18th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), Nov 2022, Madrid, France. pp.1-8, ⟨10.1109/AVSS56176.2022.9959669⟩. ⟨hal-03880911⟩