Annotation-Efficient Learning for Object Discovery and Detection


Huy V. Vo

Tue, Jun 6 2023 - 02:30 pm (GMT + 7)
About Speaker

Huy V. Vo is an AI Research Scientist at FAIR Labs, Meta. He obtained his PhD in Computer Science from Ecole Normale Supérieure in 2022. His thesis was prepared in the INRIA’s WILLOW team and, under the supervision of Jean Ponce (INRIA) and Patrick Pérez ( Prior to his PhD, he obtained the Engineer’s degree on Maths and Computer Science from Ecole Polytechnique in 2017, and the Math-Vision-Machine Learning Master’s degree of Ecole Normale Supérieure Paris-Saclay in 2018. His research interests revolve around learning problems with little or no supervision, including unsupervised object discovery, weakly supervised object detection, active learning and more recently, self-supervised learning.


Object detectors are important components of intelligent systems such as autonomous vehicles or robots. They are typically obtained with fully-supervised training, which requires large manually annotated datasets whose construction is time-consuming and costly. In this talk, I will discuss several alternatives to fully-supervised object detection that work with less or even no manual annotation. I will first focus on the unsupervised object discovery problem, which, given an image collection without manual annotation, aims at identifying pairs of images that contain similar objects and localizing these objects. I will present two optimization-based approaches (OSD, CVPR’19; rOSD, ECCV’20), a ranking method (LOD, NeurIPS’21) and a simple seed-growing approach that exploits features from self-supervised transformers (LOST, BMVC’21) to this problem. In the second part of the talk, I will discuss a practical scenario which combines weakly-supervised and active learning for training an object detector, and introduce BiB (ECCV’22), an active learning strategy tailored for this scenario. I show that our pipeline with BiB offers a better trade-off between annotation cost and effectiveness than both weakly- and fully-supervised object detection.

Related seminars

Trieu Trinh

Google Deepmind

AlphaGeometry: Solving IMO Geometry without Human Demonstrations
Fri, Jul 5 2024 - 10:00 am (GMT + 7)

Tat-Jun (TJ) Chin

Adelaide University

Quantum Computing in Computer Vision: A Case Study in Robust Geometric Optimisation
Fri, Jun 7 2024 - 11:00 am (GMT + 7)

Fernando De la Torre

Carnegie Mellon University

Human Sensing for AR/VR
Wed, Apr 24 2024 - 07:00 am (GMT + 7)

Anh Nguyen

Microsoft GenAI

The Revolution of Small Language Models
Fri, Mar 8 2024 - 02:30 pm (GMT + 7)