Young Scholar TechTalk – Learning Out-of-Distribution Object Detectors from Foundation Models

All members of the HKU community and the general public are welcome to join!
Speaker: Mr Jiahui Liu, PhD candidate, Department of Electrical and Electronic Engineering, Faculty of Engineering, HKU
Date: 16th September 2024 (Monday)
Time: 4:30pm
Mode: Mixed
About the TechTalk
All members of the HKU community and the general public are welcome to join!
Speaker: Mr Jiahui Liu, PhD candidate, Department of Electrical and Electronic Engineering, Faculty of Engineering, HKU
Moderator: Ms Yingxian Chen, PhD candidate, Department of Electrical and Electronic Engineering, Faculty of Engineering, HKU
Date: 16th September 2024 (Monday)
Time: 4:30pm
Mode: Mixed (both face-to-face and online). Seats for on-site participants are limited. A confirmation email will be sent to participants who have successfully registered.
Language: English

Out-of-distribution (OOD) object detection is a challenging task due to the absence of open-set OOD data. Inspired by recent advancements in text-to-image generative models, such as Stable Diffusion, we study the potential of generative models trained on large-scale open-set data to synthesize OOD samples, thereby enhancing OOD object detection. We introduce SyncOOD, a simple data curation method that capitalizes on the capabilities of large foundation models to automatically extract meaningful OOD data from text-to-image generative models. This offers the model access to open-world knowledge encapsulated within off-the-shelf foundation models. The synthetic OOD samples are then employed to augment the training of a lightweight, plug-and-play OOD detector, thus effectively optimizing the in-distribution (ID)/OOD decision boundaries. Extensive experiments across multiple benchmarks demonstrate that SyncOOD significantly outperforms existing methods, establishing new state-of-the-art performance with minimal synthetic data usage.

Registration
  • The tech talk “Learning Out-of-Distribution Object Detectors from Foundation Models” will be organized in the Tam Wing Fan Innovation Wing Two (G/F, Run Run Shaw Building, HKU) on 16th September 2024 (Monday), 4:30pm.
  • Seats are limited. Zoom broadcast is available if the seating quota is full. 
  • Registrants on the waiting list will be notified of the arrangement after the registration deadline (with seating/free-standing/other arrangement)
Recording of the Tech Talk
About the speaker

Mr Jiahui Liu

Mr Jiahui Liu is currently a Ph.D. candidate at Computer Vision and Machine Intelligence Laboratory, the University of Hong Kong, supervised by Professor Xiaojuan Qi. Before that, he obtained his M.Sc degree at the Department of Computer Science, University of Sheffield, and his B.Sc degree at the Department of Mathematics, East China University of Science and Technology. His research interests are mainly in 2D/3D Open-World Understanding, OOD Detection & Generation, and Multimodal Large Language Models.

Promotion materials
About the project

Multifunctional Filters for Protecting Public Health

Clean water and clean air are vital for public health. This project focuses on developing high-efficiency and environmentally sustainable filters for removing harmful air/water pollutants. The team has developed novel architectures and functionalities for the filters to achieve high permeance, high removal efficiency, and excellent reusability.

Other Tech talks