SEMINAR

Innovations in Text-Guided Visual Content Generation

Speaker

Wang Hao

Working
Nanyang Technological University
Timeline
Mon, Jul 17 2023 - 11:00 am (GMT + 7)
About Speaker

WANG Hao is a final year PhD candidate in the School of Computer Science and Engineering at Nanyang Technological University, Singapore. He received the B.E. degree from Huazhong University of Science and Technology, China. His research interest is developing AI-powered perception and generation algorithms for the multimodal domain. In particular, his recent work investigates the translation between visual and text data, to generate controllable contents with efficiency and robustness. He has published first-authored top-tier conference and journal work in computer vision and multimedia fields, including CVPR, ECCV, IEEE TPAMI, IEEE TIP, etc.

Abstract

Text-guided visual content generation is a significant task in generative AI, which focuses on translating semantic information from text to visual content. Generating complex and high-quality visuals while maintaining control is a key challenge in this domain. In this talk, we will introduce two innovative frameworks: StyleGAN-based inversion and online alignment. These frameworks aim to overcome the existing challenges, where we enable high-fidelity visual generation and cross-modal semantic matching simultaneously. With our approach, the inference phase allows for the direct generation of visual content from textual input, streamlining the process into a single step.

Related seminars

Dr. Tu Vu

Virginia Tech

Efficient Model Development in the Era of Large Language Models
Tue, Nov 5 2024 - 09:30 am (GMT + 7)
Representation Learning with Graph Autoencoders and Applications to Music Recommendation
Fri, Jul 26 2024 - 10:00 am (GMT + 7)

Trieu Trinh

Google Deepmind

AlphaGeometry: Solving IMO Geometry without Human Demonstrations
Fri, Jul 5 2024 - 10:00 am (GMT + 7)

Tat-Jun (TJ) Chin

Adelaide University

Quantum Computing in Computer Vision: A Case Study in Robust Geometric Optimisation
Fri, Jun 7 2024 - 11:00 am (GMT + 7)