SEMINAR

Generative Sequence Models for Sequential Decision Making

Speaker

Aditya Grover

Working
University of California, Los Angeles
Timeline
Fri, May 6 2022 - 10:00 am (GMT + 7)
About Speaker

Aditya Grover is an Assistant Professor of Computer Science at UCLA. His goal is to develop efficient machine learning approaches for probabilistic reasoning under limited supervision, with a focus on deep generative modeling and sequential decision-making under uncertainty. He is also an affiliate faculty at the UCLA Institute of the Environment and Sustainability, where he grounds his research in real-world applications in climate science and sustainable energy. His 35+ research works have been published at top-tier scientific conferences and journals including Nature, deployed into production at major technology companies (Instagram, Twitter), and covered in major press venues, such as the Wall Street Journal and Wired. Aditya’s research has been recognized with two best paper awards (NeurIPS, StarAI), several research fellowships (Google-Simons Institute, Microsoft Research, Lieberman, Adobe), and the ACM SIGKDD doctoral dissertation award. Aditya received his postdoctoral training at UC Berkeley, Ph.D. from Stanford, and bachelors from IIT Delhi, all in computer science.

Abstract

The ability to make decisions under uncertainty is a key component of intelligence. We introduce a framework that abstracts sequential decision making as a generative sequence modeling problem. This allows us to draw upon the simplicity and scalability of the Transformer architecture, and associated advances in language modeling such as GPT-x. I will show how this framework permits learning from large offline datasets, uncertainty-guided online exploration, and generalization across multiple tasks. On various benchmarks from continuous control to game playing, our framework matches or exceeds the performance of state-of-the-art algorithms.

Related seminars

Representation Learning with Graph Autoencoders and Applications to Music Recommendation
Fri, Jul 26 2024 - 10:00 am (GMT + 7)

Trieu Trinh

Google Deepmind

AlphaGeometry: Solving IMO Geometry without Human Demonstrations
Fri, Jul 5 2024 - 10:00 am (GMT + 7)

Tat-Jun (TJ) Chin

Adelaide University

Quantum Computing in Computer Vision: A Case Study in Robust Geometric Optimisation
Fri, Jun 7 2024 - 11:00 am (GMT + 7)

Fernando De la Torre

Carnegie Mellon University

Human Sensing for AR/VR
Wed, Apr 24 2024 - 07:00 am (GMT + 7)