TechTalk – Optimizing Distributed Large Model Training in AI Clouds

All members of the HKU community and the general public are welcome to join!
Speaker: Professor Chuan Wu, Professor in School of Computing and Data Science, HKU
Date: 12th December 2024 (Thursday)
Time: 4:30pm
Mode: Mixed
About the TechTalk
All members of the HKU community and the general public are welcome to join!
Speaker: Professor Chuan Wu, Professor in School of Computing and Data Science, HKU
Moderator: Professor Ka-Ho Chow, Assistant Professor, School of Computing and Data Science, HKU
Date:  12th December 2024 (Thursday)
Time: 4:30pm
Mode: Mixed (both face-to-face and online). Seats for on-site participants are limited. A confirmation email will be sent to participants who have successfully registered.
Language: English

Distributed training using a large number of devices has been widely adopted for learning large deep learning models. Improving distributed training efficiency is critical for time, resource and energy consumption of large model learning. In this talk, I will introduce recent research works in my group on optimizing distributed training parallelisms for effective training acceleration and maximal resource utilization. Especially, we have designed optimized strategies and systems for operator sharding, computation and communication scheduling for SPMD parallelism (e.g., in Mixture-of-Experts model training) in both homogeneous and heterogeneous AI clusters, as well as dynamic micro-batching and pipelining to tackle sequence length variation in multi-task model training (e.g., Large Language Model training).

Registration
  • The tech talk “Optimizing Distributed Large Model Training in AI Clouds” will be organized in the Tam Wing Fan Innovation Wing Two (G/F, Run Run Shaw Building, HKU) on 12th December 2024 (Thursday), 4:30pm.
  • Seats are limited. Zoom broadcast is available if the seating quota is full. 
  • Registrants on the waiting list will be notified of the arrangement after the registration deadline (with seating/free-standing/other arrangement)
Recording of the Tech Talk
About the speaker

Professor Chuan Wu

Professor Chuan Wu received her B.Engr. and M.Engr. degrees in 2000 and 2002 from Tsinghua University and Ph.D. degree in 2008 from University of Toronto. Since September 2008, she has been with Department of Computer Science at the University of Hong Kong, and is currently a Professor. Her current main research is on distributed machine learning systems and algorithms. She has served as associate editor for ACM/IEEE Transactions on Networking, IEEE Transactions on Cloud Computing, etc. She has active collaborations with various AI cloud operators (AWS, ByteDance, Alibaba, etc.), and received an Amazon Research Award (ARA) on AWS AI in 2021. 

Promotion materials
About the project

Multifunctional Filters for Protecting Public Health

Clean water and clean air are vital for public health. This project focuses on developing high-efficiency and environmentally sustainable filters for removing harmful air/water pollutants. The team has developed novel architectures and functionalities for the filters to achieve high permeance, high removal efficiency, and excellent reusability.

Other Tech talks