Young Scholar TechTalk – Secure and High-performance AI Serving: Protecting AI Secretes, Accelerating AI Insights

All members of the HKU community and the general public are welcome to join!
Speaker: Mr Tianxiang Shen, PhD Candidate, Department of Computer Science, Faculty of Engineering, HKU
Date: 19th September 2023 (Tuesday)
Time: 4:30pm
Mode: Mixed
About the Tech Talk
All members of the HKU community and the general public are welcome to join!
Speaker: Mr Tianxiang Shen, PhD Candidate, Department of Computer Science, Faculty of Engineering, HKU
Moderator: Mr Bocheng Xiao, PhD Candidate, Department of Computer Science, Faculty of Engineering, HKU
Date: 19th September 2023 (Tuesday)
Time: 4:30pm
Mode: Mixed (both face-to-face and online). Seats for on-site participants are limited. A confirmation email will be sent to participants who have successfully registered.
Language: English

Driven by the remarkable success of artificial intelligence (AI) and edge computing, the deployment of well-trained private AI models on third-party edge devices for mission-critical applications has become increasingly prevalent. Safeguarding these private models on untrusted devices, while simultaneously speeding up model serving (i.e., inference) through accelerators like GPUs, has escalated in urgency.
We introduce SOTER, a new AI serving system that, for the first time, achieves both high security and high performance. Harnessing the associativity property of AI operators, SOTER presents an innovative approach—transforming computationally expensive AI operators into parameter-morphed equivalents for secure execution on untrusted but fast GPUs, and losslessly restoring inference results within trusted execution environments (TEEs) in CPUs. Experimental results on six prevalent AI models in the three most popular categories show that, even with stronger model protection, SOTER achieves comparable performance with baselines while retaining the same high accuracy as insecure AI model inference.

Architecture of Deep Neural Network
Architecture of our secure AI serving system
Registration
  • The tech talk “Secure and High-performance AI Serving: Protecting AI Secretes, Accelerating AI Insights” will be organized in the Tam Wing Fan Innovation Wing Two (G/F, Run Run Shaw Building, HKU) on 19th September 2023 (Tuesday)4:30 pm.
  • Seats are limited. Zoom broadcast is available if the seating quota is full. 
  • Registrants on the waiting list will be notified of the arrangement after the registration deadline (with seating/free-standing/other arrangement)
About the speaker

Mr Tianxiang Shen

Mr Tianxiang is currently a fifth-year PhD student supervised by Dr. Heming Cui. Mr Tianxiang has published 9 papers in top-tier international conferences and journals (e.g., ATC and TDSC). He serves as an invited reviewer of several top (CCF-A) journals, including TPDS, TDSC, and JSAC. He was the web chair of APSys 2021 and the artifact evaluation committee for SOSP 2021. He focuses on building secure collaborative computing systems (including data networking system, distributed ledger, distributed database), and maliciously-secure AI training and serving systems. Mr Tianxiang’s personal website is at https://tianxiang999.github.io/ 

 

Promotion
Other Tech talks