I'm a graduate student at UCLA, specializing in real-world systems for ML and LLM agents (with compound AI systems). Previously, I worked as a senior research engineer at A*STAR, Singapore, and Microsoft Research Asia.
- Oct 18, 2024 - A Close-look at RAG System Optimization
- Aug 12, 2024 - Serving LLMs in Production - The Optimization
- Dec 09, 2023 - Lifelong Learning in Modern AI Systems
- Apr 05, 2023 - Using Checkpoint Recovery in Large-Scale ML Training
- May 26, 2021 - Model Serving Performance Test - A Deep Dive