zhyncs

Follow

🎯

Yineng Zhang zhyncs

🎯

Follow

Just for fun 🌁

1.8k followers · 22 following

Bay Area, CA
17:52 (UTC -07:00)
https://zhyncs.com
@zhyncs42

Sponsors

Achievements

Achievements

Organizations

zhyncs/README.md

Hi there 👋

💼 Principal AI Researcher at Together AI — creator and lead of TGL, the company’s proprietary inference engine.
🔭 I co-lead the SGLang project with Lianmin Zheng and Ying Sheng, driving releases, optimization, and roadmap. I have led major versions and blogs including Llama 3, DeepSeek V3, Large Scale EP, and GB200 NVL72.
📚 Co-author of the FlashInfer paper (MLSys 2025 Best Paper) and committer to FlashInfer. Previously, I was Lead Software Engineer at Baseten (co-authored the DeepSeek V3 and Qwen 3 launches) and led CTR GPU inference and vector retrieval system development at Meituan.
🎤 Interviewed by The New York Times (Article 1, Article 2), Featured speaker at AMD AI DevDay 2025 and PyTorch Conference 2025.
📫 Contact: [email protected] | Telegram | LinkedIn | Homepage
🙌 Best reached through SGLang Slack — we’re always looking for open-source enthusiasts and contributors to grow the community.

Pinned Loading

sgl-project/sglang sgl-project/sglang Public

SGLang is a fast serving framework for large language models and vision language models.

Python 18.4k 3k
flashinfer-ai/flashinfer flashinfer-ai/flashinfer Public

FlashInfer: Kernel Library for LLM Serving

Cuda 3.8k 518