Software engineer at Anthropic, passionate about AI infrastructure and high-performance systems.
- Work on deploying, scaling, and operating LLM inference systems on clouds
- Learning about LLM inference optimization, and GPU acceleration
- Large language model inference optimization
- CUDA and GPU programming
- Distributed serving frameworks
- Contributing to large-scale open-source projects
- GitHub: @jia-gao
π‘ Open to collaborations on open-source AI infrastructure and ML systems projects!
