K8s Resource Optimization System
Built an optimization service for GPU/CPU right-sizing across namespaces.
View ProjectAI ENGINEERING PRODUCT SYSTEM
Building scalable GPU-native AI systems for LLM & Agent workloads
Problem / Architecture / Solution / Result / Tech Stack
Built an optimization service for GPU/CPU right-sizing across namespaces.
View ProjectDesigned a multi-tenant training platform for distributed model fine-tuning.
View ProjectBuilt a fair-share, latency-aware GPU scheduler for mixed LLM workloads.
View Project1. 摘要 随着人工智能、高性能计算(HPC)、深度学习等算 […]
Read