Research Scientist - DPU & AI Infra
About the Team The ByteDance DPU (Data Processing Unit) team is building the foundational computing infrastructure for ByteDance and Volcano Engine Public Cloud. Our mission is to advance the architecture, development, and research of next-generation software-hardware technologies across compute, networking, and storage for cloud and AI computing. Our technology stack spans - Cloud virtualization & hypervisors - High-performance user-space network protocols (DPDK, RDMA, etc.) - High-speed interconnect and virtual switching - Distributed storage acceleration - GPU virtualization and scheduling for AI/ML workloads We work at the intersection of software systems, distributed infrastructure, and custom hardware acceleration, shaping the next wave of cloud-scale computing. Responsibilities - Design and develop DPU network software with a focus on high performance, low latency, and reliability. - Collaborate with hardware teams to build software-hardware co-design solutions for networking and storage acceleration. - Explore AI/ML infrastructure acceleration, leveraging DPUs, GPUs, and custom hardware to optimize distributed training and inference. - Drive end-to-end performance optimization, from OS kernels and drivers to user-space runtime systems. - Contribute to architecture design, technical proposals, and long-term research directions.