Software Engineer - Data Storage & Data Lake (ByteDance Singapore)

Singapore·R&D·engineering
Apply on ByteDance (TikTok) →

About the team The Data Ecosystem Team has the vital role of crafting and implementing a storage solution for offline data in our recommendation system, which caters to more than a billion users. Their primary objectives are to guarantee system reliability, uninterrupted service, and seamless performance. They aim to create a storage and computing infrastructure that can adapt to various data sources within the recommendation system, accommodating diverse storage needs. Their ultimate goal is to deliver efficient, affordable data storage with easy-to-use data management tools for the recommendation, search, and advertising functions. What you will be doing: 1. Responsible for the design and development of distributed database Hbase-related components. 2. Responsible for the design and development of single-node LSM engine Rocksdb-related components. 3. Design and implement an offline/real-time data architecture for large-scale recommendation systems. 4. Design and implement a flexible, scalable, stable, and high-performance storage system and computation model. 5. Troubleshoot production systems, and design and implement necessary mechanisms and tools to ensure the overall stability of production systems. 6. Build industry-leading distributed systems such as offline and online storage, batch, and stream processing frameworks, providing reliable infrastructure for massive data and large-scale business systems.

More open roles at ByteDance (TikTok)