Student Researcher (Multimodal Interaction and World Model - Seed) – 2027 Start (PhD)
About the team The Seed Multimodal Interaction and World Model team is dedicated to developing models that have human-level multimodal understanding and interaction capabilities. The team is working to advance the exploration and development of multimodal assistant products. Responsibilities - Conduct research on multimodal foundation models and related systems. - Explore methods to improve model capabilities across modalities, including areas such as vision-language modeling, world modeling, and representation learning. - Design and prototype algorithms, models, or system components. - Collaborate with the team to advance research directions.