Student Researcher (LLM Post Training – Agent & Reinforcement Learning) - 2026 Start (PhD)
About the team

The Seed LLM Post Training team researches cutting-edge post-training techniques and provides core post-training capabilities for unified multimodal large models. The team's goal is to explore next-generation techniques for the post-training phase, such as SFT, RM, RL, and self-learning, while significantly improving key areas including reasoning, coding, agents, and omni models.

We are looking for talented individuals to join us for an internship in 2026. PhD internships at our Company provide students with the opportunity to actively contribute to our products and research, and to the organization's future plans and emerging technologies. Our dynamic internship experience blends hands-on learning, enriching community-building and development events, and collaboration with industry experts.

Applications will be reviewed on a rolling basis, so we encourage you to apply early. Please state your availability clearly in your resume (Start date, End date).

Responsibilities
- Explore large-scale models and optimize systems.
- Work on data construction, instruction tuning, preference alignment, and model optimization.
- Improve relevant model capabilities, such as reasoning, coding, and math.
- Conduct in-depth research and exploration of future use cases.