Student Researcher (Speech Foundation Model - Seed) – 2027 Start (PhD)
About the team
The mission of the Seed Speech team is to enrich interactive and creative processes through the application of multimodal speech technologies. The team focuses on the forefront of research and product development in speech and audio, music, natural language understanding, and multimodal deep learning.

Responsibilities
- Conduct research on speech foundation models and related systems.
- Explore methods to advance model capabilities in areas such as speech generation, speech understanding, and multimodal modeling involving speech, language, or vision.
- Design and prototype algorithms, models, or system components.
- Collaborate with the team to advance research directions.