[Remote] Student Researcher [Seed LLM Horizon – Multi-turn Tool Use] - 2026 Start (PhD)
Note: This is a remote position open to candidates in the USA. ByteDance is a leading technology company committed to inspiring creativity and enriching life. The company is seeking passionate, self-driven researchers for its Seed LLM Horizon Team to contribute to cutting-edge research in agent intelligence and model development.
Responsibilities
- Enable models to make deep use of professional tools (e.g., search, code interpreter) to solve complex problems
- Develop approaches to generalize model abilities to millions of out-of-distribution (OOD) tools and scenarios
- Scale up multi-turn tool-use training tasks and explore effective training methods
- Address challenges of long-horizon, multi-turn tasks in reinforcement learning
Skills
- Currently pursuing a PhD in Computer Science, Software Engineering, Machine Learning, or a related field
- Research experience in one or more of the following: reinforcement learning, LLM agents, memory systems, tool use, or interactive learning
- Strong coding skills and proficiency with modern deep learning frameworks
- Demonstrated ability to conduct independent research, with publications in top-tier ML/AI conferences such as NeurIPS, ICML, ICLR, ACL, and EMNLP
- Experience with long-horizon reasoning, multi-turn tasks, or asynchronous agent behavior
- Familiarity with agent evaluation, personalization, or real-world tool integration
- Background in building or analyzing large-scale agent training pipelines
- Ability to collaborate effectively in a fast-paced, research-driven team environment
Benefits
- Day one access to health insurance
- Life insurance
- Wellbeing benefits
- 10 paid holidays per year
- Paid sick time (56 hours if hired in the first half of the year, 40 hours if hired in the second half)
- Housing allowance