Research Intern - LLM Performance Optimization
Microsoft is a leading technology company providing a dynamic environment for research careers through its world-class research labs. The Research Intern will collaborate with researchers and fellow interns to contribute to innovative projects in AI, specifically focusing on Large Language Model performance optimization during a 12-week internship.
Responsibilities
- Research Interns put inquiry and theory into practice
- Alongside fellow doctoral candidates and some of the world’s best researchers, Research Interns learn, collaborate, and network for life
- Research Interns not only advance their own careers but also contribute to exciting research and development strides
- During the 12-week internship, Research Interns are paired with mentors and expected to collaborate with other Research Interns and researchers, present findings, and contribute to the vibrant life of the community
- Research internships are available in all areas of research, and are offered year-round, though they typically begin in the summer
Skills
- Currently enrolled in a PhD program in Computer Science or a related STEM field
- At least 1 year of experience with Large Language Model architecture or inference performance optimization
- Demonstrated ability to assess and fix kernel performance bottlenecks for GPUs or other high-performance parallel computing architectures
- Familiarity with optimizing compiler architecture and intermediate representations (such as LLVM IR or MLIR)
- Ability to think unconventionally to derive creative and innovative solutions
Benefits
- Certain roles may be eligible for benefits and other compensation