Software Engineer, Inference AI/ML
CoreWeave is The Essential Cloud for AI™, providing a platform for innovators to build and scale AI. The role involves joining the Inference team to implement features that enhance model serving on the GPU platform, focusing on improving latency, reliability, and cost.
Responsibilities
- Implement well-scoped features and fixes in Python/Go/C++ for model-serving services (e.g., Triton, vLLM, TensorRT-LLM, Ray Serve)
- Write tests, code comments, and short design docs; participate in code reviews
- Add basic metrics and dashboards; assist with alarms and runbooks
- Follow on-call runbooks and learn incident response in a guided rotation
- Contribute to performance experiments (e.g., request batching, concurrency, caching) with guidance
Skills
- BS/MS in CS, EE, or related field, or equivalent practical experience
- Foundations in data structures, algorithms, and networked services
- Experience with Python or Go (C++ a plus) and Linux fundamentals; Git/CI basics
- Exposure to containers and Kubernetes (coursework or projects welcome)
- Curiosity about GPU inference concepts (micro-batching, KV cache, streaming)
- Internship or project that deployed a microservice or ML inference demo
- Coursework/research with PyTorch or TensorFlow; simple CUDA projects a plus
- Familiarity with Grafana/Prometheus/OpenTelemetry or similar tooling
Benefits
- Medical, dental, and vision insurance - 100% paid for by CoreWeave
- Company-paid Life Insurance
- Voluntary supplemental life insurance
- Short and long-term disability insurance
- Flexible Spending Account
- Health Savings Account
- Tuition Reimbursement
- Ability to Participate in Employee Stock Purchase Program (ESPP)
- Mental Wellness Benefits through Spring Health
- Family-Forming support provided by Carrot
- Paid Parental Leave
- Flexible, full-service childcare support with Kinside
- 401(k) with a generous employer match
- Flexible PTO
- Catered lunch each day in our office and data center locations
- A casual work environment
- A work culture focused on innovative disruption
Company Overview
Apply To This Job