Back to Jobs

Senior Site Reliability Engineer - Networking

Remote, USA Full-time Posted 2025-11-03

Join Lambda, a pioneering AI computing platform, in our mission to make computation as effortless and ubiquitous as electricity. As a Senior Site Reliability Engineer - Networking, you will play a critical role in scaling our high-performance cloud network, ensuring high availability, and delivering predictable networking performance to our clients.

We're looking for an experienced engineer with a passion for building and maintaining large-scale networks. If you have a strong background in networking, a keen eye for detail, and excellent problem-solving skills, we want to hear from you.

Key Responsibilities:

  • Scale Lambda's high-performance cloud network, ensuring high availability and reliability
  • Contribute to the automation of network configuration and deployments, using tools like Ansible and CI/CD pipelines
  • Implement and operate Software Defined Networks (SDN), with experience in OpenStack, Neutron, and OVN
  • Deploy and manage Spine and Leaf networks, with a focus on predictable networking performance
  • Ensure high availability of our network through observability, failover, and redundancy
  • Collaborate with cross-functional teams to deliver high-quality networking solutions

Requirements:

  • 5+ years of experience in Site Reliability Engineering, Software Engineering, or Network Reliability Engineering
  • Proven experience in implementing production-scale networking projects, with a focus on scalability and reliability
  • Experience with on-call and incident response management, with excellent problem-solving skills
  • Strong understanding of Linux networking stack, with experience in building and maintaining SDN, OpenStack, and Neutron
  • Proficiency in Python programming, with experience in configuration management tools like Ansible
  • Experience with CI/CD tools, GIT, and GitOps practices
  • Familiarity with Kubernetes, with experience in application lifecycle and deployments

Nice to Have:

  • Experience operating production-scale SDNs in a cloud context
  • Software development experience in C, GO, or Python
  • Experience automating network configuration within public clouds, using tools like Kubernetes, HELM, Terraform, and Ansible
  • Deep understanding of the Linux networking stack, with experience in network virtualization, SR-IOV, and DPDK
  • Understanding of the SDN ecosystem, with experience in OVS, Neutron, VMware NSX, or Cisco ACI

What We Offer:

  • Competitive salary range, with generous cash and equity compensation
  • Opportunities for professional growth and development, with a fast-growing company
  • Comprehensive benefits package, including health, dental, and vision coverage
  • Flexible Paid Time Off Plan, with a 401k Plan and 2% company match
  • Commuter/Work from home stipends for select roles

About Lambda:

  • Founded in 2012, with ~350 employees and growing fast
  • Backed by top investors, including Andra Capital, SGW, and NVIDIA
  • Experiencing high demand for our systems, with quarter over quarter, year over year profitability
  • Publishing research papers in top machine learning and graphics conferences

Equal Opportunity Employer:

Lambda is an Equal Opportunity employer, committed to building a team with a variety of backgrounds, experiences, and skills. We consider applicants without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.

Apply To This Job

Apply for this job  

Similar Jobs