Back to Jobs

[Remote] Generative AI Inference Engineer

Remote, USA Full-time Posted 2025-11-24
Note: The job is a remote job and is open to candidates in USA. Stability AI is seeking passionate Machine Learning Engineers to join their Inference team, focusing on the creative applications of generative AI models. The role involves leading the design and development of customer-facing multi-modal ML inference systems and collaborating with various teams to optimize and deploy cutting-edge models. Responsibilities • Lead efforts to drive the design, development of customer-facing multi modal ML inference systems • Work with the Platform and Inference teams on building inference systems for the next generation of models, where you will work on areas such as optimization, model tuning and deployment • Partner with leading cloud providers to deliver hosted Stability AI inference solutions • Be a strategic thought partner for leaders across the organization on driving business impact through machine learning • Be part of the team to bring new Stability models and pipelines into existence • Prototype and productionize inference platform improvements and new features Skills • 7+ years working on productionizing machine learning systems, including inference pipeline development • Expert level knowledge on writing and running python services at scale • 5+ years working on python scientific stack, pyTorch and at least one high-performance inference framework (e.g. Triton and TensorRT) • Deep understanding of Diffusion Architecture • Experience profiling and optimizing deep neural networks on Nvidia GPUs, using profiling tools such as NVIDIA Nsight • Experience with python-based image manipulation/encoding/decoding frameworks, such as OpenCV • Experience deploying to cloud orchestration systems such as Kubernetes and cloud providers such as AWS, GCP, and Azure • Experience with Docker • Ability to rapidly prototype solutions and iterate on them with tight product deadlines • Strong communication, collaboration, and documentation skills • Experience with the open-source ML ecosystem (HuggingFace, W&B, etc.) Company Overview • Stability AI is an artificial intelligence company focused on developing open-source generative AI models. It was founded in 2019, and is headquartered in London, England, GBR, with a workforce of 51-200 employees. Its website is Apply tot his job Apply tot his job Apply To this Job

Similar Jobs

Remote Entry-Level Data Entry Clerk – Accurate Information Management & Administrative Support at arenaflex

Remote, USA Full-time

**Experienced Customer Service Associate – Insurance Industry Expertise**

Remote, USA Full-time

UPS Remote Jobs (Data Entry| Full Time) Work Fr...

Remote, USA Full-time

VP, Operations (Remote)

Remote, USA Full-time

Testing and Quality Assurance Specialist

Remote, USA Full-time

Astronomy Specialist – Remote | $80/hr

Remote, USA Full-time

Online Chat Moderator Position - Entry-Level, $...

Remote, USA Full-time

Remote Spanish Interpreter — Fast, Accurate & Per-Minute Pay

Remote, USA Full-time

Tagger Jobs, Netflix, Jobs With Netflix, Netfli...

Remote, USA Full-time

Sr. Commodity Manager - Renewable Energy & Service Operations

Remote, USA Full-time

Part-Time-Remote Amazon Data Entry Jobs (URGENT)

Remote, USA Full-time

Executive Assistant I – Technology job at lululemon athletica in Seattle, WA, Canada

Remote, USA Full-time

**Experienced Customer Service Representative – Work from Home Opportunity with blithequark**

Remote, USA Full-time

Customer Service Manager

Remote, USA Full-time

Entry-Level Remote Work-from-Home Opportunity in Tech Industry with Competitive Pay and Flexible Scheduling

Remote, USA Full-time

Entry Level Data Entry Assistant – Remote Opportunity for Career Growth and Development in Data Management Services

Remote, USA Full-time

Claims Representative

Remote, USA Full-time

[Remote] Oracle Fusion Cloud HCM Functional Consultant

Remote, USA Full-time

Senior Manager Advertising Analytics

Remote, USA Full-time

CMC Project Manager

Remote, USA Full-time