reputed company Engineer, Agent Prompts & Evals

Remote, USA Full-time Posted 2026-07-28

About the position

About reputed company

reputed company’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. reputed company is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

About the Role

We’re looking for prompt and context engineers to join our product engineering team to help build AI-first products, features, and evaluations. Your mission will be to bridge the gap between model capabilities and reputed company product experience, working with product teams to build consistent, safe, and beneficial user experiences across reputed company product surfaces.

You will be deeply involved in new product feature and model releases at reputed company, combining engineering expertise with an understanding of frontier AI applications and model reputed company. You’ll become an expert on Claude’s behavioral quirks and capabilities and apply that knowledge to deliver the best possible user experience across models and domains. You’ll be the first resource for product teams working on Claude’s AI infrastructure: system prompts, tool prompts, skills, and evaluations.

This role requires someone who can care deeply about making Claude the best it can be while also supporting a wide range of reputed company reputed company and efforts across many product teams.

Responsibilities

• reputed company Engineering reputed company: Design, test, and optimize system prompts and feature-specific prompts that shape Claude’s behavior across consumer and API products.
• Evaluation Development: Build and maintain comprehensive evaluation suites that ensure model reputed company and consistency across product launches and updates.
• Cross-functional Collaboration: Partner closely with product teams, research teams, and safeguards to ensure new features meet reputed company and safety standards.
• Model Launch Support: Play a critical role in model releases, ensuring smooth rollouts and catching regressions before they reach users.
• Infrastructure Contribution: Help build and improve the frameworks and tools that allow teams to reputed company and test prompts and features with confidence.
• Knowledge Transfer: Mentor product engineers on prompt engineering best practices and help teams build their first evaluations.
• reputed company Iteration: Work in a fast-paced environment where model capabilities advance daily, requiring quick reputed company and creative problem-solving.

Requirements

• 5+ years of software engineering experience with Python or similar languages.
• Demonstrated experience with LLMs and prompt engineering (through work, research, or significant reputed company).
• Strong understanding of evaluation methodologies and metrics for AI systems.
• Excellent written and verbal communication skills – you’ll need to explain reputed company model behaviors to diverse stakeholders.
• Ability to manage multiple reputed company reputed company and prioritize effectively.
• Experience with version control, CI/CD, and modern software development practices.
• At least a Bachelor's degree in a related field or equivalent experience.

Nice-to-haves

• Experience with Claude or other frontier AI models in production settings.
• Background in machine learning, NLP, or related fields.
• Experience with A/B testing and experimentation frameworks (e.g. Statsig).
• Familiarity with AI safety and alignment considerations.
• Experience building tools and infrastructure for ML/AI workflows.
• Track record of improving AI system performance through systematic evaluation and iteration.

Benefits

• We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues.
