Back to Jobs

Remote | SWE (Terminal and CLI Dev Tools Focused) — $75–$80/hour

Remote, USA Full-time Posted 2026-04-22
We are sharing a specialised part-time consulting opportunity for experienced software engineers with strong systems debugging ability, deep terminal and shell fluency, and the ability to evaluate AI-powered CLI coding agents across real-world infrastructure tasks. This role supports an exciting collaboration with leading AI labs focused on improving AI-powered coding systems through high-quality comparative evaluation of CLI agents working on real-world debugging scenarios inside Docker-based environments. Selected professionals will solve infrastructure debugging tasks using AI CLI agents, diagnose broken systems inside containers, write bash scripts that resolve root-cause issues, compare agent approaches and performance, and help improve overall model quality. This opportunity is especially well-suited to detail-oriented engineers who are comfortable working across systems, infrastructure, and debugging workflows, and who can apply strong technical judgment to both problem solving and model evaluation. Key Responsibilities Professionals in this role may contribute to: Infrastructure Debugging & Resolution Solve real-world broken infrastructure scenarios running inside Docker containers Diagnose issues involving databases, networking, security, pipelines, replication, or access control Help ensure that fixes address the root cause and remain stable across service restarts CLI Agent Evaluation & Comparison Use AI-powered CLI coding agents to help solve TerminalBench tasks Compare agents' approaches, reasoning quality, and effectiveness after each task Help establish rigorous comparative evaluations that directly inform product decisions Bash Scripting & Systems Execution Write bash scripts from scratch to resolve infrastructure problems Work within terminal-based environments to inspect, debug, and repair failing systems Help improve model quality through precise technical execution and structured performance ranking Ideal Profile Strong candidates may have: 3+ years of experience in software engineering with hands-on systems and infrastructure debugging experience Strong bash or shell scripting proficiency Docker and containerization experience Infrastructure and systems debugging skills involving PostgreSQL, MySQL, Redis, nginx, TLS, systemd, log analysis, or similar technologies Familiarity with version control workflows such as Git, pull requests, and issue tracking Preferred Qualifications Experience with AI coding tools such as Copilot, Cursor, Claude, or similar tools Strong ability to prompt and evaluate AI-generated technical output Comfort working independently across fast-paced debugging tasks Strong consistency, technical precision, and comparative judgment across repeated evaluations Why This Opportunity Contribute specialised systems engineering expertise to a cutting-edge AI collaboration Help evaluate the next generation of AI-powered CLI coding agents Work on high-impact infrastructure debugging tasks with strong real-world technical relevance Flexible remote work with competitive hourly compensation Contract Details Independent contractor role Fully remote with flexible scheduling Hourly compensation of $75–$80 per hour Immediate start Duration of 1–2 weeks Part-time commitment of 15–25 hours per week, with flexibility up to 40 hours per week Weekly payments via Stripe or Wise Work will not involve access to confidential or proprietary information from any employer, client, or institution Please note: We are unable to support H1-B or STEM OPT candidates at this time Application process includes resume submission, a short AI interview, and follow-up onboarding communication This is a pay-per-task opportunity for writers, with eligible promotion to reviewers based on project needs About The Platform This opportunity is available through a leading AI-driven work platform that connects domain experts with frontier AI research projects. Experts contribute to improving advanced AI systems by providing specialised expertise across real-world workflows, structured evaluation, model training support, and domain-specific content validation. By submitting this application, you acknowledge that your information may be processed by 24-MAG LLC for recruitment and opportunity matching in accordance with our Privacy Policy: https://www.24-mag.com/privacy-policy Apply tot his job Apply To this Job

Similar Jobs

**Experienced Full Stack Contact Center Chat Representative – Digital Customer Support and Sales**

Remote, USA Full-time

Remote Tech Support Representative Entry Level ...

Remote, USA Full-time

**Experienced Customer Service Training Specialist – Driving Customer Experience Excellence at arenaflex**

Remote, USA Full-time

**Experienced Customer Service Representative – Work From Home Opportunity at arenaflex**

Remote, USA Full-time

**Experienced Part-Time Remote Customer Service Representative – Walmart**

Remote, USA Full-time

**Experienced Remote Sales Chat Representative – Shipping Container Sales and Customer Engagement**

Remote, USA Full-time

Veeva Project Manager, Pharma IT Consulting

Remote, USA Full-time

Interior Designer | Residential Design (Remote ...

Remote, USA Full-time

Online Data Entry Jobs (Work From Home/Part Tim...

Remote, USA Full-time

**Experienced Customer Service Representative – Remote Amazon Customer Service For Teens: Flexible Opportunities at arenaflex**

Remote, USA Full-time

Marketing Manager – Events, Integrated Campaigns

Remote, USA Full-time

4X Traveling Dental Assistant

Remote, USA Full-time

Senior Sales Application Engineer

Remote, USA Full-time

**Experienced Customer Service Representative – Virtual Assistant for arenaflex**

Remote, USA Full-time

Remote-First Family Law Associate - Hybrid & Client-Focused

Remote, USA Full-time

Senior Software Engineer - Authentication Infra

Remote, USA Full-time

Senior Back End Software Developer

Remote, USA Full-time

**Experienced Remote Online Chat Specialist – Deliver Exceptional Customer Experience at arenaflex**

Remote, USA Full-time

lead cybersecurity engineer, engineering operations (Remote, US)

Remote, USA Full-time

Remote Senior Data Engineer I : ETL & Analytics

Remote, USA Full-time