All roles

Remote | SWE (Terminal and CLI Dev Tools Focused) — $75–$80/hour

Remote · USA Full-time New today

We are sharing a specialised part-time consulting opportunity for experienced software engineers with strong systems debugging ability, deep terminal and shell fluency, and the ability to evaluate AI-powered CLI coding agents across real-world infrastructure tasks. This role supports an exciting collaboration with leading AI labs focused on improving AI-powered coding systems through high-quality comparative evaluation of CLI agents working on real-world debugging scenarios inside Docker-based environments. Selected professionals will solve infrastructure debugging tasks using AI CLI agents, diagnose broken systems inside containers, write bash scripts that resolve root-cause issues, compare agent approaches and performance, and help improve overall model quality. This opportunity is especially well-suited to detail-oriented engineers who are comfortable working across systems, infrastructure, and debugging workflows, and who can apply strong technical judgment to both problem solving and model evaluation.

Key Responsibilities

Professionals in this role may contribute to: Infrastructure Debugging & Resolution Solve real-world broken infrastructure scenarios running inside Docker containers Diagnose issues involving databases, networking, security, pipelines, replication, or access control Help ensure that fixes address the root cause and remain stable across service restarts CLI Agent Evaluation & Comparison Use AI-powered CLI coding agents to help solve TerminalBench tasks Compare agents' approaches, reasoning quality, and effectiveness after each task Help establish rigorous comparative evaluations that directly inform product decisions Bash Scripting & Systems Execution Write bash scripts from scratch to resolve infrastructure problems Work within terminal-based environments to inspect, debug, and repair failing systems Help improve model quality through precise technical execution and structured performance ranking Ideal Profile Strong candidates may have: 3+ years of experience in software engineering with hands-on systems and infrastructure debugging experience Strong bash or shell scripting proficiency Docker and containerization experience Infrastructure and systems debugging skills involving PostgreSQL, MySQL, Redis, nginx, TLS, systemd, log analysis, or similar technologies Familiarity with version control workflows such as Git, pull requests, and issue tracking

Preferred Qualifications

Experience with AI coding tools such as Copilot, Cursor, Claude, or similar tools Strong ability to prompt and evaluate AI-generated technical output Comfort working independently across fast-paced debugging tasks Strong consistency, technical precision, and comparative judgment across repeated evaluations Why This Opportunity Contribute specialised systems engineering expertise to a cutting-edge AI collaboration Help evaluate the next generation of AI-powered CLI coding agents Work on high-impact infrastructure debugging tasks with strong real-world technical relevance Flexible remote work with competitive hourly compensation Contract Details Independent contractor role Fully remote with flexible scheduling Hourly compensation of $75–$80 per hour Immediate start Duration of 1–2 weeks Part-time commitment of 15–25 hours per week, with flexibility up to 40 hours per week Weekly payments via Stripe or Wise Work will not involve access to confidential or proprietary information from any employer, client, or institution Please note: We are unable to support H1-B or STEM OPT candidates at this time Application process includes resume submission, a short AI interview, and follow-up onboarding communication This is a pay-per-task opportunity for writers, with eligible promotion to reviewers based on project needs About The Platform This opportunity is available through a leading AI-driven work platform that connects domain experts with frontier AI research projects. Experts contribute to improving advanced AI systems by providing specialised expertise across real-world workflows, structured evaluation, model training support, and domain-specific content validation. By submitting this application, you acknowledge that your information may be processed by 24-MAG LLC for recruitment and opportunity matching in accordance with our Privacy Policy: https://www.24-mag.com/privacy-policy Apply tot his job Apply To this Job

Related roles

Staff, Advanced Analytics, CS Safety

Remote · USA Full-time

Specialist, Safety

Remote · USA Full-time

Contracts Specialist III

Remote · USA Full-time

Data Entry Specialist – Remote Amazon E‑Commerce & Cloud Operations Accuracy Expert (Work‑From‑Home)

Remote · USA Full-time

Work from Home: Get Free Amazon Products to Review

Remote · USA Full-time

Manager - International Account Development (Virtual - US)

Remote · USA Full-time

Amazon Account Manager - REMOTE

Remote · USA Full-time

Experienced Remote Data Entry Specialist – Amazon Work from Home Opportunities in Data Management and Entry

Remote · USA Full-time

Early Career Trial Attorney, $10k Sign-on Bonus (Remote - California)

Remote · USA Full-time

API Tester, Work from Home

Remote · USA Full-time

Experienced Customer Service Professional – Passenger Care Agent for Exceptional Client Experience and Relationship Building

Remote · USA Full-time

Experienced Full Stack Data Entry Specialist – Remote Work Opportunity with blithequark

Remote · USA Full-time

Experienced Customer Support Specialist – Earned Wage Access and Payroll Solutions

Remote · USA Full-time

Online Order Filling Team Associate

Remote · USA Full-time

Experienced Real Time Analyst II - Customer Care Specialist for arenaflex (Remote Job Work From Home)

Remote · USA Full-time

Legal Research Mentor

Remote · USA Full-time

Fixed Income Quantitative Analyst

Remote · USA Full-time

[Work From Home] Require Substitute Teacher $180-$200 Per Day in

Remote · USA Full-time

Security Engineer

Remote · USA Full-time

Content Writer – Blog & Online Content - US ONLY WEST COAST PREFERRED

Remote · USA Full-time