All roles

AI Engineer, Developer Ecosystem

Remote · USA Full-time New today

What you'll actually do

  • Build agents and tools in public: demo apps, reference implementations, MCP servers, Claude skills, LangGraph workflows. Ship things that are genuinely impressive.
  • Own the developer experience: identify friction in our API and SDKs, write real feedback back to the eng team, and fix it yourself when you can.
  • Design and run evals: benchmark tool-calling quality, measure agent reliability across integration surfaces, build sandboxed test harnesses that reflect production conditions. Publish what you learn.
  • Run workshops, give talks, appear at events: technical sessions on agentic architectures, tool-calling patterns, context optimization, and integration design.
  • Publish AI research adjacent to your work: MCP tool schema design, context window hygiene, eval frameworks for agentic systems, RLMF, auto-research loops, sandbox architecture for safe agent execution.
  • Foster community: Discords, GitHub, demo days, office hours. Be the engineer developers trust to give them a real answer.
  • Partner with product and engineering: turn new releases into working demos before they're announced. No slide decks without code.

What we're looking for Hard skills

  • Ship production-grade agents
  • Deep MCP / tool-calling fluency
  • Built plugins, skills, extensions, or agents for real usage
  • Designs evals and benchmarks for agentic systems
  • Builds sandboxes for safe agent testing
  • Understands context optimization
  • Reads AI research papers and applies them
  • TypeScript and/or Python at minimum

Soft signals

  • GitHub history you're proud of
  • Technical talks on record
  • Community presence
  • Builds to learn, not to demo
  • Gives direct opinions, backed by data
  • Doesn't wait to be unblocked

What we're not looking for

  • Someone who needs to ask permission to write a blog post or be taught on how to open a PR
  • Someone whose agent experience is only a weekend hackathon project
  • A conference talk collector with nothing on GitHub

Topics you should have opinions on MCP

  • A2A protocol
  • tool-calling schemas
  • context window optimization
  • evals & benchmarking
  • agent sandboxes
  • LangGraph / DSPy
  • RLMF / RLM harnesses
  • auto-research loops
  • code mode / long-horizon agents
  • RAG vs. tool-use tradeoffs
  • enterprise auth for agents
  • multi-agent orchestration
  • prompt caching strategies
  • AI safety boundaries
  • sandbox isolation patterns
  • LLM leaderboard literacy

This is a real engineering role This isn't a "write blog posts and attend conferences" role dressed up as engineering. You'll be embedded with the product and engineering team. You'll ship code that ends up in our SDKs, our docs, and our sample repos. The AI agent ecosystem is moving fast enough that the line between DevRel and R&D is blurring. We want someone comfortable sitting in that blur - writing a technical post about eval design for tool-calling reliability because they spent two weeks deep in it, building a sandbox harness to reproduce a flaky agent behavior, not because someone briefed them on a slide. You'll have access to a platform that connects agents to any other system safely while optimising token usage, and a mandate to show the world what's possible when those connections actually work well. Apply tot his job Apply To this Job

Related roles

AI Engineering Intern, Summer Internship

Remote · USA Full-time

ML/AI Engineers

Remote · USA Full-time

Forward Deployed AI Engineer (Must be PST timezone)

Remote · USA Full-time

Staff Backend AI Engineer

Remote · USA Full-time

Ngspice Electronics Engineer for AI Circuit Simulation

Remote · USA Full-time

Accessibility QA Engineer & AI Trainer

Remote · USA Full-time

Software Engineer, Front-End

Remote · USA Full-time

AI Architect for Automation Delivery

Remote · USA Full-time

Sr. Artificial Intelligence Engineer with Azure for 6 Months of Contract to Hire

Remote · USA Full-time

VP, Investment AI Engineer

Remote · USA Full-time

Experienced Freelance Chat Moderator - Remote Community Engagement Specialist

Remote · USA Full-time

Remote Data Entry Clerk – Precision Data Management & Documentation Specialist

Remote · USA Full-time

Experienced Customer Experience Concierge – Remote Chat Professional at arenaflex

Remote · USA Full-time

Behavioral Health Remote - Full-Time NP/PA

Remote · USA Full-time

Experienced Entry-Level Data Entry Specialist – Remote Work Opportunity at arenaflex

Remote · USA Full-time

Principal Product Manager Farmer Insights Applications

Remote · USA Full-time

Experienced Data Entry Specialist – Remote Work Opportunity at arenaflex

Remote · USA Full-time

Bilingual Contact Center Specialist Remote

Remote · USA Full-time

Senior SRE - Leading Online Retailer

Remote · USA Full-time

Experienced Live Chat Representative – Customer Service & Sales Support

Remote · USA Full-time