All roles

Senior/Staff AI Engineer

Remote · USA Full-time New today

Job Description:

  • Build and optimize LLM serving and inference systems for production environments
  • Improve performance across GPU and CPU pathways
  • Work on KV cache, memory, storage, and throughput bottlenecks
  • Design and scale systems that support RAG and retrieval-heavy AI workloads
  • Contribute to infrastructure where storage architecture and systems efficiency materially affect AI performance
  • Solve engineering problems at the intersection of AI, high-performance systems, and distributed infrastructure

Requirements:

  • An engineer who has spent meaningful time building or optimizing production AI systems, not just experimenting with models
  • Someone who understands how inference performance is shaped by the interaction between compute, memory, storage, and serving architecture
  • Deep hands-on experience working close to the systems layer — for example, improving how workloads run across GPU and CPU resources, reducing bottlenecks, or tuning infrastructure for better throughput and latency
  • Evidence of real ownership in areas like model serving, retrieval, caching, storage, or distributed performance, rather than purely application-layer AI work
  • The ability to move comfortably between architecture decisions and hands-on implementation, especially in environments where efficiency and scale matter
  • A background that suggests you can operate in technically demanding environments, whether that comes from AI infrastructure, high-performance systems, storage platforms, or adjacent distributed systems work
  • PhD preferred, but far less important than having built serious systems in the real world.

Benefits: Apply tot his job Apply To this Job

Related roles

Senior Machine Learning Engineer- Ads Personalization

Remote · USA Full-time

Senior Machine Learning Engineer - Scan, Match and Catalog

Remote · USA Full-time

Staff Machine Learning Engineer - Content and Contributor Intelligence (Remote - United States)

Remote · USA Full-time

Machine Learning Engineer - LLM Evaluation & Automation

Remote · USA Full-time

Edge AI Engineer

Remote · USA Full-time

Lead Machine Learning Engineer - Remote (US) or CA - Only W2

Remote · USA Full-time

ML/AI Engineer - Junior Level

Remote · USA Full-time

FPGA AI/ML Engineer – Part Time

Remote · USA Full-time

Temporary Micro-Credential Grader – Industry-Focused Prompt Engineering for ROI-Driven Results

Remote · USA Full-time

English Prompt Engineer: LLM Migration & Optimization

Remote · USA Full-time

Real Estate Video Editor - Remote - USA

Remote · USA Full-time

Senior Associate Business Development & Strategy

Remote · USA Full-time

Rechtspfleger / Jurist – Legal Tech & Digitalisierung der Justiz (all genders)

Remote · USA Full-time

Manager, Quality (H)

Remote · USA Full-time

Job Title: Experienced Remote Data Entry Clerk – Flexible Work Arrangements | Daily/Weekly Pay Opportunities

Remote · USA Full-time

Experienced Data Entry Clerk – Remote Work Opportunity with arenaflex

Remote · USA Full-time

Account Manager - Mumbai & M.P

Remote · USA Full-time

Outpatient Coder (temp)

Remote · USA Full-time

Live Receptionist

Remote · USA Full-time

Weekend PRN Home Infusion / IVIG RN

Remote · USA Full-time