All roles

Software Engineer, Inference AI/ML

Remote · USA Full-time New today

CoreWeave is The Essential Cloud for AI™, providing a platform for innovators to build and scale AI. The role involves joining the Inference team to implement features that enhance model serving on the GPU platform, focusing on improving latency, reliability, and cost.

Responsibilities

  • Implement well-scoped features and fixes in Python/Go/C++ for model-serving services (e.g., Triton, vLLM, TensorRT-LLM, Ray Serve)
  • Write tests, code comments, and short design docs; participate in code reviews
  • Add basic metrics and dashboards; assist with alarms and runbooks
  • Follow on-call runbooks and learn incident response in a guided rotation
  • Contribute to performance experiments (e.g., request batching, concurrency, caching) with guidance

Skills

  • BS/MS in CS, EE, or related field, or equivalent practical experience
  • Foundations in data structures, algorithms, and networked services
  • Experience with Python or Go (C++ a plus) and Linux fundamentals; Git/CI basics
  • Exposure to containers and Kubernetes (coursework or projects welcome)
  • Curiosity about GPU inference concepts (micro-batching, KV cache, streaming)
  • Internship or project that deployed a microservice or ML inference demo
  • Coursework/research with PyTorch or TensorFlow; simple CUDA projects a plus
  • Familiarity with Grafana/Prometheus/OpenTelemetry or similar tooling

Benefits

  • Medical, dental, and vision insurance - 100% paid for by CoreWeave
  • Company-paid Life Insurance
  • Voluntary supplemental life insurance
  • Short and long-term disability insurance
  • Flexible Spending Account
  • Health Savings Account
  • Tuition Reimbursement
  • Ability to Participate in Employee Stock Purchase Program (ESPP)
  • Mental Wellness Benefits through Spring Health
  • Family-Forming support provided by Carrot
  • Paid Parental Leave
  • Flexible, full-service childcare support with Kinside
  • 401(k) with a generous employer match
  • Flexible PTO
  • Catered lunch each day in our office and data center locations
  • A casual work environment
  • A work culture focused on innovative disruption

Company Overview

  • CoreWeave is a cloud-based AI infrastructure company offering GPU cloud services to simplify AI and machine learning workloads. It was founded in 2017, and is headquartered in Livingston, New Jersey, USA, with a workforce of 1001-5000 employees. Its website is https://www.coreweave.com.
  • Apply To This Job

    Related roles

    Accountant l

    Remote · USA Full-time

    Associate Product Manager

    Remote · USA Full-time

    [Remote] Laravel Full Stack Developer

    Remote · USA Full-time

    OPS Clinician SBS

    Remote · USA Full-time

    Account Manager / Outside Sales Representative - Virginia Beach, VA area

    Remote · USA Full-time

    Project Assistant

    Remote · USA Full-time

    Phoenix, AZ Account Executive - Bilingual Spanish

    Remote · USA Full-time

    Associate Equipment Specialist - Solar (Traveler) | Mortenson

    Remote · USA Full-time

    Project Coordinator

    Remote · USA Full-time

    Social Video Editor

    Remote · USA Full-time

    Experienced Customer Support Representative – Live Chat and Streaming Entertainment Expertise for a Dynamic Remote Team at arenaflex

    Remote · USA Full-time

    Senior Director, Enterprise Analytics & AI-Enabled BI

    Remote · USA Full-time

    Healthcare Editor (AI & Data Journalism) - Contract (Fully Remote in US)

    Remote · USA Full-time

    Inventory Control Associate

    Remote · USA Full-time

    Remote Full Stack Staff Engineer – eCommerce Platform Development for T.J. Maxx (Work‑From‑Home, $27/hr, 8‑Hour Shift)

    Remote · USA Full-time

    Experienced Customer Service Associate – Work From Home Opportunity at arenaflex

    Remote · USA Full-time

    Remote Software Asset Manager role (ServiceNow ...

    Remote · USA Full-time

    Engineering Manager

    Remote · USA Full-time

    Experienced Data Operations Analyst – Fund Document Processing and Data Entry

    Remote · USA Full-time

    Hiring Now: Live Chat Agent - REMOTE (Part-Time & Full-Time)

    Remote · USA Full-time