[Remote] Senior Software Engineer, AI and DL Kernel Libraries

Remote · USA Full-time New today

Note: This is a remote position open to candidates in the USA. NVIDIA is a leading technology company specializing in AI and deep learning solutions. It is seeking a Senior Software Engineer to develop innovative AI systems technologies, with a focus on optimizing kernels for high-impact AI workloads and collaborating across teams to enhance NVIDIA's hardware architecture.

Responsibilities

  • Innovating and developing new AI systems technologies for efficient inference
  • Designing, implementing, and optimizing kernels for high-impact AI workloads
  • Designing and implementing extensible abstractions for LLM serving engines
  • Building efficient just-in-time, domain-specific compilers and runtimes
  • Collaborating closely with other NVIDIA engineers across deep learning framework, library, kernel, and GPU architecture teams
  • Contributing to open source communities like FlashInfer, vLLM, and SGLang

Skills

  • Master's degree in Computer Science, Electrical Engineering, or a related field (or equivalent experience); PhD preferred
  • 6+ years of academic or industry experience in ML/DL systems development preferred
  • Strong experience in developing or using deep learning frameworks (e.g. PyTorch, JAX, TensorFlow, ONNX), and ideally inference engines and runtimes such as vLLM, SGLang, and MLC
  • Strong Python and C/C++ programming skills
  • Strong experience in GPU kernel development and performance optimizations (especially using CUDA C/C++, cuTile, Triton, or similar)
  • Background in domain-specific compiler and library solutions for LLM inference and training (e.g. FlashInfer, Flash Attention)
  • Expertise in inference engines like vLLM and SGLang
  • Expertise in machine learning compilers (e.g. Apache TVM, MLIR)
  • Open source project ownership or contributions

Benefits

  • Equity
  • Benefits

Company Overview

  • NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI. It was founded in 1993 and is headquartered in Santa Clara, California, USA, with a workforce of more than 10,000 employees. Its website is https://www.nvidia.com.

Company H1B Sponsorship

  • NVIDIA has a track record of offering H1B sponsorships, with 448 in 2026, 1872 in 2025, 1354 in 2024, 976 in 2023, 835 in 2022, 601 in 2021, 529 in 2020. Please note that this does not guarantee sponsorship for this specific role.
