All jobs

[Remote] Senior AI Compiler Engineer - Applied Research

100% Remote Full-time Open now

Note: The job is a remote job and is open to candidates in USA. NVIDIA is a leader in AI infrastructure, focusing on developing cutting-edge technologies in machine learning compilers and AI systems. The Senior AI Compiler Engineer will design and implement AI-based technologies for low-level GPU code generation and collaborate with compiler engineers to enhance production toolchains.

Responsibilities

  • Design and implement AI-based technology addressing core problems of low-level GPU code generation
  • Build SFT and RL training pipelines
  • Define model inputs using low-level compiler representations
  • Define, implement, and evaluate strategies for intelligent prompt engineering in compilation domain
  • Prototype and iterate on model architectures, prompts, and training strategies for NP-hard problems in optimizing compilers
  • Prepare datasets from compiler traces, optimization passes, and target-specific performance signals
  • Apply RL techniques to optimize for downstream objectives and run rigorous experiments, analysis, and benchmarking across workloads and hardware targets
  • Build rigorous benchmarks to assess code quality, correctness, and generation overhead
  • Partner with compiler engineers to integrate and ship learned policies with production toolchains

Skills

  • M.S. or PhD degree in Computer Engineering, Computer Science related technical field (or equivalent experience)
  • 5+ years of experience building AI/ML systems
  • Solid understanding of machine learning fundamentals and experimentation best practices
  • Strong software engineering skills in Python and C++
  • Hands-on experience training/fine-tuning/post-training large models
  • Experience with reinforcement learning
  • Reward modeling from non-differentiable signals (binary runtime/compile success, performance counters)
  • Knowledge of prompt-engineering techniques (CoT, chaining/orchestration, context adaptation, etc)
  • Ability to work across research and engineering, from prototype to production
  • CUDA programming experience and GPU performance familiarity
  • Distributed training/inference at scale (Megatron, NeMo, vLLM, Triton)
  • Experience working with the NVIDIA training stacks
  • Fundamentals of construction of optimizing compilers
  • Understanding of GPU performance, experience with benchmarking suites and performance profiling tools
  • Knowledge of formal methods or static analysis for correctness guarantees

Benefits

  • You will also be eligible for equity and [benefits](https://www.nvidia.com/en-us/benefits/).

Company Overview

  • NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI. It was founded in 1993, and is headquartered in Santa Clara, California, USA, with a workforce of 10001+ employees. Its website is https://www.nvidia.com.
  • Company H1B Sponsorship

  • NVIDIA has a track record of offering H1B sponsorships, with 448 in 2026, 1872 in 2025, 1354 in 2024, 976 in 2023, 835 in 2022, 601 in 2021, 529 in 2020. Please note that this does not guarantee sponsorship for this specific role.
  • Apply To This Job

    You might also like

    [Remote] Sr. Data Engineer

    100% Remote Full-time

    [Remote] Software Engineer, Integrations

    100% Remote Full-time

    [Remote] Customer Service Logistics

    100% Remote Full-time

    [Remote] Customer Support Representative - Work-from-Home Florida

    100% Remote Full-time

    [Remote] Marketing Events Coordinator

    100% Remote Full-time

    [Remote] Senior NPU Architect

    100% Remote Full-time

    [Remote] Client Account Lead - Atlanta

    100% Remote Full-time

    [Remote] Senior Security Consultant (Web Application Penetration Tester)

    100% Remote Full-time

    [Remote] Business Development Director - HRS/NPP

    100% Remote Full-time

    [Remote] Director of Business Development - Northeast

    100% Remote Full-time

    Senior Data Scientist – Machine Learning, Predictive Analytics & Business Intelligence Expert (Remote-Flexible)

    100% Remote Full-time

    Staff Software Engineer - Data Query

    100% Remote Full-time

    Experienced Part-Time Remote Data Entry Specialist – E-commerce Operations at Blithequark

    100% Remote Full-time

    Customer Support Associate – Live Chat & Email (Entry‑Level, Flexible Hours) – Join arenaflex’s Global Support Team

    100% Remote Full-time

    [Remote] Manual Quality Assurance Engineer, Web Core Product

    100% Remote Full-time

    Orchard - Strategy Director Animal/Human Health - New Jersey 3 days

    100% Remote Full-time

    Fullstack Engineer Education

    100% Remote Full-time

    Mainframe System Administrator

    100% Remote Full-time

    Experienced Remote Customer Support Specialist – Revolutionizing Automotive Customer Experience

    100% Remote Full-time

    Sr. GIS Apps Product Engineer

    100% Remote Full-time