All jobs

[Remote] Full Stack ML Efficiency & Observability

100% Remote Full-time Open now

Note: The job is a remote job and is open to candidates in USA. Microsoft AI is looking for a Member of Technical Staff - Full Stack Engineer, ML Efficiency & Observability to help efficiently manage compute capacity. The role involves designing and developing features for capacity management and model performance visibility while collaborating with ML researchers and product managers to create intuitive user experiences.

Responsibilities

  • Design and develop features for our capacity management portal
  • Design and develop features to provide visibility into model performance and quality across our fleet
  • Partner with ML researchers and PMs to translate functional requirements into highly functional, intuitive and appealing interfaces
  • Integrate with backend APIs from schedulers to training frameworks to build visibility across the training life cycle
  • Explore, develop, and adapt new innovations to the software development process
  • Contribute to the development of internal tooling and infrastructure
  • Implement best software development practices to ensure code quality. Hold a high quality bar
  • Embody our culture and values

Skills

  • Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • 4+ years experience in business analytics, data science, software development, data modeling or data engineering work
  • Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 6+ years experience in business analytics, data science, software development, data modeling or data engineering work
  • OR Master's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 4+ years of business analytics, data science, software development, data modeling or data engineering work experience
  • OR equivalent experience
  • Experience with Capacity Management, Efficiency Management, ML Training and/or Inference
  • Solid expertise in JavaScript / TypeScript, React, HTML, CSS and browser internals
  • Solid understanding of web performance, accessibility, and cross‑browser compatibility
  • Experience with Development & Debugging with dev environments like Visual Studio or Visual Studio Code
  • Software development experience with Generative AI tools
  • Experience in leading technical projects and supporting architectural decisions with data

Benefits

  • Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay

Company Overview

  • Microsoft is a software corporation that develops, manufactures, licenses, supports, and sells a range of software products and services. It was founded in 1975, and is headquartered in Redmond, Washington, USA, with a workforce of 10001+ employees. Its website is https://www.microsoft.com.
  • Company H1B Sponsorship

  • Microsoft has a track record of offering H1B sponsorships, with 1317 in 2026, 9192 in 2025, 9343 in 2024, 7677 in 2023, 11403 in 2022, 7210 in 2021, 7852 in 2020. Please note that this does not guarantee sponsorship for this specific role.
  • Apply To This Job

    You might also like

    [Remote] Senior Consultant – Oracle Health (VA Critical Support Solutions Team)

    100% Remote Full-time

    [Remote] Director Insurance Operations - AI Trainer

    100% Remote Full-time

    [Remote] Offensive Security Engineer - AI Trainer

    100% Remote Full-time

    [Remote] Purple Team Engineer - AI Trainer

    100% Remote Full-time

    [Remote] Project Manager, Long Haul Site Acquisition and Permitting

    100% Remote Full-time

    [Remote] Software Engineer, iOS Core Product - Washington, DC, USA

    100% Remote Full-time

    [Remote] Software Engineer, iOS Core Product - Minneapolis, MN, USA

    100% Remote Full-time

    [Remote] Software Engineer, iOS Core Product - State College, PA, USA

    100% Remote Full-time

    [Remote] Engineering Manager

    100% Remote Full-time

    [Remote] Senior Full Stack Engineer, Integrations Epic

    100% Remote Full-time

    Experienced Customer Experience Specialist I – Domestic Customer Support & Order Management

    100% Remote Full-time

    Surgery - Volunteer Medical Specialist

    100% Remote Full-time

    Art Director; Remote - Texas

    100% Remote Full-time

    Data Engineering Manager, Community Support Platform

    100% Remote Full-time

    Ingenieur: in Substation Automation System (all gender)

    100% Remote Full-time

    Experienced Customer Sales and Service Representative – Delivering Exceptional Experiences on America’s Fastest and Most Reliable Network

    100% Remote Full-time

    Compliance Data Entry Assistant

    100% Remote Full-time

    Remote Seasonal Tax Software Technical Support Representative - Customer Assistance Specialist (Work From Home) | arenaflex Tax Preparation Product Support

    100% Remote Full-time

    Retail Store Customer Service Specialist in Austin, TX

    100% Remote Full-time

    Walmart Customer Service Representative (Remote Work)

    100% Remote Full-time