All roles

[Remote] Full Stack ML Efficiency & Observability

Remote · USA Full-time New today

Note: The job is a remote job and is open to candidates in USA. Microsoft AI is looking for a Member of Technical Staff - Full Stack Engineer, ML Efficiency & Observability to help efficiently manage compute capacity. The role involves designing and developing features for capacity management and model performance visibility while collaborating with ML researchers and product managers to create intuitive user experiences.

Responsibilities

  • Design and develop features for our capacity management portal
  • Design and develop features to provide visibility into model performance and quality across our fleet
  • Partner with ML researchers and PMs to translate functional requirements into highly functional, intuitive and appealing interfaces
  • Integrate with backend APIs from schedulers to training frameworks to build visibility across the training life cycle
  • Explore, develop, and adapt new innovations to the software development process
  • Contribute to the development of internal tooling and infrastructure
  • Implement best software development practices to ensure code quality. Hold a high quality bar
  • Embody our culture and values

Skills

  • Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • 4+ years experience in business analytics, data science, software development, data modeling or data engineering work
  • Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 6+ years experience in business analytics, data science, software development, data modeling or data engineering work
  • OR Master's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 4+ years of business analytics, data science, software development, data modeling or data engineering work experience
  • OR equivalent experience
  • Experience with Capacity Management, Efficiency Management, ML Training and/or Inference
  • Solid expertise in JavaScript / TypeScript, React, HTML, CSS and browser internals
  • Solid understanding of web performance, accessibility, and cross‑browser compatibility
  • Experience with Development & Debugging with dev environments like Visual Studio or Visual Studio Code
  • Software development experience with Generative AI tools
  • Experience in leading technical projects and supporting architectural decisions with data

Benefits

  • Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay

Company Overview

  • Microsoft is a software corporation that develops, manufactures, licenses, supports, and sells a range of software products and services. It was founded in 1975, and is headquartered in Redmond, Washington, USA, with a workforce of 10001+ employees. Its website is https://www.microsoft.com.
  • Company H1B Sponsorship

  • Microsoft has a track record of offering H1B sponsorships, with 1317 in 2026, 9192 in 2025, 9343 in 2024, 7677 in 2023, 11403 in 2022, 7210 in 2021, 7852 in 2020. Please note that this does not guarantee sponsorship for this specific role.
  • Apply To This Job

    Related roles