Member of Technical Staff, Multimodal Infrastructure - MAI Superintelligence Team | Microsoft Careers

Microsoft

Hybrid Actively hiring
Remote, USA · San Francisco Bay Area, CA · New York City Metropolitan Area, NY Posted 89 days ago $139,900$274,800 / year

At a glance

AI generated

TL;DR

Microsoft AI is seeking a Member of Technical Staff at the senior level to join its cutting-edge team focused on developing advanced capabilities for Copilot, a personalized AI assistant. In this role, you will design and build large-scale multimodal infrastructures supporting model development cycles, including data processing pipelines, pretraining frameworks, and inference systems. You’ll collaborate closely with research scientists and product engineers to tackle complex infrastructure challenges and drive architectural improvements that influence the software and hardware roadmap. The ideal candidate has extensive experience in distributed data processing, deep learning frameworks like PyTorch and Megatron, and serving technologies such as vLLM and TensorRT-LLM. Proficiency in languages including Python, C++, or Java is essential, along with expertise in multi-modal data handling, model training techniques, and efficient inference methods. This role offers the opportunity to impact a wide range of users and drive innovation in AI at scale.

Skills

Python PyTorch Megatron Deepspeed vLLM TensorRT-LLM SGLang xDiT Cache-DiT Ray Spark Ray Serve Triton Progressive Distillation AWQ GPTQ FP8 RLHF DPO GRPO

What you'll do

  • Design and develop large-scale multimodal data processing pipelines.
  • Create and maintain frameworks for multimodal model pretraining and post-training.
  • Build and manage infrastructures for multimodal model inference and serving.
  • Collaborate with research scientists to solve infrastructure-related challenges.
  • Optimize distributed training techniques for efficient resource utilization.

What we're looking for

  • Bachelor's degree in Computer Science or related field with extensive technical engineering experience.
  • Strong proficiency in multiple programming languages such as C++, Java, Python, etc.
  • Extensive experience in multimodal data processing including distributed data infrastructure and optimizations.
  • Deep expertise in deep learning frameworks like PyTorch and advanced training techniques.
  • Proficiency in serving frameworks for multi-modal models and knowledge of distillation and quantization techniques.

Market check

Salary context

This $139,900–$274,800 range sits above 71% of similar postings on FindRole.

Peer median band

$136,603$234,850

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$168,875$213,375

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Microsoft

Microsoft Corporation is a global technology leader producing software, hardware, and cloud services including Windows, Office 365, Azure cloud platform, Xbox gaming, and Surface devices. Industry: Software & Cloud Computing

Microsoft currently has 445 open roles on FindRole.

Listed pay typically runs $119,800–$234,700 across 415 roles with salary data.

Most-posted roles

View all roles at Microsoft

More like this

Similar roles