Software Engineering

Microsoft is a global technology leader providing cloud computing, AI, and enterprise software solutions.
Machine Learning
Mid-Level Software Engineer
In-Person
5,000+ Employees
1+ year of experience
AI · Enterprise SaaS · Cloud

Description For Software Engineering

Microsoft's Azure Machine Learning team is seeking a talented Software Engineer to join their Inference team, focusing on next-generation model serving capabilities. This role is at the cutting edge of AI and Cloud technology, working with OpenAI models like ChatGPT and supporting Bing and Office applications.

The position involves developing and maintaining high-performance, scalable platforms for model inferencing, handling billions of requests daily. You'll be working with state-of-the-art LLMs and Diffusion models, optimizing their performance and cost-effectiveness at scale.

As part of Microsoft's vision to democratize Machine Learning, you'll be contributing to making ML accessible to enterprises, developers, and data scientists worldwide. The role requires expertise in C/C++, Python, and modern cloud technologies like Kubernetes and Docker.

The ideal candidate will have experience with large-scale machine learning model deployment, strong programming skills, and the ability to work effectively in a geo-distributed team environment. You'll be tackling challenging problems at the intersection of AI and cloud computing, working with one of the largest GPU fleets in the world.

This is an excellent opportunity for someone passionate about AI infrastructure who wants to make a significant impact on how machine learning models are served at enterprise scale. You'll be part of a collaborative team environment that values innovation and technical excellence, working on cutting-edge technology that powers Microsoft's AI initiatives.

Last updated 21 hours ago

Responsibilities For Software Engineering

  • Engage with key partners to understand state-of-the-art LLMs and Diffusion models
  • Run models at scale in performance and cost effective manner
  • Leverage latest hardware stack technologies in CUDA, infiniband
  • Support production inference for core AI scenarios
  • Build and maintain relationships with stakeholders
  • Anticipate, identify, and mitigate project risks

Requirements For Software Engineering

Python
Go
Kubernetes
  • B Tech or M Tech in computer science, engineering, mathematics or related field
  • 1+ year(s) of software development experience in C/C++ and/or Python
  • Knowledge of OSS, Docker, Kubernetes, Python, GOLANG
  • Experience in hosting and running large scale machine learning models
  • Experience in developing low latency, high scale, reliable online services
  • Good communication and collaboration skills
  • Experience working in geo-distributed teams

Benefits For Software Engineering

Medical Insurance
Education Budget
Parental Leave
Mental Health Assistance
  • Industry leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Networking opportunities

Interested in this job?

Jobs Related To Microsoft Software Engineering

Software Engineer II

Microsoft Azure AI Platform Software Engineer II position focusing on machine learning, AI model training, and distributed systems development in Hyderabad, India.

Software Engineer II

Software Engineer II position at Microsoft Security working on Security Copilot and generative AI services in Hyderabad, India.

Software Engineer II

Software Engineer II position at Microsoft working on Azure Specialized team, focusing on SAP workload optimization and machine learning integration.

Software Engineer II - AI Agents & Monetization

Microsoft is hiring a Software Engineer II to develop AI agents and monetization capabilities for Copilot, offering remote work in Vancouver with competitive pay and benefits.

Software Engineer 2

Software Engineer 2 position at Microsoft's Azure Voice AI team, focusing on developing cutting-edge generative voice and avatar AI models in Suzhou, China.