Microsoft's Azure Machine Learning organization is seeking a talented Software Engineer to join its Inference team, which focuses on next-generation model-serving capabilities. The role sits at the cutting edge of AI and cloud technology, working with OpenAI models such as ChatGPT and supporting Bing and Office applications.
The position involves developing and maintaining high-performance, scalable platforms for model inference that handle billions of requests daily. You'll work with state-of-the-art LLMs and diffusion models, optimizing their performance and cost-effectiveness at scale.
As part of Microsoft's vision to democratize machine learning, you'll help make ML accessible to enterprises, developers, and data scientists worldwide. The role requires expertise in C/C++, Python, and modern cloud technologies such as Kubernetes and Docker.
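To give a flavor of the kind of work involved, here is a minimal sketch of a model-serving endpoint in Python. The framework (FastAPI), the model, and the endpoint names are illustrative assumptions only, not a description of the Inference team's actual stack.

```python
# Illustrative sketch only: FastAPI, Hugging Face transformers, and the model
# name are assumptions for demonstration, not the team's actual serving stack.
from fastapi import FastAPI
from pydantic import BaseModel
from transformers import pipeline

app = FastAPI()

# Load a small text-generation model once at startup so each request
# only pays the cost of a forward pass.
generator = pipeline("text-generation", model="gpt2")


class GenerateRequest(BaseModel):
    prompt: str
    max_new_tokens: int = 64


@app.post("/generate")
def generate(req: GenerateRequest):
    # Run inference and return only the generated text.
    outputs = generator(req.prompt, max_new_tokens=req.max_new_tokens)
    return {"completion": outputs[0]["generated_text"]}

# Run locally with: uvicorn app:app --host 0.0.0.0 --port 8000
# In production, a service like this would typically be containerized with
# Docker and deployed behind an autoscaling Kubernetes service.
```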
The ideal candidate will have experience with large-scale machine learning model deployment, strong programming skills, and the ability to work effectively in a geo-distributed team environment. You'll be tackling challenging problems at the intersection of AI and cloud computing, working with one of the largest GPU fleets in the world.
This is an excellent opportunity for someone passionate about AI infrastructure who wants to make a significant impact on how machine learning models are served at enterprise scale. You'll join a collaborative team that values innovation and technical excellence, building the technology that powers Microsoft's AI initiatives.