Senior System Software Engineer – DC Platform Software Tools

NVIDIA is the world leader in accelerated computing, pioneering GPU technology and AI solutions.
$184,000 - $356,500
Backend
Senior Software Engineer
In-Person
5,000+ Employees
10+ years of experience
AI · Enterprise SaaS
This job posting may no longer be active.

Description For Senior System Software Engineer – DC Platform Software Tools

NVIDIA, the pioneer in GPU technology and AI computing, is seeking a Senior System Software Engineer to join their Data Center Platform Software Tools team. This role is crucial in developing and enhancing tools for large-scale AI data centers, focusing on the complete manageability lifecycle from deployment to repair workflows.

The position involves working with NVIDIA's Grace and GPU superchips, designed for HPC and generative AI workloads. You'll work on tools that manage server software and firmware across the data center lifecycle, particularly for DGX, HGX, or MGX products.

As a Senior Engineer, you'll collaborate with cross-functional teams including hardware engineers, system architects, and software developers. Your responsibilities will span from gathering requirements to creating solutions that provide simplified manageability experiences. The role requires expertise in Python programming and extensive experience with large-scale data center operations.

NVIDIA offers a competitive compensation package with a base salary range of $184,000 - $356,500 USD, plus equity and benefits. The company is known for its innovative culture and commitment to pushing technological boundaries. You'll be joining a team that's dedicated to transforming the world's largest industries through AI and digital twin technology.

The ideal candidate will have 10+ years of experience, strong technical skills, and a proven track record in data center management solutions. This is an excellent opportunity for someone who is passionate about AI infrastructure, enjoys solving complex problems, and wants to be part of shaping the future of computing technology at one of the technology world's most desirable employers.

Last updated a month ago

Responsibilities For Senior System Software Engineer – DC Platform Software Tools

  • Drive next-generation GPU server software manageability workflows for scaling AI infrastructure in data centers
  • Work with internal and external customers to understand requirements for various tools
  • Contribute to all phases of product development, from definition to customer support
  • Maintain detailed documentation of tool designs, capabilities, and usage guidelines
  • Define KPIs for tools and work with stakeholders to improve them over time

Requirements For Senior System Software Engineer – DC Platform Software Tools

Python
Linux
  • BS, MS, or PhD in EE/CS or related field with 10+ years of experience
  • Proven record of working on management solutions for large-scale clusters in data centers
  • Strong and demonstrable skill in Python
  • Programming and debugging experience for large-scale data centers
  • Experience in SCM (e.g., Git, Perforce) and project management tools like Jira
  • Excellent written and oral communication skills
  • Self-starter who loves to find creative solutions to complicated problems

Benefits For Senior System Software Engineer – DC Platform Software Tools

  • Equity
  • Benefits package
