Principal Firmware Engineer - Data Center Server Management

World leader in accelerated computing, pioneering AI and digital twins technology.
$272,000 - $471,500
Embedded
Principal Software Engineer
Hybrid
5,000+ Employees
15+ years of experience
AI · Enterprise SaaS

Description For Principal Firmware Engineer - Data Center Server Management

NVIDIA, the pioneering company that invented the GPU and revolutionized parallel computing, is seeking a Principal Firmware Engineer for their Data Center Server Management team. This role is at the forefront of developing solutions for NVIDIA's GH200 superchip, which powers HPC and generative AI workloads. The position involves architecting end-to-end manageability solutions for data center products, working with cutting-edge technology in AI computing.

The role requires a seasoned professional with 15+ years of experience in server firmware and platform software development. You'll be responsible for driving server management for large clusters and data centers, working directly with data center architects and cloud customers to implement solutions that meet complex requirements. The position involves collaboration with various internal teams to ensure proper design and implementation of firmware and software modules.

As a Principal Engineer, you'll be instrumental in designing and building data center health management workflows, driving reliability and optimization in firmware architecture, and owning the quality and performance of firmware delivered to data centers. The role offers the opportunity to work with NVIDIA's advanced technology stack, including their GPU and Grace solutions, while solving complex challenges in server management at scale.

The position offers a competitive compensation package with a base salary range of $272,000 - $471,500 USD, plus equity and comprehensive benefits. This is an excellent opportunity for a technical leader who can drive large complex problems with 50+ engineers and has a passion for creating innovative solutions in the AI computing space.

Last updated 5 days ago

Responsibilities For Principal Firmware Engineer - Data Center Server Management

  • Drive server management for large clusters and data centers deploying GPUs and Grace solution
  • Work with data center architects and cloud customers to define requirements
  • Ensure requirements are designed and implemented correctly in firmware and software modules
  • Design & build data center health management workflow
  • Drive reliability and optimization in firmware architecture
  • Work with cluster bring up team and resolve issues
  • Own firmware delivery to data centers for quality, reliability and telemetry performance

Requirements For Principal Firmware Engineer - Data Center Server Management

Python
Linux
  • 15+ years of experience in server firmware (BMC) and platform software development
  • BS, MS, or PhD in EE/CS or related field
  • Hands on experience with data center health management workflow
  • Strong knowledge of data center management and server architecture
  • Strong skills in C/C++ and Python
  • Experience in programming and debugging server platforms
  • Experience with SCM (Git, Perforce) and project management tools like Jira
  • Excellent written and oral communication skills
  • Self-starter with creative problem-solving abilities

Benefits For Principal Firmware Engineer - Data Center Server Management

Equity
  • Equity compensation
  • Comprehensive benefits package

Interested in this job?

Jobs Related To NVIDIA Principal Firmware Engineer - Data Center Server Management

System Software Architect, Programmable Vision Accelerator

Lead software architect role for NVIDIA's Programmable Vision Accelerator, focusing on embedded systems, computer vision, and machine learning acceleration.

Principal Autonomous Vehicles Engineer - Mapping and Localization

Principal Autonomous Vehicles Engineer position at NVIDIA, focusing on mapping and localization for self-driving technology, requiring 15+ years of experience in computer vision and C++ programming.

Principal Platform Software Engineer - OpenBMC Platform Architect

Lead next-generation data center server platform architecture at NVIDIA, focusing on firmware development and hardware integration for GPU baseboards.

Senior Firmware Architect - Server Manageability

Senior Firmware Architect position at NVIDIA focusing on server manageability and GPU-based AI servers, requiring expertise in firmware development and system architecture.

Sr. Director Engineering

Senior Director Engineering position at Qualcomm leading Application Processor/Modem chipset development, requiring 15+ years of semiconductor experience and strong technical leadership skills.