
Principal Firmware Engineer - Data Center Server Management

World leader in accelerated computing, pioneering AI and digital twins technology.
$272,000 - $471,500
Embedded
Principal Software Engineer
Hybrid
5,000+ Employees
15+ years of experience
AI · Enterprise SaaS

Description For Principal Firmware Engineer - Data Center Server Management

NVIDIA, the pioneering force behind the GPU and modern AI computing, is seeking a Principal Firmware Engineer to lead its data center server management initiatives. This role sits at the intersection of hardware and software, focusing on the NVIDIA GH200 superchip platform designed for HPC and generative AI workloads. The position offers an opportunity to architect end-to-end manageability solutions for next-generation AI supercomputing platforms.

As a Principal Firmware Engineer, you'll be responsible for driving server management solutions for large-scale GPU and Grace solution deployments. The role requires deep technical expertise in server firmware, platform software development, and data center health management. You'll work closely with internal teams, component leads, and customers to design and implement robust solutions that meet the demanding requirements of modern data centers.

The ideal candidate brings 15+ years of relevant experience and a strong educational background in Computer Science or Electrical Engineering. You'll need to demonstrate expertise in C/C++, Python, and server architecture, along with a proven track record of delivering firmware solutions for large data centers. This position offers competitive compensation, including a base salary range of $272,000 - $471,500, plus equity and benefits.

NVIDIA's commitment to innovation and its position as "the AI computing company" make this an exciting opportunity for someone looking to work at the forefront of technological advancement. The role combines technical leadership with hands-on development, requiring both architectural vision and practical implementation skills. You'll be joining a company that's transforming industries through AI and accelerated computing, making it an ideal position for those passionate about pushing the boundaries of technology.

Last updated 2 days ago

Responsibilities For Principal Firmware Engineer - Data Center Server Management

  • Drive server management for large clusters and data centers deploying GPUs and Grace solutions
  • Work with data center architects and cloud customers to define requirements
  • Ensure requirements are designed and implemented correctly in firmware and software modules
  • Design and build data center health management workflows
  • Drive reliability and optimization in firmware architecture
  • Work with the cluster bring-up team to resolve issues
  • Own firmware delivery to data centers for quality, reliability and telemetry performance

Requirements For Principal Firmware Engineer - Data Center Server Management

Python
Linux
  • 15+ years of experience in server firmware (BMC) and platform software development
  • BS, MS, or PhD in EE/CS or related field
  • Hands-on experience with data center health management workflows
  • Strong knowledge of data center management and server architecture
  • Strong skills in C/C++ and Python
  • Experience in programming and debugging server platforms
  • Experience with SCM (Git, Perforce) and project management tools like Jira
  • Excellent written and oral communication skills
  • Self-starter with creative problem-solving abilities

Benefits For Principal Firmware Engineer - Data Center Server Management

  • Equity
