Taro Logo

Senior Platform Telemetry Engineer

World leader in accelerated computing, pioneering AI and digital twins technology to transform industries.
$148,000 - $287,500
Backend
Senior Software Engineer
Remote
5,000+ Employees
5+ years of experience
AI · Enterprise SaaS

Job Description

NVIDIA, the pioneering company that invented the GPU and revolutionized parallel computing, is seeking a Senior Platform Telemetry Engineer to join their innovative team. This role is crucial in developing next-generation fleet management solutions for scaling AI infrastructure using NVIDIA's GH200 superchip. The position offers a unique opportunity to work at the forefront of AI computing, focusing on designing and implementing sophisticated monitoring and fault-remediation solutions at scale.

The ideal candidate will be responsible for driving architecture decisions, working directly with customers, and ensuring the delivery of high-performance solutions for AI supercomputing platforms. This role combines deep technical expertise with strategic thinking, requiring strong skills in C/C++, Python, and various telemetry technologies. You'll work with time series databases, REST APIs, and visualization solutions while collaborating with cross-functional teams to deliver robust, scalable solutions.

NVIDIA offers a competitive compensation package with a base salary range of $148,000 - $287,500 USD (depending on level), plus equity and comprehensive benefits. The company's culture emphasizes innovation, autonomy, and creative problem-solving, making it an ideal environment for engineers who want to make a significant impact in the AI computing space. Working at NVIDIA means being part of a team that's driving technological advancement in AI, deep learning, and parallel computing, with the opportunity to influence the future of computing technology.

Last updated a day ago

Responsibilities For Senior Platform Telemetry Engineer

  • Drive next generation fleet management solutions for scaling AI infrastructure
  • Design architecture for fleet health monitoring and fault-remediation solution at scale
  • Write architecture specs and design documents
  • Own end to end delivery of product by working across teams
  • Conduct code reviews
  • Work with QA teams to productize the code
  • Contribute to all phases of product development
  • Educate customers about product architecture and incorporate feedback

Requirements For Senior Platform Telemetry Engineer

Python
Linux
  • BS, MS, or PhD in EE/CS or related field
  • 5+ years hands-on coding experience
  • Strong knowledge of time series databases (Influxdb & Prometheus)
  • Experience with REST APIs
  • Knowledge of telemetry visualization solutions
  • Strong firmware architecture knowledge
  • Strong C/C++ and Python programming skills
  • Experience with SCM and project management tools
  • Excellent written and oral communication skills

Benefits For Senior Platform Telemetry Engineer

Equity
  • Equity
  • Comprehensive benefits package