NVIDIA, the company that invented the GPU and revolutionized parallel computing, is seeking a Senior Platform Telemetry Engineer. The role is central to developing next-generation fleet management solutions for scaling AI infrastructure built on NVIDIA's GH200 superchip. It focuses on designing and implementing monitoring and fault-remediation solutions at scale.
The ideal candidate will drive architecture decisions, work directly with customers, and ensure the delivery of high-performance solutions for AI supercomputing platforms. The role combines deep technical expertise with strategic thinking and requires strong skills in C/C++, Python, and telemetry technologies. The work involves time series databases, REST APIs, and visualization solutions, in collaboration with cross-functional teams.
NVIDIA offers a base salary range of $148,000 to $287,500 USD, depending on level, plus equity and comprehensive benefits. The company's culture emphasizes innovation, autonomy, and creative problem-solving, which suits engineers who want to make a significant impact in AI computing. Working at NVIDIA means joining a team that is driving advances in AI, deep learning, and parallel computing, with the opportunity to influence the future of computing technology.