NVIDIA is seeking a Senior DGX Cloud Performance Engineer to drive the performance analysis, optimization, and modeling of its DGX Cloud clusters. The role sits at the intersection of cloud computing and artificial intelligence and centers on NVIDIA's DGX™ Cloud platform, an end-to-end, scalable AI solution built on the latest NVIDIA architecture and co-engineered with leading cloud service providers.
The position requires deep expertise in parallel and distributed systems, with a focus on optimizing large-scale AI workloads. You'll be responsible for conducting comprehensive performance analysis of critical AI applications, developing benchmarks, and driving architectural decisions that shape the future of NVIDIA's cloud infrastructure.
Working closely with cross-functional teams, you'll help define DGX Cloud cluster architecture across cloud service providers, optimize workloads, and develop methodologies that advance hardware-software co-design. The role involves hands-on work with LLM workloads in industries such as healthcare, climate modeling, and financial services.
This is an exceptional opportunity for an experienced engineer to impact the future of AI infrastructure at scale. You'll be working at NVIDIA, a leader in accelerated computing, with competitive compensation including a base salary range of $224,000 to $425,500 (depending on level), equity, and comprehensive benefits. The position offers the chance to work with the latest technology while collaborating with some of the industry's brightest minds.