NVIDIA, the world leader in accelerated computing, is seeking a Deep Learning Performance Architect to join their AI performance modeling team. This role focuses on developing and optimizing deep learning systems, particularly for LLM workloads, on state-of-the-art hardware architectures. The position offers an opportunity to work at the intersection of hardware and software optimization for AI systems, making significant contributions to NVIDIA's next-generation inference products.
The ideal candidate will analyze cutting-edge deep learning networks, develop analytical models, and work on performance optimization across both hardware and software domains. They will collaborate with architecture, software, and product teams to influence the direction of future deep learning solutions. This role requires extensive experience with AI models, particularly LLMs and AIGC models, along with deep knowledge of machine learning frameworks and hardware architectures.
NVIDIA offers a competitive compensation package and is known for being one of the technology industry's most desirable employers. The company maintains a strong commitment to diversity and inclusion, fostering an innovative and collaborative work environment. This position provides an exceptional opportunity to work on groundbreaking technology that is transforming industries while being part of a forward-thinking team at a company that continues to push the boundaries of what's possible in AI and accelerated computing.