We are seeking a deeply technical, creative, and hands-on software engineer to pioneer the next generation of scalable, always-available profiling services. This role will enable developers worldwide to harness the full power of NVIDIA GPUs. We are looking for someone who can help us build the best possible experience for ML performance engineers seeking to debug, profile, and optimize their training and serving pipelines using next-generation profiling technologies.
What you'll be doing: Develop tools and features for NVIDIA GPUs that enable ML engineers to profile long-running ML workloads on single node and multi-node clusters. Synthesize customer's performance analysis use cases into the key GPU performance metrics required to advise those insights. Use NVIDIA GPU performance monitoring system and design efficient hardware performance counter arrangements for observation. Optimize GPU profiling tools to minimize overheads, improve observability, and make smart tradeoffs between observability and observer effects. Innovate and improve our GPU profiling library with new features to maximize ML application performance.
What we need to see:
Ways to stand out from the crowd:
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.