NVIDIA is seeking a Platform Reliability Engineer to join their team working on the Unified Commerce Platform (UCP). This role is crucial in maintaining the reliability and excellence of their commerce platform that handles critical functions like subscription management, payment processing, and fraud prevention. The position requires a unique blend of software engineering expertise and reliability engineering mindset.
The ideal candidate will be responsible for developing and implementing comprehensive testing frameworks, automation solutions, and reliability processes that ensure the platform meets its SLA commitments across all tenant environments. This includes creating automated testing strategies, performance monitoring systems, and proactive issue identification mechanisms.
As a Platform Reliability Engineer, you'll work at the intersection of development and reliability assurance, directly impacting customer trust and platform stability. The role involves designing test frameworks for various levels of testing, from unit tests to end-to-end validation, while also implementing monitoring solutions and establishing reliability processes.
The position offers the opportunity to work with cutting-edge commerce platform technology while ensuring its reliability and performance. You'll be part of a team that values both technical excellence and customer satisfaction, working on systems that process sensitive financial data and require the highest standards of security and reliability.
This role at NVIDIA, the world leader in accelerated computing, offers the chance to work on systems that directly impact business operations and customer experience. The company's focus on AI and digital twins technology makes this an exciting opportunity for someone passionate about reliability engineering in a cutting-edge technical environment.