Taro Logo

System Software Engineer, Event Operations

NVIDIA is the world leader in accelerated computing, pioneering AI and digital twins technology.
$168,000 - $322,000
DevOps
Senior Software Engineer
Hybrid
5,000+ Employees
6+ years of experience
AI · Education

Description For System Software Engineer, Event Operations

NVIDIA, a pioneer in visual and accelerated computing for over 25 years, is seeking a System Software Engineer for their Deep Learning Institute (DLI). This role combines DevOps expertise with educational technology, focusing on ensuring the stability and reliability of NVIDIA's training platform. The position offers an opportunity to work at the intersection of AI education and platform engineering, maintaining and improving systems that enable thousands of developers to advance their AI skills.

The role involves managing the technical infrastructure for training events, implementing SRE principles, and ensuring seamless operation of the Learning Management System platform. You'll be working with cutting-edge technologies including containerization, cloud services, and NVIDIA's AI stack, while contributing to the company's mission of making AI education accessible and effective.

As part of NVIDIA's learning systems platform team, you'll collaborate with educators and technical teams to deliver exceptional learning experiences. The position requires strong DevOps skills, including experience with Kubernetes, cloud platforms (AWS, Azure, GCP), and infrastructure as code tools like Terraform. Knowledge of AI technologies, particularly generative AI and NVIDIA's tech stack, is highly valued.

The role offers competitive compensation ($168,000 - $322,000 base salary) plus equity and benefits, reflecting NVIDIA's position as one of technology's most desirable employers. This is an excellent opportunity for a seasoned DevOps engineer who wants to impact AI education while working with cutting-edge technology at a global leader in accelerated computing.

Last updated 2 days ago

Responsibilities For System Software Engineer, Event Operations

  • Develop comprehensive operational plans and de-risking strategies for technical training events
  • Provide technical leadership during live training events
  • Manage deployments and resolve emergent issues
  • Oversee platform stability, scalability, and reliability
  • Implement SRE principles and lead incident response
  • Lead cross-functional coordination
  • Establish and enforce operational best practices

Requirements For System Software Engineer, Event Operations

Python
Kubernetes
Linux
  • Bachelor's degree in Computer Science or related technical field
  • 6+ years of DevOps experience with containerized applications
  • Experience with Docker, Kubernetes across AWS, Azure, and GCP
  • Proficient in Python and Linux shell scripting
  • Experience with Terraform for cloud infrastructure
  • Strong analytical and problem-solving skills
  • Excellent communication and teamwork skills

Benefits For System Software Engineer, Event Operations

Medical Insurance
Equity
  • Competitive base salary
  • Equity compensation
  • Comprehensive benefits package

Interested in this job?

Jobs Related To NVIDIA System Software Engineer, Event Operations