
Senior System Software Engineer - DevOps and Infrastructure Automation

NVIDIA is the world leader in accelerated computing, pioneering AI and digital twins technology.
$148,000 - $287,500
DevOps
Senior Software Engineer
In-Person
5,000+ Employees
3+ years of experience
AI

Job Description

NVIDIA, the world leader in accelerated computing, is seeking a Senior DevOps Engineer for its AI Inference Operations Team. This role is an opportunity to work at the forefront of DevOps practices, taking ownership of critical systems that drive engineering innovation. The position involves building and maintaining infrastructure for AI Inferencing products, managing CI/CD pipelines, and implementing security measures.

The ideal candidate will have 3+ years of experience in Computer Science or related fields, with strong expertise in Python, Kubernetes, and cloud platforms. The role offers a competitive salary range of $148,000 - $287,500 USD (depending on level), plus equity and comprehensive benefits.

Working at NVIDIA means joining one of technology's most desirable employers, where you'll be part of a team driving success in Deep Learning and Artificial Intelligence. The company is committed to fostering a diverse work environment and values innovation and autonomous thinking.

This position offers the opportunity to work with cutting-edge technology, collaborate with cross-functional teams, and contribute to the development of AI infrastructure. You'll be responsible for designing and implementing solutions that improve efficiency and reliability across the organization, while working with external partners to achieve ambitious goals.

The role combines technical expertise with strategic thinking, requiring skills in infrastructure automation, Kubernetes, and observability. It's an excellent opportunity for someone passionate about infrastructure and DevOps to make a significant impact at one of the world's most innovative companies.


Responsibilities For Senior System Software Engineer - DevOps and Infrastructure Automation

  • Build and maintain infrastructure for AI Inferencing products, including Dynamo and NIXL
  • Maintain CI/CD pipelines to automate the build, test, and deployment processes
  • Enable scanning for, and handling of, security CVEs in infrastructure components
  • Collaborate with cross-functional teams to integrate pipelines from deep learning frameworks

Requirements For Senior System Software Engineer - DevOps and Infrastructure Automation

Python
Kubernetes
Linux
  • Master's degree or equivalent experience
  • 3+ years of experience in Computer Science, computer architecture, or related field
  • Excellent Bash and Python programming, CI/CD, and software design skills
  • Experience in administering, monitoring, and deploying systems on GitHub and cloud platforms
  • Highly skilled in Kubernetes and Docker/containerd
  • Experience in AWS, Azure or GCP
  • Knowledge of distributed systems programming

Benefits For Senior System Software Engineer - DevOps and Infrastructure Automation

  • Equity
  • Comprehensive benefits package
