Taro Logo

Senior DevOps Engineer

NVIDIA is the world leader in accelerated computing, pioneering solutions in AI and digital twins.
DevOps
Senior Software Engineer
In-Person
8+ years of experience
AI · Enterprise SaaS
This job posting may no longer be active. You may be interested in these related jobs instead:

Description For Senior DevOps Engineer

NVIDIA, the world leader in accelerated computing, is seeking a Senior DevOps Engineer to join their Farm team. This role focuses on improving growing services infrastructure and requires a passionate individual dedicated to operational excellence. The position involves working with a diverse team of skilled engineers on critical infrastructure management and automation tasks.

The role combines hands-on technical work with strategic infrastructure planning, requiring expertise in multiple programming languages, cloud services, and modern DevOps practices. You'll be responsible for maintaining high-performance computing environments, implementing monitoring solutions, and ensuring system reliability.

Key technical areas include Linux systems, container orchestration, cloud platforms, and automation tools. The ideal candidate will have strong experience with CI/CD pipelines, infrastructure as code, and modern monitoring solutions like Grafana and Prometheus.

This position offers the opportunity to work on large-scale systems at a company at the forefront of AI and accelerated computing. You'll be part of a team that values continuous improvement and innovation, with chances to work on cutting-edge technology and contribute to NVIDIA's mission of transforming major industries through AI and digital twins.

Last updated 7 months ago

Responsibilities For Senior DevOps Engineer

  • Own services and work with cross-functional teams
  • Perform frequent code testing and deployment
  • Improve infrastructure provisioning and management using automation
  • Identify areas to improve service resiliency
  • Support globally distributed on-prem environment (LSF)
  • Determine root-cause for production incidents and write RCA reports
  • Ensure highest level of up-time and Quality of Service
  • Participate in team's on-call rotation

Requirements For Senior DevOps Engineer

Python
Go
Linux
Kubernetes
  • B.S. degree in Computer Science or related technical field or equivalent experience
  • 8+ years coding/scripting in high level programming languages
  • Experience with web applications, databases, APIs, and cloud platforms
  • Knowledge of operating services including web servers, load balancers, databases
  • Deep understanding of Linux operating system and TCP/IP fundamentals
  • Experience with cloud services (AWS, GCP, Azure)
  • Proficiency in monitoring tools like Grafana and Prometheus
  • Expertise in CI/CD, GitOps and Infrastructure as Code
  • Strong communication and documentation skills

Interested in this job?