Software Engineer, Performance, AI Infrastructure

Tesla is an automotive and technology company and a leader in electric vehicles and AI development.
$104,000 - $360,000
Machine Learning
Senior Software Engineer
In-Person
5+ years of experience
AI · Automotive · Robotics

Description For Software Engineer, Performance, AI Infrastructure

Tesla is seeking a Senior Software Engineer to join its AI Infrastructure team, focusing on performance optimization for neural network training systems. The role supports both the Autopilot and humanoid robot initiatives and involves working with state-of-the-art GPU clusters and Tesla's supercomputer, Dojo. The position demands expertise in CUDA programming, deep learning frameworks, and high-performance computing. You'll be responsible for optimizing training workflows, reducing model convergence time, and maximizing hardware efficiency. Tesla offers a comprehensive benefits package and the opportunity to work on AI applications in autonomous driving and robotics. This role suits experienced engineers who want to push the limits of AI training performance and scalability.

Last updated 3 months ago

Responsibilities For Software Engineer, Performance, AI Infrastructure

  • Reduce wall-clock time to convergence of training jobs by identifying bottlenecks in the ML stack
  • Integrate efficient, low-level code with the overall high-level training framework
  • Profile workloads and implement solutions to increase training efficiency
  • Optimize workloads for efficient hardware utilization (CPU, GPU compute, data throughput, networking)

Requirements For Software Engineer, Performance, AI Infrastructure

Python
Linux
  • Extensive experience in CUDA kernel programming and pushing GPUs to their limits
  • Experience programming in Python
  • Experience with at least one deep learning framework (ideally PyTorch)
  • Demonstrated experience in profiling CPU/GPU code
  • Proficiency in system-level software, hardware-software interactions, and resource utilization
  • Good knowledge of CUDA kernels used in training state-of-the-art deep learning models
  • Experience with high-performance networking (InfiniBand, RDMA, NCCL)
  • Experience with Triton (preferred)

Benefits For Software Engineer, Performance, AI Infrastructure

Medical Insurance
Dental Insurance
Vision Insurance
401k
Mental Health Assistance
Parental Leave
Commuter Benefits
  • Medical plans with $0 payroll deduction
  • Family-building, fertility, adoption and surrogacy benefits
  • Dental and vision plans with $0 paycheck contribution
  • Company-paid HSA contribution
  • Healthcare and Dependent Care FSA
  • 401(k) with employer match
  • Employee Stock Purchase Plans
  • Company-paid basic life, AD&D, short-term and long-term disability insurance
  • Employee Assistance Program
  • Sick and vacation time
  • Back-up childcare and parenting support
  • Commuter benefits
  • Employee discounts and perks program
