Taro Logo

DL Computing Performance Architect

NVIDIA is the world leader in accelerated computing, pioneering solutions for challenges no one else can solve. Their work in AI and digital twins is transforming major industries and impacting society.
Machine Learning
Senior Software Engineer
Hybrid
4+ years of experience
AI
This job posting may no longer be active. You may be interested in these related jobs instead:

Description For DL Computing Performance Architect

NVIDIA is seeking a technical expert to lead their DL performance projections and analysis effort. This role offers an opportunity to make a significant impact in a fast-paced, technology-focused company. The DL Computing Performance Architect will be responsible for establishing DL applications and use-cases for analysis and projections, specifying hardware/software configurations, creating and maintaining workloads and micro-benchmark suites, generating projections and analysis reports, and collaborating across the company to guide the direction of next-gen deep learning HW/SW.

Key responsibilities include:

  • Establishing DL applications and use-cases for analysis and projections
  • Specifying hardware/software configurations and metrics for performance analysis
  • Creating and maintaining workloads and micro-benchmark suites
  • Generating projections, comparisons, and analysis reports
  • Collaborating with architecture, software, and product teams

Requirements:

  • 4+ years of relevant industry experience
  • Strong software skills with C/C++, Python, MPI, OpenMP, etc.
  • Experience in DL workload and operator optimization and performance analysis (preferred)
  • Familiarity with GPU computing and parallel programming models (preferred)
  • Excellent oral and written communication skills
  • Strong organizational, time management, and task prioritization skills

This position offers the chance to work at the forefront of deep learning and high-performance computing, contributing to NVIDIA's innovative technologies that are shaping various industries. Join NVIDIA to be part of a team that's driving advancements in AI, automotive, and HPC applications.

Last updated 8 months ago

Responsibilities For DL Computing Performance Architect

  • Establish DL applications and use-cases for analysis and projections
  • Specify hardware/software configurations and metrics to analyze performance, power, accuracy and resiliency in uniprocessor and multiprocessor configurations
  • Create and maintain workloads and micro-benchmark suites
  • Generate projections, comparisons and analysis reports for internal/external consumption
  • Collaborate across the company to guide the direction of next-gen deep learning HW/SW by working with architecture, software and product teams

Requirements For DL Computing Performance Architect

Python
  • 4+ years working experience in relevant industry
  • Strong software skills with C/C++, Python, MPI, OpenMP etc.
  • Experience of DL workload and operator optimization and performance analysis (preferred)
  • Familiarity with GPU computing and parallel programming models (preferred)
  • Excellent oral and written communication skills
  • Good organizational, time management and task prioritization skills

Interested in this job?