Deep Learning Architect, LLM Inference - New College Graduate 2024

NVIDIA is the world leader in accelerated computing, pioneering solutions for challenges no one else can solve.
$104,000 - $189,750
Machine Learning
Entry-Level Software Engineer
In-Person
AI
This job posting may no longer be active. You may be interested in these related jobs instead:
Prompt Engineer

Join SuperDial as a Prompt Engineer to develop and optimize LLM solutions for healthcare workflows, combining AI expertise with real-world healthcare applications.

Founding Engineer (ML × SWE)

Foundry is seeking a Founding Engineer to build core ML systems and RL infrastructure for browser automation, offering competitive pay and equity.

2025 Software Development Engineer - Machine Learning

Entry-level Machine Learning Software Development Engineer position at Amazon, focusing on building innovative ML solutions and distributed systems.

Associate I AI Engineer

Entry-level AI Engineering position at S&P Global, focusing on developing AI solutions for financial markets and data analytics.

Software Developer - Oracle Labs

Entry-level software developer position at Oracle Labs focusing on machine learning and AI development.

Description For Deep Learning Architect, LLM Inference - New College Graduate 2024

NVIDIA is at the forefront of the generative AI revolution. The Inference Benchmarking (IB) team focuses on advanced inference server performance for Large Language Models (LLMs). As a Deep Learning Architect for LLM Inference, you'll be responsible for characterizing the latest LLMs and inference servers, collaborating with performance marketing teams, working with AI startup engineers, profiling GPU kernel-level performance, developing analysis tools, contributing to deep learning software projects, verifying TRT-LLM performance, and guiding the direction of inference serving across the company.

Key responsibilities include:

  • Characterizing LLMs and inference servers like vLLM and DeepSpeed-MII
  • Creating content to highlight TRT-LLM achievements
  • Collaborating with AI startup engineers
  • Profiling GPU performance and identifying optimization opportunities
  • Developing profiling and analysis software tools
  • Contributing to projects like PyTorch, vLLM, and LLMPerf
  • Verifying TRT-LLM performance for new GPU product launches
  • Collaborating across teams to ensure world-class performance

Requirements:

  • Master's or PhD in Computer Science, Electrical Engineering, or related fields
  • Knowledge of deep learning inference serving, PyTorch, and compiler optimizations
  • Proficiency in C++ and Python, familiarity with CUDA
  • Experience with LLMs and their performance challenges
  • Understanding of CPU and GPU microarchitecture
  • Experience with complex software projects

Preferred qualifications:

  • Drive to improve software and hardware performance
  • History of developing workplace efficiency tools
  • Experience with database and visualization tools like D3.js

NVIDIA offers a competitive base salary range of $104,000 - $189,750 USD, along with equity and comprehensive benefits. Join a team of highly skilled professionals in one of the technology world's most desirable employers.

Last updated 7 months ago

Responsibilities For Deep Learning Architect, LLM Inference - New College Graduate 2024

  • Characterize latest LLMs and inference servers
  • Collaborate with performance marketing team
  • Work with AI startup engineers
  • Profile GPU kernel-level performance
  • Develop profiling and analysis software tools
  • Contribute to deep learning software projects
  • Verify TRT-LLM performance for new GPU product launches
  • Guide the direction of inference serving across the company

Requirements For Deep Learning Architect, LLM Inference - New College Graduate 2024

Python
  • Master's or PhD in Computer Science, Electrical Engineering, or related fields
  • Knowledge of deep learning inference serving, PyTorch, and compiler optimizations
  • Proficiency in C++ and Python, familiarity with CUDA
  • Experience with LLMs and their performance challenges
  • Understanding of CPU and GPU microarchitecture
  • Experience with complex software projects like compilers, operating systems, or frameworks

Benefits For Deep Learning Architect, LLM Inference - New College Graduate 2024

Equity
  • Equity
  • Comprehensive benefits

Interested in this job?