AI Computing Software Development Engineer, TensorRT-LLM

NVIDIA

NVIDIA is the world leader in accelerated computing, pioneering solutions in AI and digital twins.

Taipei City, Taiwan • Hsinchu, North District, Hsinchu City, Taiwan

Machine Learning

Senior Software Engineer

In-Person

5,000+ Employees

3+ years of experience

Description For AI Computing Software Development Engineer, TensorRT-LLM

NVIDIA is seeking a Software Development Engineer to join its TensorRT-LLM team, focusing on LLM inference optimization. This role sits at the intersection of deep learning and high-performance computing, working on software that powers AI breakthroughs worldwide. The position involves developing and optimizing inference software for Large Language Models (LLMs), collaborating with cross-functional teams, and contributing to the evolution of AI computing platforms. The ideal candidate combines strong technical skills in Python, deep learning frameworks, and software engineering with an understanding of LLM architectures and inference techniques. The role offers exposure to cutting-edge technology and the chance to influence the future of AI infrastructure. NVIDIA offers competitive compensation and benefits and fosters an inclusive environment that values diversity and innovation. The position requires both technical expertise and collaborative skills, and suits someone passionate about advancing the field of AI computing.

Last updated 2 days ago

Responsibilities For AI Computing Software Development Engineer, TensorRT-LLM

Craft and develop robust inference software that can be scaled to multiple platforms for functionality and performance
Perform performance analysis, optimization, and tuning for Large Language Models (LLMs)
Follow academic developments in AI and update TensorRT-LLM accordingly
Provide feedback on architecture and hardware design and development
Collaborate across company teams to guide deep learning inference direction
Publish key results in scientific conferences

Requirements For AI Computing Software Development Engineer, TensorRT-LLM

Python

Master's degree or higher in Computer Engineering, Computer Science, Applied Mathematics, or a related computing-focused field
3+ years of relevant software development experience
Excellent Python programming, software design, and software engineering skills
Awareness of latest developments in LLM architectures and inference techniques
Experience with deep learning frameworks such as PyTorch and Hugging Face
Proactive and able to work without supervision
Excellent written and oral communication skills in English

Benefits For AI Computing Software Development Engineer, TensorRT-LLM

Medical Insurance

Dental Insurance

Vision Insurance

Competitive salaries
Generous benefits package
