Taro Logo

AI Computing Software Development Engineer, TensorRT-LLM

NVIDIA is the world leader in accelerated computing, pioneering solutions in AI and digital twins.
Taipei City, TaiwanHsinchu, North District, Hsinchu City, Taiwan
Machine Learning
Senior Software Engineer
In-Person
5,000+ Employees
3+ years of experience
AI

Description For AI Computing Software Development Engineer, TensorRT-LLM

NVIDIA is seeking a Software Development Engineer to join their TensorRT-LLM team, focusing on LLM inference optimization. This role sits at the intersection of deep learning and high-performance computing, working on software that powers AI breakthroughs worldwide. The position involves developing and optimizing inference software for Large Language Models, collaborating with cross-functional teams, and contributing to the evolution of AI computing platforms. The ideal candidate will combine strong technical skills in Python, deep learning frameworks, and software engineering with an understanding of LLM architectures and inference techniques. This role offers the opportunity to work at a leading technology company that's driving innovation in AI and accelerated computing, with exposure to cutting-edge technology and the chance to influence the future of AI infrastructure. NVIDIA offers competitive compensation and benefits, fostering an inclusive environment that values diversity and innovation. The position requires both technical expertise and collaborative skills, making it ideal for someone passionate about advancing the field of AI computing.

Last updated 2 days ago

Responsibilities For AI Computing Software Development Engineer, TensorRT-LLM

  • Craft and develop robust inference software that can be scaled to multiple platforms for functionality and performance
  • Performance analysis, optimization, and tuning for Large Language Models (LLMs)
  • Follow academic developments in AI and update TensorRT-LLM
  • Provide feedback into architecture and hardware design and development
  • Collaborate across company teams to guide deep learning inference direction
  • Publish key results in scientific conferences

Requirements For AI Computing Software Development Engineer, TensorRT-LLM

Python
  • Master or higher degree in Computer Engineering, Computer Science, Applied Mathematics or related computing focused degree
  • 3+ years of relevant software development experience
  • Excellent Python programming skills, software design, and software engineering skills
  • Awareness of latest developments in LLM architectures and inference techniques
  • Experience with deep learning frameworks like PyTorch and HuggingFace
  • Proactive and able to work without supervision
  • Excellent written and oral communication skills in English

Benefits For AI Computing Software Development Engineer, TensorRT-LLM

Medical Insurance
Dental Insurance
Vision Insurance
  • Competitive salaries
  • Generous benefits package

Interested in this job?

Jobs Related To NVIDIA AI Computing Software Development Engineer, TensorRT-LLM