Senior DevTech Engineer - Windows LLM and GenAI Open-Source Ecosystem

NVIDIA is the world leader in accelerated computing, pioneering visual computing and GPU technology.
Machine Learning
Senior Software Engineer
Hybrid
5+ years of experience
AI · Enterprise SaaS

Description For Senior DevTech Engineer - Windows LLM and GenAI Open-Source Ecosystem

NVIDIA, a pioneer in visual computing and GPU technology, is seeking a Senior DevTech Engineer to join its team focused on the Windows LLM and GenAI open-source ecosystem. This role sits at the intersection of cutting-edge AI technology and GPU computing, where you'll work on enabling Windows AI enthusiasts and developers with innovative models and functionality.

The position involves working with large language models (LLMs) and Generative AI, contributing to open-source projects like PyTorch and llama.cpp, and optimizing performance on NVIDIA RTX platforms. You'll be responsible for improving user experience, solving deployment challenges, and working closely with both internal teams and external partners.

As an ideal candidate, you bring 5+ years of experience in GPU deployment and optimization, strong programming skills in C/C++ and Python, and a deep understanding of transformer architectures and LLMs. Your role will be crucial in shaping the future of AI deployment on Windows platforms and influencing next-generation GPU features.

NVIDIA offers a competitive compensation package and a work environment that promotes diversity, inclusion, and flexibility. You'll be part of a team that's driving innovation in AI, High-Performance Computing, and Visualization, working on technology that's transforming industries and society.

This position offers an exciting opportunity to work at the forefront of AI technology, combining technical expertise with practical application while collaborating with industry leaders and innovative partners. Join NVIDIA to help shape the future of AI computing and make a significant impact in the field of machine learning and generative AI.

Last updated 15 days ago

Responsibilities For Senior DevTech Engineer - Windows LLM and GenAI Open-Source Ecosystem

  • Improve Windows LLM & GenAI user experience on NVIDIA RTX
  • Engage with internal product teams and external OSS maintainers
  • Solve local, end-to-end LLM & Generative AI GPU deployment challenges
  • Apply profiling and debugging tools for analyzing GPU-accelerated AI applications
  • Conduct training sessions, develop sample code, and host presentations
  • Guide developers on efficient adoption of DL frameworks
  • Collaborate with GPU driver and architecture teams

Requirements For Senior DevTech Engineer - Windows LLM and GenAI Open-Source Ecosystem

Python
Linux
  • 5+ years of professional experience in local GPU deployment, profiling and optimization
  • BS or MS degree in Computer Science, Engineering, or a related field
  • Strong proficiency in C/C++, Python, and software design
  • Familiarity with Windows operating system
  • Understanding of Transformer architectures and LLMs
  • Experience with open-source LLM and GenAI software
  • Experience with CUDA and NVIDIA's Nsight GPU profiling tools
  • Strong verbal and written communication skills in English
  • Excellent interpersonal skills
  • Willingness to travel for conferences and partner visits

Benefits For Senior DevTech Engineer - Windows LLM and GenAI Open-Source Ecosystem

  • Competitive salaries
  • Extensive benefits package

