Taro Logo

Cloud Machine Learning LLM Serving Engineer

A global leader in wireless technology innovation and the development of mobile technologies.
Machine Learning
Mid-Level Software Engineer
In-Person
5,000+ Employees
2+ years of experience
AI · Enterprise SaaS

Description For Cloud Machine Learning LLM Serving Engineer

Qualcomm's Cloud Computing team is seeking a Cloud Machine Learning LLM Serving Engineer to join their innovative team developing hardware and software for Machine Learning solutions. This role offers an exciting opportunity to work on cutting-edge AI technologies spanning data center, edge, infrastructure, and automotive markets. The position involves working with deep learning frameworks, optimizing ML models, and building software tools for AI infrastructure.

The ideal candidate will have strong expertise in deep learning, particularly with LLMs and various ML frameworks. You'll be responsible for improving and optimizing key Deep Learning models on Qualcomm AI 100, implementing kernels for AI workloads, and building framework extensions. The role requires excellent programming skills in C++/Python and a deep understanding of ML optimization techniques.

Working at Qualcomm offers unique advantages, including collaboration with leading engineering and technology experts, comprehensive benefits, and continuous learning opportunities. The company's culture promotes innovation and inclusive thinking, allowing you to contribute to world-changing technologies. The environment is fast-paced and requires strong communication skills due to regular cross-functional interaction.

This position offers growth opportunities in one of the world's leading technology companies, working on advanced AI and ML solutions that impact various industries. You'll be part of a team that values technical excellence, innovation, and professional development, with access to cutting-edge resources and technologies.

Last updated 4 days ago

Responsibilities For Cloud Machine Learning LLM Serving Engineer

  • Improve and optimize key Deep Learning models on Qualcomm AI 100
  • Build deep learning framework extensions for Qualcomm AI 100
  • Implement Kernels for AI workloads
  • Collaborate with internal teams to analyze and optimize training and inference for deep learning
  • Build software tools and ecosystem around AI SW Stack
  • Work on vLLM, Triton, ExecuTorch, Inductor, TorchDynamo
  • Optimize workloads for both scale-up and scale-out systems
  • Optimize the entire deep learning pipeline including graph compiler integration

Requirements For Cloud Machine Learning LLM Serving Engineer

Python
  • Bachelor's/Masters/PHD degree in Engineering, Machine learning/AI, Information Systems, Computer Science, or related field
  • 2+ years Software Engineering or related work experience
  • 2+ years experience with Programming Language such as C++, Python
  • Deep Learning experience or knowledge – LLMs, NLP, Vision, Audio, Recommendation systems
  • Knowledge of Pytorch, TensorFlow software stacks
  • Excellent C/C++/Python programming and software design skills
  • Experience in using C++ 14 (advanced features)
  • Experience of profiling software and optimization techniques

Benefits For Cloud Machine Learning LLM Serving Engineer

Medical Insurance
401k
Mental Health Assistance
Education Budget
  • World-class health benefit coverage
  • Financial future preparation programs
  • Emotional/mental strength support
  • Wellbeing programs
  • Tuition reimbursement
  • Mentorship programs

Interested in this job?

Jobs Related To Qualcomm Cloud Machine Learning LLM Serving Engineer

Solutions Engineer - Netherlands

Solutions Engineer position at Qualcomm Netherlands, focusing on Edge AI implementation and machine learning solutions for edge devices, requiring 4+ years of software engineering experience.

Software Engineer, Machine Learning Group

Software Engineer position at Qualcomm's Machine Learning Group, focusing on AI/ML software development, model architecture optimization, and tools creation for enhanced customer experiences.

Solutions Engineer [Pre-sales] - UK

Solutions Engineer position at Qualcomm focusing on Edge AI and machine learning, combining pre-sales engineering with technical implementation of ML solutions.

Engineer, Cloud ML Accelerator

Software engineering role at Qualcomm focusing on Linux user-space development for machine learning acceleration, requiring expertise in C++, system architecture, and AI infrastructure.

AI Software Engineer

AI Software Engineer position at Qualcomm Bangalore, focusing on C++/Python development and machine learning, requiring 2+ years of experience in software engineering.