Qualcomm's Cloud Computing team is seeking a Cloud Machine Learning LLM Serving Engineer to join their innovative team developing hardware and software for Machine Learning solutions. This role offers an exciting opportunity to work on cutting-edge AI technologies spanning data center, edge, infrastructure, and automotive markets. The position involves working with deep learning frameworks, optimizing ML models, and building software tools for AI infrastructure.
The ideal candidate will have strong expertise in deep learning, particularly with LLMs and various ML frameworks. You'll be responsible for improving and optimizing key Deep Learning models on Qualcomm AI 100, implementing kernels for AI workloads, and building framework extensions. The role requires excellent programming skills in C++/Python and a deep understanding of ML optimization techniques.
Working at Qualcomm offers unique advantages, including collaboration with leading engineering and technology experts, comprehensive benefits, and continuous learning opportunities. The company's culture promotes innovation and inclusive thinking, allowing you to contribute to world-changing technologies. The environment is fast-paced and requires strong communication skills due to regular cross-functional interaction.
This position offers growth opportunities in one of the world's leading technology companies, working on advanced AI and ML solutions that impact various industries. You'll be part of a team that values technical excellence, innovation, and professional development, with access to cutting-edge resources and technologies.