Senior MLSys Engineer - Kernel Optimization

OctoAI is a leading startup in the fast-paced generative AI market, delivering generative AI infrastructure to run, tune, and scale models that power AI applications.
$175,000 - $240,000
Backend
Senior Software Engineer
Remote
51 - 100 Employees
5+ years of experience
This job posting may no longer be active. You may be interested in these related jobs instead:
Sr. Software Development Engineer, Amazon

Senior Software Development Engineer position at Amazon's CE Tech team, focusing on building AI-driven recommendation systems and scalable services to enhance customer shopping experience.

Sr Software Development Engineer, Amazon Fulfillment Technologies (AFT) - Platform Engineering & Services

Senior Software Development Engineer role at Amazon Fulfillment Technologies, building scalable fulfillment systems and ML-powered platforms to enhance warehouse operations efficiency.

System Software Engineer, Tools

Senior System Software Engineer position at Annapurna Labs (AWS) developing tools and software solutions for cloud platform development, requiring 5+ years experience in software development.

Senior Software Engineer

Senior Software Engineer role at Microsoft working on OneDrive and SharePoint cloud services, focusing on system design, migration, and infrastructure development.

Senior Software Engineer - C/C++

Senior Software Engineer position at Microsoft focusing on Windows kernel and driver development, requiring 7+ years of C/C++ experience and system-level programming expertise.

Description For Senior MLSys Engineer - Kernel Optimization

OctoAI is a leading startup in the generative AI market, focused on empowering businesses to build differentiated applications with the latest AI features. As a Senior MLSys Engineer specializing in Kernel Optimization, you'll join the Automation team to develop the most efficient engine for generative model deployment.

Your role will involve:

  • Developing and optimizing high-performance computing kernels for GPU acceleration
  • Implementing solutions in C/C++ and Python
  • Deep diving into GPU performance optimizations
  • Working on kernel optimizations for CUDA or other accelerators
  • Collaborating on machine learning compilers and frameworks

We're looking for candidates with:

  • A degree in Computer Science, Electrical Engineering, or related field
  • Strong programming skills in C/C++ and Python
  • Deep understanding of GPU performance optimizations
  • Experience with kernel optimizations on CUDA or other accelerators
  • Contributions to innovative projects like Cutlass, FlashAttention, or vllm

OctoAI offers a comprehensive benefits package, including fully covered healthcare premiums, competitive compensation with stock options, 401(k), flexible work options, generous time off, and parental leave.

Join us in our mission to make models work for developers, allowing them to focus on building apps that wow their customers without becoming AI infrastructure experts. We value diversity, creativity, and a balanced life. If you're passionate about pushing the boundaries of AI infrastructure, we want to hear from you!

Last updated 9 months ago

Responsibilities For Senior MLSys Engineer - Kernel Optimization

  • Develop and optimize high-performance computing kernels with a focus on GPU acceleration
  • Implement and enhance programming solutions in C/C++ and Python
  • Deep dive into GPU performance optimizations to maximize efficiency and speed
  • Work on kernel optimizations specifically for CUDA or other accelerators
  • Collaborate with the team to extend and improve existing machine learning compilers or frameworks

Requirements For Senior MLSys Engineer - Kernel Optimization

Python
  • Bachelor's, Master's or PhD's degree in Computer Science, Electrical Engineering, or a related field
  • Strong programming skills in C/C++ and Python
  • Deep understanding and experience in GPU performance optimizations
  • Proven experience with kernel optimizations on CUDA or other accelerators
  • Proven experience contributing to innovative OSS/closed source projects like Cutlass, FlashAttention, FlashInfer, mlc-llm, vllm

Benefits For Senior MLSys Engineer - Kernel Optimization

Medical Insurance
Dental Insurance
Vision Insurance
401k
Parental Leave
  • Fully covered healthcare premiums for employees and dependents (Medical, Dental, Vision, Life Insurance, Disability Insurance)
  • Competitive compensation with salary, bonuses, and stock options
  • Flexible Spending Accounts and Health Savings Account
  • 401(k) options
  • Flexible work options and hours
  • Generous time off policies
  • Comprehensive parental leave
  • Volunteer Time Off (4 days/year)
  • Additional leaves (disability, paid family medical leave, paid military leave)

Interested in this job?