CentML is on a mission to revolutionize AI accessibility by drastically reducing the cost of developing and deploying ML models. As a Software Engineer specializing in LLM Inference, you'll be at the forefront of making large language models more efficient, scalable, and accessible.
In this role, you'll architect and implement cutting-edge inference stacks for LLMs. You'll collaborate with teams focused on resource orchestration, distributed systems, inference engine optimization, and high-performance GPU kernel development. Your responsibilities will include writing high-quality code, benchmarking and profiling, and ensuring the scalability of our core backend software.
The ideal candidate has a strong background in computer science or a related field and at least 2 years of industry experience. You should be proficient in Python and C/C++ and have a passion for machine learning and performance engineering. Experience with LLMs, GPU programming, or distributed systems is a plus.
Join CentML to contribute to the democratization of Machine Learning and be part of a team that values diversity, offers competitive benefits, and provides opportunities for professional growth. If you're ready to make a significant impact in the world of AI, this is your chance to shine!