Taro Logo

Software Engineer - LLM Inference

CentML's mission is to massively reduce the cost of developing and deploying ML models to enable anyone to harness the power of AI and everyone to benefit from its potential.
Machine Learning
Mid-Level Software Engineer
Hybrid
2+ years of experience
This job posting may no longer be active. You may be interested in these related jobs instead:

Description For Software Engineer - LLM Inference

CentML is on a mission to revolutionize AI accessibility by drastically reducing the costs associated with developing and deploying ML models. As a Software Engineer specializing in LLM Inference, you'll be at the forefront of making large language models more efficient, scalable, and accessible.

In this role, you'll work on architecting and implementing cutting-edge inference stacks for LLMs. You'll collaborate with diverse teams focusing on resource orchestration, distributed systems, inference engine optimization, and high-performance GPU kernel development. Your responsibilities will include writing high-quality code, conducting benchmarks and profiling, and ensuring the scalability of our core backend software.

The ideal candidate has a strong background in computer science or a related field, with at least 2 years of industry experience. You should be proficient in Python and C/C++, and have a passion for machine learning and performance engineering. Experience with LLMs, GPU programming, or distributed systems is a plus.

Join CentML to contribute to the democratization of Machine Learning and be part of a team that values diversity, offers competitive benefits, and provides opportunities for professional growth. If you're ready to make a significant impact in the world of AI, this is your chance to shine!

Last updated 10 months ago

Responsibilities For Software Engineer - LLM Inference

  • Write safe, scalable, modular, and high-quality (C++/Python) code for our core backend software.
  • Perform benchmarking, profiling, and system-level programming for GPU applications.
  • Provide code reviews, design docs, and tutorials to facilitate collaboration among the team.
  • Conduct unit tests and performance tests for different stages of the inference pipeline.

Requirements For Software Engineer - LLM Inference

Python
Linux
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience.
  • Strong coding skills in Python and C/C++.
  • 2+ years of industry experience in software engineering.
  • Knowledgeable and passionate about machine learning and performance engineering.

Benefits For Software Engineer - LLM Inference

Equity
Medical Insurance
Dental Insurance
Parental Leave
Education Budget
  • An open and inclusive work environment
  • Employee stock options
  • Best-in-class medical and dental benefits
  • Parental Leave top-up for 6 months
  • Professional development budget
  • Flexible vacation time to promote a healthy work-life blend

Interested in this job?