Taro Logo
2

Are applied MLE skills are not useful at a startup?

Profile picture
Mid-Level Software Engineer [L4] at Google2 months ago

Hi All,

I work on ML in Google Ads. I spend most of my time training models and running experiments. I spent little time writing code but more on analyzing the experiments, followed by hypothesizing and designing new experiments.

Most startups are using LLMs from frontier labs or open-source models and prompting them to work for their use case. It feels that my knowledge in building models is not particularly useful because these startups are not building LLMs from scratch, and most of the work is done in prompting, evals, and building infrastructure to support these models. I feel this mainly involves backend skills and the required ML skills like prompting, evals can easily be picked up by general SWEs too.

I feel that, unless you work on frontier AI research, the skills of an applied MLE are not that useful in building or working at a startup.

Would love to hear what people think about this.

Thanks!

28
2

Discussion

(2 comments)
  • 2
    Profile picture
    Thoughtful Tarodactyl
    Taro Community
    a month ago

    There are plenty of startups that hire for research engineering roles (not frontier LLM research). I interviewed at a few as well. I wouldn't count yourself out.

    Right now there are a TON of startups trying to take a slice of the LLM wave so finding good startups with LLM work is just noisy now. I also think fine tuning skills are important because many companies are building fine-tuned LLMs on proprietary datasets (eg medical/financial models)

    https://fal.ai/careers/applied-ml-engineer

    https://elevenlabs.io/careers/3d650946-5ac2-4729-9ae4-129c43fcd0b5/research-engineer?ashby_jid=3d650946-5ac2-4729-9ae4-129c43fcd0b5

    https://jobs.ashbyhq.com/twelve-labs/c3c6c6c2-795c-4eff-a236-5f6059f6d112

  • 2
    Profile picture
    Software Engineer
    a month ago

    Hm. In the case that you're focusing on model training/serving, it is true that your skills will be much better compensated working for a hot AI mega-startup or Google/Meta. Almost all other companies simply don't have the scale/focus on AI to commercialize that skill. An alternative option is to try to work at a startup as a Research Engineer, but the startup will probably go under if they require a heavy R&D arm.

    But isn't it an issue that ML skills have a limited employer set? Not really, it's niche but there are still enough companies that you'll always have a set of companies you can work for. The research topics will definitely evolve though.

    Maybe you should think of it in tiers:

    • Model building/serving at Google/Meta/OpenAI/Anthropic
    • Model building/serving at a Tier 1-2 company (Databricks, etc)
    • Research style engineering at a larger startup (Elevenlabs, Inflection AI, etc)

    Just be careful with startups, a lot of founders have gone in way over their heads and just don't know it yet