When I first entered the ML field 10+ years ago, classical models such as SVMs and Random Forests were used, and it seemed like there more "deep" modeling problems to tackle. At Google it seems like unless you work on a core LLM team (such as DeepMind), a lot of ML work involves using pre-trained LLMs by prompt tuning (or occasionally fine tuning them) and managing data pipelines. It seems less common to be writing code to manage tensors, these days. I still enjoy my role very much, but I'm curious: what are ML roles like at other companies?
I love this question! Here's my experience after interviewing ~20 companies and working at a few in the last 2 years
This is your using XGBoost to build some prediction models for specific tasks. Example: notification personalization at ecommerce company. A lot of the skill here is really data engineering with spark, and understanding classical models. Feature engineering
At about 4-5 companies (FAANG, newly IPO'ed unicorns, late stage startups) a lot of the work was using the tons of data they had to train deep learning models + build training pipelines, checkpointing all that for transformer based models for ad ranking, using sensor data to generate health predictions + some companies where their whole thing is building ML models for a very specific use case e.g. ML for heart issue diagnostics
A lot of new startups are fine tuning LLMs for very specific use cases that are unlocked by LLM. Eg. document parsing for vertical X
These are research engineering roles for extremely competitive companies working on tough problems where the reward is really high once you crack it. E.g. LLM optimization, RL models for markets, Distributed LLM inference with privacy preserving, robotics, AGI labs
nice breakdown! What are examples of moonshot research companies?
I imagine a lot of that work is happening in Big Tech already
I'm interested to learn more about the ML work at google. I'm surprised there arent a ton of projects that involve training pytorch based models?
My 1 guess is that google is a super mature company so not many ML focused projects exists as a lot of it is saturated and handled by PhDs unlike some of the pre/recent IPO companies where theres a ton of low hanging fruit where even basic deep learning architectures are enough