How is project risk management done in big tech? Do they have a standard risk management process that defines how to identify risks, the levels of impact and probability to categorize risks, RACI matrix for risks, how to report risks...etc... Or is that left to each project manager to do on their own?
Is there any good risk management process that you can describe?
Big Tech isn't a monolith - Each FAANG company is very different in terms of culture, and it will vary even more based on team and individual people. Back at Meta, I worked across Portal and Instagram Ads, and these 2 orgs had very different ways of doing things. I think why Meta and many other Big Tech companies are so successful is that they're very good at empowering engineers and having a more bottoms-up culture in terms of allowing SWEs to organize their projects in a way that fits for them.
To answer the question though, here's how I managed risk as a TL:
Thanks, Alex. Can you please elaborate more on point 2 on how to use system design to identify risks or let me know in which video in the system design series you described this?
The main video to watch in the System Design series for this is "Part 7 - How Can Things REALLY Break?"
However, many other parts of the series are very relevant here too: