Reliable Machine Learning: Applying SRE Principles to ML in Production
- Length: 408 pages
- Edition: 1
- Language: English
- Publisher: O'Reilly Media
- Publication Date: 2022-11-01
- ISBN-10: 1098106229
- ISBN-13: 9781098106225
- Sales Rank: #373795
Whether you’re part of a small startup or a multinational corporation, this practical book shows data scientists, software and site reliability engineers, product managers, and business owners how to run ML reliably, effectively, and accountably within your organization. You’ll gain insight into everything from how to do model monitoring in production to how to run a well-tuned model development team in a product organization.
By applying an SRE mindset to machine learning, authors and engineering professionals Cathy Chen, Kranti Parisa, Niall Richard Murphy, D. Sculley, Todd Underwood, and featured guest authors show you how to run an efficient and reliable ML system. Whether you want to increase revenue, optimize decision making, solve problems, or understand and influence customer behavior, you’ll learn how to perform day-to-day ML tasks while keeping the bigger picture in mind.
You’ll examine:
- What ML is: how it functions and what it relies on
- Conceptual frameworks for understanding how ML “loops” work
- Effective “productionization,” and how to make it easy to monitor, deploy, and operate
- Why ML systems make production troubleshooting more difficult, and how to work around those difficulties
- How ML, product, and production teams can communicate effectively