Machine learning design patterns : solutions to common challenges in data preparation, model building, and MLOps 🔍
Valliappa Lakshmanan; Sara Robinson; Michael Munn
O'Reilly Media, Inc, USA, O'Reilly Media, Sebastopol, CA, 2020
English [en] · PDF · 16.8MB · 2020 · 📘 Book (non-fiction) · 🚀/lgli/lgrs/nexusstc/zlib · Save
description
The design patterns in this book capture best practices and solutions to recurring problems in machine learning. The authors, three Google engineers, catalog proven methods to help data scientists tackle common problems throughout the ML process. These design patterns codify the experience of hundreds of experts into straightforward, approachable advice. In this book, you will find detailed explanations of 30 patterns for data and problem representation, operationalization, repeatability, reproducibility, flexibility, explainability, and fairness. Each pattern includes a description of the problem, a variety of potential solutions, and recommendations for choosing the best technique for your situation. You'll learn how to: Identify and mitigate common challenges when training, evaluating, and deploying ML models Represent data for different ML model types, including embeddings, feature crosses, and more Choose the right model type for specific problems Build a robust training loop that uses checkpoints, distribution strategy, and hyperparameter tuning Deploy scalable ML systems that you can retrain and update to reflect new data Interpret model predictions for stakeholders and ensure models are treating users fairly
Alternative filename
nexusstc/Machine Learning Design Patterns: Solutions to Common Challenges in Data Preparation, Model Building, and MLOps/c26f444336fe33aa6668bdebb2ac2178.pdf
Alternative filename
lgrsnf/Machine Learning Design Patterns 2021.pdf
Alternative filename
zlib/Computers/Computer Science/Valliappa Lakshmanan/Machine Learning Design Patterns: Solutions to Common Challenges in Data Preparation, Model Building, and MLOps_16984050.pdf
Alternative title
Машинное обучение. Паттерны проектирования: перевод с английского
Alternative author
Lakshmanan, Valliappa, Robinson, Sara, Munn, Michael
Alternative author
Валлиаппа Лакшманан, Сара Робинсон, Майкл Мунии
Alternative author
Лакшманан, Валлиаппа
Alternative publisher
O'Reilly Media, Incorporated
Alternative publisher
БХВ-Петербург
Alternative edition
O'Reilly, Санкт-Петербург, Russia, 2022
Alternative edition
United States, United States of America
Alternative edition
First edition, Sebastopol, CA, 2020
Alternative edition
1, 2020-11-03
Alternative edition
Beijing, 2020
Alternative edition
1, PT, 2020
metadata comments
lg3085467
metadata comments
{"isbns":["1098115783","9781098115784"],"last_page":400,"publisher":"O'Reilly Media, Inc, USA"}
metadata comments
Пер.: Lakshmanan, Valliappa Machine learning design patterns 978-1-098-11578-4
Предм. указ.: с. 433-444
Предм. указ.: с. 433-444
metadata comments
РГБ
metadata comments
Russian State Library [rgb] MARC:
=001 011020765
=005 20220404132846.0
=008 220321s2022\\\\ru\a\\\\\\\\\\000\|\rus|d
=017 \\ $a КН-П-22-019057 $b RuMoRKP
=020 \\ $a 978-5-9775-6797-8 (рус.) $c 1200 экз.
=040 \\ $a RuMoRKP $b rus $d RuMoRGB
=041 1\ $a rus $h eng
=044 \\ $a ru
=080 \\ $a 004.42'236 $2 4
=084 \\ $a 32.973 $2 rubbks
=084 \\ $a З973.236-01,07 $2 rubbk
=084 \\ $a З973.233-018-5-05,07 $2 rubbk
=100 1\ $a Лакшманан, Валлиаппа
=245 00 $a Машинное обучение. Паттерны проектирования : $b перевод с английского $c Валлиаппа Лакшманан, Сара Робинсон, Майкл Мунии
=246 20 $a Паттерны проектирования
=260 \\ $a Санкт-Петербург $b БХВ-Петербург $c 2022 $e Чехов, Московская область
=300 \\ $a 444 с. $b ил. $c 24 см
=336 \\ $a Текст (визуальный)
=337 \\ $a непосредственный
=490 0\ $a O'Reilly
=520 \\ $a Приводимые в книге паттерны проектирования отражают лучшие практические подходы к решению типичных задач машинного обучения. Указанные паттерны, реализованные в программном коде, сконцентрировали опыт сотен экспертов в простые и легкодоступные советы. Книга содержит подробный разбор 30 паттернов, служащих для представления данных и задач, тренировки моделей, отказоустойчивого обслуживания, обеспечения воспроизводимости и искусственного интеллекта. Каждый паттерн включает в себя постановку задачи, ряд потенциальных решений и рекомендации по выбору технического приема, наилучшим образом подходящего к данной ситуации. Для программистов в области машинного обучения
=534 \\ $p Пер.: $a Lakshmanan, Valliappa $t Machine learning design patterns $z 978-1-098-11578-4
=555 \\ $a Предм. указ.: с. 433-444
=650 \7 $a Компьютеризация обучения $2 RuMoRKP
=650 \7 $a Техника. Технические науки -- Энергетика. Радиоэлектроника -- Радиоэлектроника -- Вычислительная техника -- Вычислительные машины электронные цифровые -- Специализированные компьютеры и системы. Отдельные информационные технологии -- Образовательные системы -- Теория -- Пособие для специалиста $2 rubbk
=650 \7 $a Техника. Технические науки -- Энергетика. Радиоэлектроника -- Радиоэлектроника -- Вычислительная техника -- Вычислительные машины электронные цифровые -- Автоматическая обработка информации -- Программирование -- Автоматизация -- Пособие для специалиста $2 rubbk
=700 1\ $a Робинсон, Сара
=700 1\ $a Мунн, Майкл
=852 7\ $a РГБ $b CZ2 $h З813/Л19 $x 83
=852 \\ $a РГБ $b FB $j 2 22-16/295 $x 90
=001 011020765
=005 20220404132846.0
=008 220321s2022\\\\ru\a\\\\\\\\\\000\|\rus|d
=017 \\ $a КН-П-22-019057 $b RuMoRKP
=020 \\ $a 978-5-9775-6797-8 (рус.) $c 1200 экз.
=040 \\ $a RuMoRKP $b rus $d RuMoRGB
=041 1\ $a rus $h eng
=044 \\ $a ru
=080 \\ $a 004.42'236 $2 4
=084 \\ $a 32.973 $2 rubbks
=084 \\ $a З973.236-01,07 $2 rubbk
=084 \\ $a З973.233-018-5-05,07 $2 rubbk
=100 1\ $a Лакшманан, Валлиаппа
=245 00 $a Машинное обучение. Паттерны проектирования : $b перевод с английского $c Валлиаппа Лакшманан, Сара Робинсон, Майкл Мунии
=246 20 $a Паттерны проектирования
=260 \\ $a Санкт-Петербург $b БХВ-Петербург $c 2022 $e Чехов, Московская область
=300 \\ $a 444 с. $b ил. $c 24 см
=336 \\ $a Текст (визуальный)
=337 \\ $a непосредственный
=490 0\ $a O'Reilly
=520 \\ $a Приводимые в книге паттерны проектирования отражают лучшие практические подходы к решению типичных задач машинного обучения. Указанные паттерны, реализованные в программном коде, сконцентрировали опыт сотен экспертов в простые и легкодоступные советы. Книга содержит подробный разбор 30 паттернов, служащих для представления данных и задач, тренировки моделей, отказоустойчивого обслуживания, обеспечения воспроизводимости и искусственного интеллекта. Каждый паттерн включает в себя постановку задачи, ряд потенциальных решений и рекомендации по выбору технического приема, наилучшим образом подходящего к данной ситуации. Для программистов в области машинного обучения
=534 \\ $p Пер.: $a Lakshmanan, Valliappa $t Machine learning design patterns $z 978-1-098-11578-4
=555 \\ $a Предм. указ.: с. 433-444
=650 \7 $a Компьютеризация обучения $2 RuMoRKP
=650 \7 $a Техника. Технические науки -- Энергетика. Радиоэлектроника -- Радиоэлектроника -- Вычислительная техника -- Вычислительные машины электронные цифровые -- Специализированные компьютеры и системы. Отдельные информационные технологии -- Образовательные системы -- Теория -- Пособие для специалиста $2 rubbk
=650 \7 $a Техника. Технические науки -- Энергетика. Радиоэлектроника -- Радиоэлектроника -- Вычислительная техника -- Вычислительные машины электронные цифровые -- Автоматическая обработка информации -- Программирование -- Автоматизация -- Пособие для специалиста $2 rubbk
=700 1\ $a Робинсон, Сара
=700 1\ $a Мунн, Майкл
=852 7\ $a РГБ $b CZ2 $h З813/Л19 $x 83
=852 \\ $a РГБ $b FB $j 2 22-16/295 $x 90
Alternative description
Cover
Copyright
Table of Contents
Preface
Who Is This Book For?
What’s Not in the Book
Code Samples
Conventions Used in This Book
O’Reilly Online Learning
How to Contact Us
Acknowledgments
Chapter 1. The Need for Machine Learning Design Patterns
What Are Design Patterns?
How to Use This Book
Machine Learning Terminology
Models and Frameworks
Data and Feature Engineering
The Machine Learning Process
Data and Model Tooling
Roles
Common Challenges in Machine Learning
Data Quality
Reproducibility
Data Drift
Scale
Multiple Objectives
Summary
Chapter 2. Data Representation Design Patterns
Simple Data Representations
Numerical Inputs
Categorical Inputs
Design Pattern 1: Hashed Feature
Problem
Solution
Why It Works
Trade-Offs and Alternatives
Design Pattern 2: Embeddings
Problem
Solution
Why It Works
Trade-Offs and Alternatives
Design Pattern 3: Feature Cross
Problem
Solution
Why It Works
Trade-Offs and Alternatives
Design Pattern 4: Multimodal Input
Problem
Solution
Trade-Offs and Alternatives
Summary
Chapter 3. Problem Representation Design Patterns
Design Pattern 5: Reframing
Problem
Solution
Why It Works
Trade-Offs and Alternatives
Design Pattern 6: Multilabel
Problem
Solution
Trade-Offs and Alternatives
Design Pattern 7: Ensembles
Problem
Solution
Why It Works
Trade-Offs and Alternatives
Design Pattern 8: Cascade
Problem
Solution
Trade-Offs and Alternatives
Design Pattern 9: Neutral Class
Problem
Solution
Why It Works
Trade-Offs and Alternatives
Design Pattern 10: Rebalancing
Problem
Solution
Trade-Offs and Alternatives
Summary
Chapter 4. Model Training Patterns
Typical Training Loop
Stochastic Gradient Descent
Keras Training Loop
Training Design Patterns
Design Pattern 11: Useful Overfitting
Problem
Solution
Why It Works
Trade-Offs and Alternatives
Design Pattern 12: Checkpoints
Problem
Solution
Why It Works
Trade-Offs and Alternatives
Design Pattern 13: Transfer Learning
Problem
Solution
Why It Works
Trade-Offs and Alternatives
Design Pattern 14: Distribution Strategy
Problem
Solution
Why It Works
Trade-Offs and Alternatives
Design Pattern 15: Hyperparameter Tuning
Problem
Solution
Why It Works
Trade-Offs and Alternatives
Summary
Chapter 5. Design Patterns for Resilient Serving
Design Pattern 16: Stateless Serving Function
Problem
Solution
Why It Works
Trade-Offs and Alternatives
Design Pattern 17: Batch Serving
Problem
Solution
Why It Works
Trade-Offs and Alternatives
Design Pattern 18: Continued Model Evaluation
Problem
Solution
Why It Works
Trade-Offs and Alternatives
Design Pattern 19: Two-Phase Predictions
Problem
Solution
Trade-Offs and Alternatives
Design Pattern 20: Keyed Predictions
Problem
Solution
Trade-Offs and Alternatives
Summary
Chapter 6. Reproducibility Design Patterns
Design Pattern 21: Transform
Problem
Solution
Trade-Offs and Alternatives
Design Pattern 22: Repeatable Splitting
Problem
Solution
Trade-Offs and Alternatives
Design Pattern 23: Bridged Schema
Problem
Solution
Trade-Offs and Alternatives
Design Pattern 24: Windowed Inference
Problem
Solution
Trade-Offs and Alternatives
Design Pattern 25: Workflow Pipeline
Problem
Solution
Why It Works
Trade-Offs and Alternatives
Design Pattern 26: Feature Store
Problem
Solution
Why It Works
Trade-Offs and Alternatives
Design Pattern 27: Model Versioning
Problem
Solution
Trade-Offs and Alternatives
Summary
Chapter 7. Responsible AI
Design Pattern 28: Heuristic Benchmark
Problem
Solution
Trade-Offs and Alternatives
Design Pattern 29: Explainable Predictions
Problem
Solution
Trade-Offs and Alternatives
Design Pattern 30: Fairness Lens
Problem
Solution
Trade-Offs and Alternatives
Summary
Chapter 8. Connected Patterns
Patterns Reference
Pattern Interactions
Patterns Within ML Projects
ML Life Cycle
AI Readiness
Common Patterns by Use Case and Data Type
Natural Language Understanding
Computer Vision
Predictive Analytics
Recommendation Systems
Fraud and Anomaly Detection
Index
About the Authors
Colophon
Copyright
Table of Contents
Preface
Who Is This Book For?
What’s Not in the Book
Code Samples
Conventions Used in This Book
O’Reilly Online Learning
How to Contact Us
Acknowledgments
Chapter 1. The Need for Machine Learning Design Patterns
What Are Design Patterns?
How to Use This Book
Machine Learning Terminology
Models and Frameworks
Data and Feature Engineering
The Machine Learning Process
Data and Model Tooling
Roles
Common Challenges in Machine Learning
Data Quality
Reproducibility
Data Drift
Scale
Multiple Objectives
Summary
Chapter 2. Data Representation Design Patterns
Simple Data Representations
Numerical Inputs
Categorical Inputs
Design Pattern 1: Hashed Feature
Problem
Solution
Why It Works
Trade-Offs and Alternatives
Design Pattern 2: Embeddings
Problem
Solution
Why It Works
Trade-Offs and Alternatives
Design Pattern 3: Feature Cross
Problem
Solution
Why It Works
Trade-Offs and Alternatives
Design Pattern 4: Multimodal Input
Problem
Solution
Trade-Offs and Alternatives
Summary
Chapter 3. Problem Representation Design Patterns
Design Pattern 5: Reframing
Problem
Solution
Why It Works
Trade-Offs and Alternatives
Design Pattern 6: Multilabel
Problem
Solution
Trade-Offs and Alternatives
Design Pattern 7: Ensembles
Problem
Solution
Why It Works
Trade-Offs and Alternatives
Design Pattern 8: Cascade
Problem
Solution
Trade-Offs and Alternatives
Design Pattern 9: Neutral Class
Problem
Solution
Why It Works
Trade-Offs and Alternatives
Design Pattern 10: Rebalancing
Problem
Solution
Trade-Offs and Alternatives
Summary
Chapter 4. Model Training Patterns
Typical Training Loop
Stochastic Gradient Descent
Keras Training Loop
Training Design Patterns
Design Pattern 11: Useful Overfitting
Problem
Solution
Why It Works
Trade-Offs and Alternatives
Design Pattern 12: Checkpoints
Problem
Solution
Why It Works
Trade-Offs and Alternatives
Design Pattern 13: Transfer Learning
Problem
Solution
Why It Works
Trade-Offs and Alternatives
Design Pattern 14: Distribution Strategy
Problem
Solution
Why It Works
Trade-Offs and Alternatives
Design Pattern 15: Hyperparameter Tuning
Problem
Solution
Why It Works
Trade-Offs and Alternatives
Summary
Chapter 5. Design Patterns for Resilient Serving
Design Pattern 16: Stateless Serving Function
Problem
Solution
Why It Works
Trade-Offs and Alternatives
Design Pattern 17: Batch Serving
Problem
Solution
Why It Works
Trade-Offs and Alternatives
Design Pattern 18: Continued Model Evaluation
Problem
Solution
Why It Works
Trade-Offs and Alternatives
Design Pattern 19: Two-Phase Predictions
Problem
Solution
Trade-Offs and Alternatives
Design Pattern 20: Keyed Predictions
Problem
Solution
Trade-Offs and Alternatives
Summary
Chapter 6. Reproducibility Design Patterns
Design Pattern 21: Transform
Problem
Solution
Trade-Offs and Alternatives
Design Pattern 22: Repeatable Splitting
Problem
Solution
Trade-Offs and Alternatives
Design Pattern 23: Bridged Schema
Problem
Solution
Trade-Offs and Alternatives
Design Pattern 24: Windowed Inference
Problem
Solution
Trade-Offs and Alternatives
Design Pattern 25: Workflow Pipeline
Problem
Solution
Why It Works
Trade-Offs and Alternatives
Design Pattern 26: Feature Store
Problem
Solution
Why It Works
Trade-Offs and Alternatives
Design Pattern 27: Model Versioning
Problem
Solution
Trade-Offs and Alternatives
Summary
Chapter 7. Responsible AI
Design Pattern 28: Heuristic Benchmark
Problem
Solution
Trade-Offs and Alternatives
Design Pattern 29: Explainable Predictions
Problem
Solution
Trade-Offs and Alternatives
Design Pattern 30: Fairness Lens
Problem
Solution
Trade-Offs and Alternatives
Summary
Chapter 8. Connected Patterns
Patterns Reference
Pattern Interactions
Patterns Within ML Projects
ML Life Cycle
AI Readiness
Common Patterns by Use Case and Data Type
Natural Language Understanding
Computer Vision
Predictive Analytics
Recommendation Systems
Fraud and Anomaly Detection
Index
About the Authors
Colophon
Alternative description
The design patterns in this book capture best practices and solutions to recurring problems in machine learning. Authors Valliappa Lakshmanan, Sara Robinson, and Michael Munn catalog the first tried-and-proven methods to help engineers tackle problems that frequently crop up during the ML process. These design patterns codify the experience of hundreds of experts into advice you can easily follow.
The authors, three Google Cloud engineers, describe 30 patterns for data and problem representation, operationalization, repeatability, reproducibility, flexibility, explainability, and fairness. Each pattern includes a description of the problem, a variety of potential solutions, and recommendations for choosing the most appropriate remedy for your situation.
Youll learn how to:
Identify and mitigate common challenges when training, evaluating, and deploying ML models
Represent data for different ML model types, including embeddings, feature crosses, and more
Choose the right model type for specific problems
Build a robust training loop that uses checkpoints, distribution strategy, and hyperparameter tuning
Deploy scalable ML systems that you can retrain and update to reflect new data
Interpret model predictions for stakeholders and ensure that models are treating users fairly
The authors, three Google Cloud engineers, describe 30 patterns for data and problem representation, operationalization, repeatability, reproducibility, flexibility, explainability, and fairness. Each pattern includes a description of the problem, a variety of potential solutions, and recommendations for choosing the most appropriate remedy for your situation.
Youll learn how to:
Identify and mitigate common challenges when training, evaluating, and deploying ML models
Represent data for different ML model types, including embeddings, feature crosses, and more
Choose the right model type for specific problems
Build a robust training loop that uses checkpoints, distribution strategy, and hyperparameter tuning
Deploy scalable ML systems that you can retrain and update to reflect new data
Interpret model predictions for stakeholders and ensure that models are treating users fairly
Alternative description
The design patterns in this book capture best practices and solutions to recurring problems in machine learning. The authors, three Google engineers, catalog proven methods to help data scientists tackle common problems throughout the ML process. These design patterns codify the experience of hundreds of experts into straightforward, approachable advice. In this book, you will find detailed explanations of 30 patterns for data and problem representation, operationalization, repeatability, reproducibility, flexibility, explainability, and fairness. Each pattern includes a description of the problem, a variety of potential solutions, and recommendations for choosing the best technique for your situation.-- Source other than the Library of Congress
Alternative description
Приводимые в книге паттерны проектирования отражают лучшие практические подходы к решению типичных задач машинного обучения. Указанные паттерны, реализованные в программном коде, сконцентрировали опыт сотен экспертов в простые и легкодоступные советы. Книга содержит подробный разбор 30 паттернов, служащих для представления данных и задач, тренировки моделей, отказоустойчивого обслуживания, обеспечения воспроизводимости и искусственного интеллекта. Каждый паттерн включает в себя постановку задачи, ряд потенциальных решений и рекомендации по выбору технического приема, наилучшим образом подходящего к данной ситуации. Для программистов в области машинного обучения
date open sourced
2021-08-11
🚀 Fast downloads
Become a member to support the long-term preservation of books, papers, and more. To show our gratitude for your support, you get fast downloads. ❤️
- Fast Partner Server #1 (recommended)
- Fast Partner Server #2 (recommended)
- Fast Partner Server #3 (recommended)
- Fast Partner Server #4 (recommended)
- Fast Partner Server #5 (recommended)
- Fast Partner Server #6 (recommended)
- Fast Partner Server #7
- Fast Partner Server #8
- Fast Partner Server #9
- Fast Partner Server #10
- Fast Partner Server #11
🐢 Slow downloads
From trusted partners. More information in the FAQ. (might require browser verification — unlimited downloads!)
- Slow Partner Server #1 (slightly faster but with waitlist)
- Slow Partner Server #2 (slightly faster but with waitlist)
- Slow Partner Server #3 (slightly faster but with waitlist)
- Slow Partner Server #4 (slightly faster but with waitlist)
- Slow Partner Server #5 (no waitlist, but can be very slow)
- Slow Partner Server #6 (no waitlist, but can be very slow)
- Slow Partner Server #7 (no waitlist, but can be very slow)
- Slow Partner Server #8 (no waitlist, but can be very slow)
- After downloading: Open in our viewer
All download options have the same file, and should be safe to use. That said, always be cautious when downloading files from the internet, especially from sites external to Anna’s Archive. For example, be sure to keep your devices updated.
External downloads
-
For large files, we recommend using a download manager to prevent interruptions.
Recommended download managers: Motrix -
You will need an ebook or PDF reader to open the file, depending on the file format.
Recommended ebook readers: Anna’s Archive online viewer, ReadEra, and Calibre -
Use online tools to convert between formats.
Recommended conversion tools: CloudConvert and PrintFriendly -
You can send both PDF and EPUB files to your Kindle or Kobo eReader.
Recommended tools: Amazon‘s “Send to Kindle” and djazz‘s “Send to Kobo/Kindle” -
Support authors and libraries
✍️ If you like this and can afford it, consider buying the original, or supporting the authors directly.
📚 If this is available at your local library, consider borrowing it for free there.
Total downloads:
A “file MD5” is a hash that gets computed from the file contents, and is reasonably unique based on that content. All shadow libraries that we have indexed on here primarily use MD5s to identify files.
A file might appear in multiple shadow libraries. For information about the various datasets that we have compiled, see the Datasets page.
For information about this particular file, check out its JSON file. Live/debug JSON version. Live/debug page.