Praise for Kubeflow Operations Guide
This book is the go-to resource for enterprise deployment of Kubeflow from on-premise to the cloud. It will take you through how to think about Kubeflow on an operational level and then through the ways a team needs to think about integrating with their infrastructure for resources such as GPUs and Identity management.
Jeremy Lewi, Cofounder of Kubeflow, Principal Software Engineer, Primer
Patterson, Katzenellenbogen, and Harris have pulled together a terrific book that describes not just the components of setting up a production-ready Kubeflow deployment, but the tactical steps necessary to do so on-premises or on any of the hyperscale clouds. This is an essential book for understanding how to bring Kubeflow from experimentation to enterprise-ready.
David Aronchick, Cofounder of Kubeflow
A concise guide that covers planning, installing, and managing ML infrastructure across on-premises and cloud. This book provides a sorely needed step-by-step tutorial for using Kubeflow to support notebooks and autoscaled ML pipelines across hybrid cloud setups.
Lak Lakshmanan, Director of Analytics and AI Solutions, Google Cloud
Kubeflow Operations is a great resource that dives deep into the operational aspects of running real-world Kubeflow and Kubernetes clusters. This book also includes best practices for managing Kubernetes security, multitenancy, traffic routing, service mesh, GPUs, autoscaling, and capacity planning.
Chris Fregly, Developer Advocate, AI and Machine Learning at AWS
The Josh Patterson, Michael Katzenellenbogen, and Austin Harris book on Kubeflow should be a valuable roadmap for any data engineer or data scientist who is trying to build a modern data driven system. TBs/sec data streams, and online complex DL/ML-based decision models are becoming mainstream. With the availability of 400 Gb/s NDR Infiniband networking and PFLOPS CPU/GPU processing power on a single chip the role of the data scientist is often reduced to assembling available tools and monitoring the whole process rather than actively analyzing data and/or developing models. Data is driving both the feature generation and the model building. This is what this book is about.
Alex Kozlov, Ph.D., Senior Data Scientist, NVIDIA
Josh Patterson is a skilled practitioner who has helped many companies deploy and use Kubeflow successfully. He has also been deeply involved in the Kubeflow community for several years, giving him in-depth knowledge of the topic and a unique perspective not present in other Kubeflow guides. It is my pleasure to recommend this book.
Hamel Husain, Staff Machine Learning Engineer, GitHub
Kubeflow is a great way to consistently manage MLOps workflows across many clouds (including on-premises). Setting up and managing a hybrid Kubeflow is nontrivial and the authors do a great job at demystifying the whole process of explaining practical issues faced by MLOps engineers, starting from the guts of Kubeflow to deployment and operating in different clouds. This book fills a gap in the MLOps space very nicely and is highly recommended for both MLOps as well as the data scientist persona.
Debo Dutta, Distinguished Engineer, Cisco
Kubeflow is quickly emerging as the open-source MLOps platform of choice in enterprise IT, and this book masterfully covers the ins and outs of Kubeflow operations. It should be required reading for all MLOps engineers.
Mike Oglesby, MLOps Engineer, NetApp
Kubeflow is a favored development platform to simplify building and deploying AI capabilities into modern applications that utilize Kubernetes to scale and evolve efficiently. The Kubeflow Operations Guide provides valuable insights for planning, implementing, and operating Kubeflow.
Zeki Yasar, Principal Solutions Architect, ePlus Technology, Inc.
This book provides an exceptional deep dive into the operation of Kubeflow on-premise or via cloud providers. Kubeflow is a vital project in the machine learning engineering ecosystem and this publication provides a missing puzzle piece in the ecosystem: an excellent guide on how to set up and operate your machine learning engineering stack with Kubeflow or how to deploy machine learning models with KFServing effectively. I see this book as the go-to reference for machine learning or DevOps engineers wanting to understand a production Kubeflow setup. I wish the book would have been around when I set up my first clusters running Kubeflow; it would have saved me hours.
Hannes Hapke, Senior Machine Learning Engineer at SAP Concur
This book helped me to fully get my head around all the different parts of the Kubeflow system and understand what role Kubeflow plays in helping build a more reliable and reproducible data science deployment pipeline. From security to Jupyter implementation and on to deployment, this book was the guide that helped me see how the pieces fit together.
JD Long, RenaissanceRe
This book is a must-read guide for any DevOps team considering standardizing model deployments. Learn from the best and understand how machine learning works.
Axel Damian Sirota, Machine Learning Research Engineer
Kubeflow Operations Guide
by Josh Patterson , Michael Katzenellenbogen , and Austin Harris
Copyright 2021 Josh Patterson, Michael Katzenellenbogen, and Austin Harris. All rights reserved.
Printed in the United States of America.
Published by OReilly Media, Inc. , 1005 Gravenstein Highway North, Sebastopol, CA 95472.
OReilly books may be purchased for educational, business, or sales promotional use. Online editions are also available for most titles (http://oreilly.com). For more information, contact our corporate/institutional sales department: 800-998-9938 or corporate@oreilly.com .
- Acquisitions Editor: Jonathan Hassell
- Development Editor: Michele Cronin
- Production Editor: Deborah Baker
- Copyeditor: Piper Editorial, LLC
- Proofreader: Sonia Saruba
- Indexer: Potomac Indexing, LLC
- Interior Designer: David Futato
- Cover Designer: Karen Montgomery
- Illustrator: Kate Dullea
- December 2020: First Edition
Revision History for the First Edition
- 2020-12-04: First Release
See http://oreilly.com/catalog/errata.csp?isbn=9781492053279 for release details.
The OReilly logo is a registered trademark of OReilly Media, Inc. Kubeflow Operations Guide, the cover image, and related trade dress are trademarks of OReilly Media, Inc.
The views expressed in this work are those of the authors, and do not represent the publishers views. While the publisher and the authors have used good faith efforts to ensure that the information and instructions contained in this work are accurate, the publisher and the authors disclaim all responsibility for errors or omissions, including without limitation responsibility for damages resulting from the use of or reliance on this work. Use of the information and instructions contained in this work is at your own risk. If any code samples or other technology this work contains or describes is subject to open source licenses or the intellectual property rights of others, it is your responsibility to ensure that your use thereof complies with such licenses and/or rights.
978-1-492-05327-9
[LSI]
Dedication
For my sons Ethan, Griffin, and Dane: Go forth, be bold, be persistent.
J. Patterson
Preface
What Is in This Book?
This book focuses on the DevOps and MLOps sides of deploying and operating Kubeflow. The authors feel that this is compelling and relevant content for todays practicing DevOps/MLOps teams as this sector is still changing. Many machine learning platforms today take different approaches to the architecture and solution space of managing machine learning workflows. The difficulty of considering all aspects of operating a machine learning platform is where this story kicks off in : Where are we today and what do we need to be thinking about from ground zero for machine learning platforms?