Data Lakehouse in Action
Architecting a modern and scalable data analytics platform
Pradeep Menon
BIRMINGHAMMUMBAI
Data Lakehouse in Action
Copyright 2022 Packt Publishing
All rights reserved. No part of this book may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, without the prior written permission of the publisher, except in the case of brief quotations embedded in critical articles or reviews.
Every effort has been made in the preparation of this book to ensure the accuracy of the information presented. However, the information contained in this book is sold without warranty, either express or implied. Neither the author, nor Packt Publishing or its dealers and distributors, will be held liable for any damages caused or alleged to have been caused directly or indirectly by this book.
Packt Publishing has endeavored to provide trademark information about all of the companies and products mentioned in this book by the appropriate use of capitals. However, Packt Publishing cannot guarantee the accuracy of this information.
Publishing Product Manager: Sunith Shetty
Senior Editor: David Sugarman
Content Development Editor: Priyanka Soam
Technical Editor: Sonam Pandey
Copy Editor: Safis Editing
Project Coordinator: Aparna Ravikumar Nair
Proofreader: Safis Editing
Indexer: Sejal Dsilva
Production Designer: Joshua Misquitta
Marketing Coordinator: Abeer Riyaz Dawe
First published: March 2022
Production reference: 1070222
Published by Packt Publishing Ltd.
Livery Place
35 Livery Street
Birmingham
B3 2PB, UK.
ISBN 978-1-80181-593-2
Many people have contributed to the creation of this book. From its inception to its publishing, my mentors, friends, colleagues, and family have constantly motivated, guided, and supported me. Unfortunately, there is not enough space to thank all of them. However, I will make five key mentions that were absolutely pivotal for creating this book.
Firstly, I want to thank my parents, who have supported me through the thick and thin of life. Their upbringing ensured that I was capable enough to undertake the Herculean task of writing a book.
Secondly, I want to thank my wife, Archana, and my daughter, Anaisha. They have constantly supported me while writing this book. They ensured that the boat was afloat as I burnt the midnight oil.
Thirdly, I want to thank my colleague and an accomplished architect, Debananda Ghosh. His technical knowledge, understanding of the complex dynamics of data, and honest feedback helped me make manifold improvements to this book's contents.
Fourthly, I want to thank the Packt Publishing team: Sunith Shetty, Priyanka Soam, Aishwarya Mohan, and David Sugarman. This team is an author's dream open to ideas, dedicated, and diligent. I'm thankful for the fantastic support provided by the team that made the writing process an absolute pleasure.
And finally, I want to thank my best friend and beloved pet, Pablo (a beagle). Without him, I wouldn't have had a chance to complete any book. He has single-handedly made me disciplined in my approach to life. The dedication and focus required to complete a book are directly attributable to the discipline instilled in me by him.
Contributors
About the author
Pradeep Menon is a seasoned data analytics professional with more than 18 years of experience in data and AI.
Pradeep can balance business and technical aspects of any engagement and cross-pollinate complex concepts across many industries and scenarios.
Currently, Pradeep works as a data and AI strategist at Microsoft. In this role, he is responsible for driving big data and AI adoption for Microsoft's strategic customers across Asia.
Pradeep is also a distinguished speaker and blogger and has given numerous keynotes on cloud technologies, data, and AI.
About the reviewer
Debananda Ghosh is a senior specialist and global black belt (Cloud Analytics Asia) at Microsoft. He completed his bachelor's at Jadavpur University in B.Engineering, pursuing his postgraduate in data science and business analytics from McCombs School of Business at the University of Texas at Austin. He specializes in the fields of data and AI. His expertise includes data warehouses, DBA, data engineering, machine learning, data science product innovation, data and AI architecture and presales, and cloud analytics product sales. He has worked with customers in finance, manufacturing, utilities, telecoms, retail, e-commerce, and aviation. Currently working in the Microsoft Cloud Analytics product field, he helps industry partners achieve their digital transformation projects using advanced analytics and AI capabilities.