Praise for The Enterprise Data Catalog
This book is a much-needed and refreshing addition to the data catalog landscape. Ole masterfully combines industry and practical experience with information and library science concepts to provide data catalog implementers with essential techniques for delivering a superior data discovery experience for their organization.
Juan Sequeda, Ph.D., Principal scientist and head of AI Lab, data.world
Ole Olesen-Bagneux's book helps enterprise organizations navigate the complex data landscape with higher precision than ever before. It highlights the tremendous opportunity we have to harvest the full value of investing in data through an enterprise data search engine. Ole provides the magical missing piece that enables data-driven organizations to reach their full potential. This book is a must-read for all IT professionals, data authorities, and data enthusiasts.
Ann Fogelgren, Chief Information Officer, GN Group
For too long the information science and data management communities have been far apart. Dr. Olesen-Bagneux's groundbreaking work clearly demonstrates the vital necessity of bringing these communities together toward realizing the full potential of data assets. The fresh perspectives developed in his book show us the way forward for innovation both in practice and in the study of data systems that reflect the human context of data work.
George Fletcher, Professor in the Data and Artificial Intelligence Cluster, Eindhoven University of Technology
Information science techniques have always been used to retrieve information that meets people's informational needs. What if we applied these techniques to data catalogs to discover strategic data for an organization? In this book, Ole wisely demonstrates chapter by chapter how this is possible.
Fabiola Aparecida Vizentim, Librarian, ontologist, data architect, IA Biblio BR Group
Making data accessible is a challenge for most data-driven enterprises. In his book, Ole addresses a key component: the search for data. I find it especially inspiring and shocking how easily we can use thousands of years of knowledge from librarians and apply it to our modern data and metadata problems, including difficult topics such as data lineage. Impressive book.
Tomas Kratky, Founder and CEO, Manta Software
I highly recommend The Enterprise Data Catalog for anyone who wants to improve their data management skills and make better data-driven decisions. It's a comprehensive guide that will help you understand and implement best practices for data cataloging, discovery, and governance. Ole has done an outstanding job.
Ravit Jain, Founder and host of The Ravit Show, data science evangelist
By applying library and information science principles to an area largely driven by software engineering disciplines, this book offers fresh perspectives and directly applicable advice to data novices and veterans alike.
Nikolaj Agerbo Sabinsky, Principal consultant, Sabinsky Consult
A foundational and comprehensive book that will benefit practitioners and strategists alike. Theory and methodology from library and information science is used to understand both the problems faced and the solutions to be applied, illuminating in an engaging way the tasks of organization, curation, discovery, and management of data to achieve organizational success. Data catalogs are poised to become the essential enterprise application, and this book is likely to become an essential guidebook to implementing and ensuring their success.
Deb Seys, Senior director, Learning and Communities, Alation
Brilliant introduction to data catalogs. Well-written and sharp, this book presents both conceptual background and practical tools for how to develop and implement enterprise data catalogs.
Jens-Erik Mai, Professor of Information, University of Copenhagen
Olesen-Bagneux's ideas and use of techniques from the world of library science bring data catalogs to life with improved access to information, a better user experience, and stronger collaboration.
Mark McLellan, Product strategy, Rowbot.io
Ole Olesen-Bagneux offers a unique point of view about data catalogs. Lucid and full of insights, his book is destined to become the definitive guide to data catalog evaluation and implementation.
Jeffrey Tyzzer, Senior solution architect, Starburst
The Enterprise Data Catalog provides data practitioners and data curators with the critical skills they need to organize and search for data in meaningful ways. Ole makes clear the foundational role of metadata in information discovery, and in the process highlights the importance of information science professionals in modern data environments.
Susannah Barnes, Data intelligence program lead, Alation
With more and more technologies promising to solve the data problem, The Enterprise Data Catalog provides a fresh, much-needed, vendor-agnostic explanation of why and how data catalogs must work. In simple, non-salesy terms, this book shows that simply cataloging data in the right way can restore our collective ability to connect and create shared understanding across enterprises. A great guide for anyone trying to tame the enterprise data wilderness and unlock its potential.
Mark D. Kitson, Data strategy and management consultant, independent, Global Fortune 500 firms
The Enterprise Data Catalog
by Ole Olesen-Bagneux
Copyright © 2023 Ole Olesen-Bagneux. All rights reserved.
Printed in the United States of America.
Published by O'Reilly Media, Inc., 1005 Gravenstein Highway North, Sebastopol, CA 95472.
O'Reilly books may be purchased for educational, business, or sales promotional use. Online editions are also available for most titles (https://oreilly.com). For more information, contact our corporate/institutional sales department: 800-998-9938 or corporate@oreilly.com.
- Acquisitions Editors: Aaron Black & Jess Haberman
- Development Editor: Rita Fernando
- Production Editors: Clare Laylock & Katherine Tozer
- Copyeditor: nSight, Inc.
- Proofreader: Tim Stewart
- Indexer: Judith McConville
- Interior Designer: David Futato
- Cover Designer: Karen Montgomery
- Illustrator: Kate Dullea
- February 2023: First Edition
Revision History for the First Edition
- 2023-02-15: First Release
See https://oreilly.com/catalog/errata.csp?isbn=9781492098713 for release details.
The O'Reilly logo is a registered trademark of O'Reilly Media, Inc. The Enterprise Data Catalog, the cover image, and related trade dress are trademarks of O'Reilly Media, Inc.
The views expressed in this work are those of the author and do not represent the publisher's views. While the publisher and the author have used good faith efforts to ensure that the information and instructions contained in this work are accurate, the publisher and the author disclaim all responsibility for errors or omissions, including without limitation responsibility for damages resulting from the use of or reliance on this work. Use of the information and instructions contained in this work is at your own risk. If any code samples or other technology this work contains or describes is subject to open source licenses or the intellectual property rights of others, it is your responsibility to ensure that your use thereof complies with such licenses and/or rights.
This work is part of a collaboration between O'Reilly and Alation. See our statement of editorial independence.
978-1-492-09871-3
[LSI]
Foreword
When I began focusing on data cataloging in the mid-2010s, the world of data analytics had reached an inflection point. The great modern data infrastructure projects, centered on