Vinicius Aquino do Vale - Data Processing and Modeling with Hadoop: Mastering Hadoop Ecosystem Including ETL, Data Vault, DMBok, GDPR, and Various Data-Centric Tools (English Edition)

  • Book:
    Data Processing and Modeling with Hadoop: Mastering Hadoop Ecosystem Including ETL, Data Vault, DMBok, GDPR, and Various Data-Centric Tools (English Edition)
  • Author:
    Vinicius Aquino do Vale
  • Publisher:
    BPB Publications
  • Year:
    2021
  • Rating:
    3 / 5

Understand data in a simple way using a data lake.

Key Features

In-depth practical demonstration of Hadoop/Yarn concepts with numerous examples.

Includes graphical illustrations and visual explanations for Hadoop commands and parameters.

Includes details of dimensional modeling and Data Vault modeling.

Includes details of how to create and define a structure for a data lake.

Description

The book Data Processing and Modeling with Hadoop explains how a distributed system works and its benefits in the big data era in a straightforward and clear manner. After reading the book, you will be able to plan and organize projects involving a massive amount of data.

The book describes the standards and technologies that aid in data management and compares them to other technology business standards. The reader receives practical guidance on how to separate data into zones, as well as how to develop a model that can aid in data evolution. It discusses security and the measures used to reduce security risks. Self-service analytics, Data Lake, Data Vault 2.0, and Data Mesh are also discussed in the book.

After reading this book, the reader will have a thorough understanding of how to structure a data lake, as well as the ability to plan, organize, and carry out the implementation of a data-driven business with full governance and security.

What you will learn

Learn the basic components of the Hadoop Ecosystem.

Understand the structure, files, and zones of a Data Lake.

Learn to implement the security part of the Hadoop Ecosystem.

Learn to work with Data Vault 2.0 modeling.

Learn to develop a strategy to define good governance.

Learn new tools to work with Data and Big Data.

Who this book is for

This book caters to big data developers, technical specialists, consultants, and students who want to build good proficiency in big data. Knowing basic SQL concepts, modeling, and development would be good, although not mandatory.

Table of Contents

1. Understanding the Current Moment

2. Defining the Zones

3. The Importance of Modeling

4. Massive Parallel Processing

5. Doing ETL/ELT

6. A Little Governance

7. Talking About Security

8. What Are the Next Steps?


Data Processing and Modeling with Hadoop

Mastering Hadoop Ecosystem Including ETL, Data Vault, DMBok, GDPR, and Various Data-Centric Tools

Vinicius Aquino do Vale

www.bpbonline.com

FIRST EDITION 2022

Copyright BPB Publications, India

ISBN: 978-93-91392-284

All Rights Reserved. No part of this publication may be reproduced, distributed or transmitted in any form or by any means, or stored in a database or retrieval system, without the prior written permission of the publisher, with the exception of the program listings, which may be entered, stored and executed in a computer system but cannot be reproduced by means of publication, photocopy, recording, or any electronic or mechanical means.

LIMITS OF LIABILITY AND DISCLAIMER OF WARRANTY

The information contained in this book is true and correct to the best of the author's and publisher's knowledge. The author has made every effort to ensure the accuracy of this publication, but the publisher cannot be held responsible for any loss or damage arising from any information in this book.

All trademarks referred to in the book are acknowledged as properties of their respective owners but BPB Publications cannot guarantee the accuracy of this information.

Dedicated to

Carlos Rogério do Vale and
Almeni Mendes Aquino do Vale
My parents, who offered me all the support I needed
to get here and be able to write this book.

About the Author

Vinicius Aquino do Vale is an experienced technical consultant who has worked with clients and partners for 15 years on the design of technological solutions. He has participated in large projects as a specialist in Big Data technologies, with advanced knowledge of the Hadoop ecosystem. He has worked on several Big Data projects at the largest companies in Brazil, assisting with architecture design, implementation, configuration, ingestion, analysis, and ETL. He took part in building complete data lake / smart data flows, integrating the entire system with analytics tools such as Qlik Sense, QlikView, Tableau, Metabase, and TIBCO Spotfire, and implementing security integration with AD / LDAP.

Vinicius has served as an MBA professor, teaching NoSQL, Data Ingestion, and Parallel Mass Processing classes, and has spoken at IT events for companies and communities. He is a PostgreSQL, MongoDB, and Cassandra database specialist, as well as a Linux server specialist (CentOS, Debian, RedHat, SUSE). He has dedicated years to his professional development, obtaining international certifications such as ITIL, LPIC (Linux), OCJP (Oracle Certified Professional, Java SE 6 Programmer), OCE-WCD (Oracle Certified Expert, Java EE 6 Web Component Developer), OCE-JPAD (Oracle Certified Expert, Java EE 6 Java Persistence API Developer), OCE-EJB (Oracle Certified Expert, Java EE 6 Enterprise JavaBeans Developer), Hadoop Administrator (Cloudera), and PostgreSQL (EnterpriseDB). He has extensive experience in Java development for the Web, working with several technologies and frameworks, and can lead and coordinate agile projects using Scrum and Kanban. In addition, Vinicius has command of public clouds such as Google Cloud (GCP), Azure, and AWS.

In 2014, Vinicius founded his own education company, Sudoers, where he is the Founder and Professor of Technology, helping, training and mentoring young people to pursue careers in technology.

About the Reviewer

Utkarsh Singhal has 10+ years of experience in software development. He has extensive knowledge of various big data tools such as Hadoop, Spark, Pig, Hive, and Sqoop, along with programming languages such as Java and Python. Utkarsh has a zeal for learning new technologies; he has earned certifications in various cloud-based technologies such as Oracle and Azure, and in other frameworks such as Neo4j and Trifacta. Utkarsh earned a B.Tech in Computer Science from Rajasthan Technical University, Rajasthan. He is currently working as a Technical BA at Bank of America, Gurugram.

Acknowledgement

First, I thank God for allowing me to write this book and to pass this knowledge on. I'm Vinicius Aquino do Vale (https://www.linkedin.com/in/aquinovale/), Brazilian, married to Carla Malu Pereira de Morais, and a specialist in Big Data. I have been working with Hadoop and Big Data since 2015, helping large companies become data driven. I also work as a teacher, instructor, and lecturer, bringing knowledge in a simple, clear, and objective way to professionals in the technology area.

Finally, I would like to thank BPB Publications for giving me this opportunity to write my first book for them.

Preface

Hello reader, I am very grateful that you are reading this book, which aims to help managers and technology professionals understand more deeply how to process and model data with Hadoop in order to find value in data.

The book aims to guide professionals and managers in using Hadoop in the best possible way, and is organized into 8 chapters.

In Chapter 1, we talk about the reasons for using Hadoop and how the new data age came to change the world.

In Chapter 2, we talk about zones and their separation, explaining how to transform data into information.

In Chapter 3, we talk about modeling and the most common types of modeling.

In Chapter 4, we talk about distributed processing and its characteristics, covering the main tools.

In Chapter 5, we talk about how ETL/ELT works in Big Data environments.

In Chapter 6, we talk about the importance of governance in the world of data.

In Chapter 7, we talk a little about security and how it impacts day-to-day business.

In Chapter 8, together with another great Big Data expert, we talk about new trends involving data and much more.

Downloading the coloured images:

Please follow the link to download the Coloured Images of the book:

https://rebrand.ly/0310eb

Errata

We take immense pride in our work at BPB Publications and follow best practices to ensure the accuracy of our content and to provide an engaging reading experience to our subscribers. Our readers are our mirrors, and we use their input to reflect on and correct any human errors that may have occurred during the publishing process. To help us maintain quality and reach out to any readers who might be having difficulties due to unforeseen errors, please write to us at:
