• Complain

Shumin Guo - Hadoop Operations and Cluster Management Cookbook

Here you can read online Shumin Guo - Hadoop Operations and Cluster Management Cookbook full text of the book (entire story) in english for free. Download pdf and epub, get meaning, cover and reviews about this ebook. year: 2013, publisher: Packt Publishing, genre: Home and family. Description of the work, (preface) as well as reviews are available. Best literature library LitArk.com created for fans of good reading and offers a wide selection of genres:

Romance novel Science fiction Adventure Detective Science History Home and family Prose Art Politics Computer Non-fiction Religion Business Children Humor

Choose a favorite category and find really read worthwhile books. Enjoy immersion in the world of imagination, feel the emotions of the characters or learn something new for yourself, make an fascinating discovery.

Shumin Guo Hadoop Operations and Cluster Management Cookbook
  • Book:
    Hadoop Operations and Cluster Management Cookbook
  • Author:
  • Publisher:
    Packt Publishing
  • Genre:
  • Year:
    2013
  • Rating:
    4 / 5
  • Favourites:
    Add to favourites
  • Your mark:
    • 80
    • 1
    • 2
    • 3
    • 4
    • 5

Hadoop Operations and Cluster Management Cookbook: summary, description and annotation

We offer to read an annotation, description, summary or preface (depends on what the author of the book "Hadoop Operations and Cluster Management Cookbook" wrote himself). If you haven't found the necessary information about the book — write in the comments, we will try to find it.

Over 60 recipes showing you how to design, configure, manage, monitor, and tune a Hadoop cluster

Overview

  • Hands-on recipes to configure a Hadoop cluster from bare metal hardware nodes
  • Practical and in depth explanation of cluster management commands
  • Easy-to-understand recipes for securing and monitoring a Hadoop cluster, and design considerations
  • Recipes showing you how to tune the performance of a Hadoop cluster
  • Learn how to build a Hadoop cluster in the cloud

In Detail

We are facing an avalanche of data. The unstructured data we gather can contain many insights that could hold the key to business success or failure. Harnessing the ability to analyze and process this data with Hadoop is one of the most highly sought after skills in todays job market. Hadoop, by combining the computing and storage powers of a large number of commodity machines, solves this problem in an elegant way!

Hadoop Operations and Cluster Management Cookbook is a practical and hands-on guide for designing and managing a Hadoop cluster. It will help you understand how Hadoop works and guide you through cluster management tasks.

This book explains real-world, big data problems and the features of Hadoop that enables it to handle such problems. It breaks down the mystery of a Hadoop cluster and will guide you through a number of clear, practical recipes that will help you to manage a Hadoop cluster.

We will start by installing and configuring a Hadoop cluster, while explaining hardware selection and networking considerations. We will also cover the topic of securing a Hadoop cluster with Kerberos, configuring cluster high availability and monitoring a cluster. And if you want to know how to build a Hadoop cluster on the Amazon EC2 cloud, then this is a book for you.

What you will learn from this book

  • Defining your big data problem
  • Designing and configuring a pseudo-distributed Hadoop cluster
  • Configuring a fully distributed Hadoop cluster and tuning your Hadoop cluster for better performance
  • Managing the DFS and MapReduce cluster
  • Configuring Hadoop logging, auditing, and job scheduling
  • Hardening the Hadoop cluster with security and access control methods
  • Monitoring a Hadoop cluster with tools such as Chukwa, Ganglia, Nagio, and Ambari
  • Setting up a Hadoop cluster on the Amazon cloud

Approach

Solve specific problems using individual self-contained code recipes, or work through the book to develop your capabilities. This book is packed with easy-to-follow code and commands used for illustration, which makes your learning curve easy and quick.

Who this book is written for

If you are a Hadoop cluster system administrator with Unix/Linux system management experience and you are looking to get a good grounding in how to set up and manage a Hadoop cluster, then this book is for you. Its assumed that you will have some experience in Unix/Linux command line already, as well as being familiar with network communication basics.

Shumin Guo: author's other books


Who wrote Hadoop Operations and Cluster Management Cookbook? Find out the surname, the name of the author of the book and a list of all author's works by series.

Hadoop Operations and Cluster Management Cookbook — read online for free the complete book (whole text) full work

Below is the text of the book, divided by pages. System saving the place of the last page read, allows you to conveniently read the book "Hadoop Operations and Cluster Management Cookbook" online for free, without having to search again every time where you left off. Put a bookmark, and you can go to the page where you finished reading at any time.

Light

Font size:

Reset

Interval:

Bookmark:

Make
Hadoop Operations and Cluster Management Cookbook

Hadoop Operations and Cluster Management Cookbook

Copyright 2013 Packt Publishing

All rights reserved. No part of this book may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, without the prior written permission of the publisher, except in the case of brief quotations embedded in critical articles or reviews.

Every effort has been made in the preparation of this book to ensure the accuracy of the information presented. However, the information contained in this book is sold without warranty, either express or implied. Neither the author, nor Packt Publishing, and its dealers and distributors will be held liable for any damages caused or alleged to be caused directly or indirectly by this book.

Packt Publishing has endeavored to provide trademark information about all of the companies and products mentioned in this book by the appropriate use of capitals. However, Packt Publishing cannot guarantee the accuracy of this information.

First published: July 2013

Production Reference: 1170713

Published by Packt Publishing Ltd.

Livery Place

35 Livery Street

Birmingham B3 2PB, UK.

ISBN 978-1-78216-516-3

www.packtpub.com

Cover Image by Girish Suryavanshi (<>)

Credits

Author

Shumin Guo

Reviewers

Hector Cuesta-Arvizu

Mark Kerzner

Harvinder Singh Saluja

Acquisition Editor

Kartikey Pandey

Lead Technical Editor

Madhuja Chaudhari

Technical Editors

Sharvari Baet

Jalasha D'costa

Veena Pagare

Amit Ramadas

Project Coordinator

Anurag Banerjee

Proofreader

Lauren Tobon

Indexer

Hemangini Bari

Graphics

Abhinash Sahu

Production Coordinator

Nitesh Thakur

Cover Work

Nitesh Thakur

About the Author

Shumin Guo is a PhD student of Computer Science at Wright State University in Dayton, OH. His research fields include Cloud Computing and Social Computing. He is enthusiastic about open source technologies and has been working as a System Administrator, Programmer, and Researcher at State Street Corp. and LexisNexis.

I would like to sincerely thank my wife, Min Han, for her support both technically and mentally. This book would not have been possible without encouragement from her.

About the Reviewers

Hector Cuesta-Arvizu provides consulting services for software engineering and data analysis with over eight years of experience in a variety of industries, including financial services, social networking, e-learning, and Human Resources.

Hector holds a BA in Informatics and an MSc in Computer Science. His main research interests lie in Machine Learning, High Performance Computing, Big Data, Computational Epidemiology, and Data Visualization. He has also helped in the technical review of the book Raspberry Pi Networking Cookbook by Rick Golden , Packt Publishing . He has published 12 scientific papers in International Conferences and Journals. He is an enthusiast of Lego Robotics and Raspberry Pi in his spare time.

You can follow him on Twitter at https://twitter.com/hmCuesta.

Mark Kerzner holds degrees in Law, Math, and Computer Science. He has been designing software for many years and Hadoop-based systems since 2008. He is the President of SHMsoft, a provider of Hadoop applications for various verticals, and a co-author of the book/project Hadoop Illuminated . He has authored and co-authored books and patents.

I would like to acknowledge the help of my colleagues, in particular Sujee Maniyam, and last but not least, my multitalented family.

Harvinder Singh Saluja has over 20 years of software architecture and development experience, and is the co-founder of MindTelligent, Inc. He works as Oracle SOA, Fusion MiddleWare, and Oracle Identity and Access Manager, and Oracle Big Data Specialist and Chief Integration Specialist at MindTelligent, Inc. Harvinder's strengths include his experience with strategy, concepts, and logical and physical architecture and development using Java/JEE/ADF/SEAM, SOA/AIA/OSB/OSR/OER, and OIM/OAM technologies.

He leads and manages MindTelligent's onshore and offshore and Oracle SOA/OSB/AIA/OSB/OER/OIM/OAM engagements. His specialty includes the AIA Foundation Pack development of custom PIPS for Utilities, Healthcare, and Energy verticals. His integration engagements include CC&B (Oracle Utilities Customer Care and Billing), Oracle Enterprise Taxation and Policy, Oracle Utilities Mobile Workforce Management, Oracle Utilities Meter Data Management, Oracle eBusiness Suite, Siebel CRM, and Oracle B2B for EDI X12 and EDIFACT.

His strengths include enterprise-wide security using Oracle Identity and Access Management, OID/OVD/ODSM/OWSM, including provisioning, workflows, reconciliation, single sign-on, SPML API, Connector API, and Web Services message and transport security using OWSM and Java cryptography.

He was awarded JDeveloper Java Extensions Developer of the Year award in 2003 by Oracle magazine.

www.PacktPub.com
Support files, eBooks, discount offers and more

You might want to visit www.PacktPub.com for support files and downloads related to your book.

Did you know that Packt offers eBook versions of every book published, with PDF and ePub files available? You can upgrade to the eBook version at > for more details.

At www.PacktPub.com, you can also read a collection of free technical articles, sign up for a range of free newsletters and receive exclusive discounts and offers on Packt books and eBooks.

httpPacktLibPacktPubcom Do you need instant solutions to your IT - photo 1

http://PacktLib.PacktPub.com

Do you need instant solutions to your IT questions? PacktLib is Packt's online digital book library. Here, you can access, read and search across Packt's entire library of books.

Why Subscribe?
  • Fully searchable across every book published by Packt
  • Copy and paste, print and bookmark content
  • On demand and accessible via web browser
Free Access for Packt account holders

If you have an account with Packt at www.PacktPub.com, you can use this to access PacktLib today and view nine entirely free books. Simply use your login credentials for immediate access.

Preface

Today, many organizations are facing the Big Data problem. Managing and processing Big Data can incur a lot of challenges for traditional data processing platforms such as relational database systems. Hadoop was designed to be a distributed and scalable system for dealing with Big Data problems. A Hadoop-based Big Data platform uses Hadoop as the data storage and processing engine. It deals with the problem by transforming the Big Data input into expected output.

Hadoop Operations and Cluster Management Cookbook provides examples and step-by-step recipes for you to administrate a Hadoop cluster. It covers a wide range of topics for designing, configuring, managing, and monitoring a Hadoop cluster. The goal of this book is to help you manage a Hadoop cluster more efficiently and in a more systematic way.

Next page
Light

Font size:

Reset

Interval:

Bookmark:

Make

Similar books «Hadoop Operations and Cluster Management Cookbook»

Look at similar books to Hadoop Operations and Cluster Management Cookbook. We have selected literature similar in name and meaning in the hope of providing readers with more options to find new, interesting, not yet read works.


Reviews about «Hadoop Operations and Cluster Management Cookbook»

Discussion, reviews of the book Hadoop Operations and Cluster Management Cookbook and just readers' own opinions. Leave your comments, write what you think about the work, its meaning or the main characters. Specify what exactly you liked and what you didn't like, and why you think so.