LitArk » Books » Computer

Arun C. Murthy - Apache Hadoop YARN: Moving beyond MapReduce and Batch Processing with Apache Hadoop 2

Here you can read online Arun C. Murthy - Apache Hadoop YARN: Moving beyond MapReduce and Batch Processing with Apache Hadoop 2 full text of the book (entire story) in english for free. Download pdf and epub, get meaning, cover and reviews about this ebook. year: 2014, publisher: Addison-Wesley Professional, genre: Computer. Description of the work, (preface) as well as reviews are available. Best literature library LitArk.com created for fans of good reading and offers a wide selection of genres:

Romance novel Science fiction Adventure Detective Science History Home and family Prose Art Politics Computer Non-fiction Religion Business Children Humor

Choose a favorite category and find really read worthwhile books. Enjoy immersion in the world of imagination, feel the emotions of the characters or learn something new for yourself, make an fascinating discovery.

Book:
Apache Hadoop YARN: Moving beyond MapReduce and Batch Processing with Apache Hadoop 2
Author:
Arun C Murthy / Vinod Kumar Vavilapalli / Doug Eadline / Joseph Niemiec / Jeff Markham
Publisher:
Addison-Wesley Professional
Genre:
Books / Computer
Year:
2014
Rating:
4 / 5
Favourites:
Add to favourites
Your mark:
- 80
- 1
- 2
- 3
- 4
- 5

Description
Author's other books
Similar books

Apache Hadoop YARN: Moving beyond MapReduce and Batch Processing with Apache Hadoop 2: summary, description and annotation

We offer to read an annotation, description, summary or preface (depends on what the author of the book "Apache Hadoop YARN: Moving beyond MapReduce and Batch Processing with Apache Hadoop 2" wrote himself). If you haven't found the necessary information about the book — write in the comments, we will try to find it.

This book is a critically needed resource for the newly released Apache Hadoop 2.0, highlighting YARN as the significant breakthrough that broadens Hadoop beyond the MapReduce paradigm. From the Foreword by Raymie Stata, CEO of Altiscale The Insiders Guide to Building Distributed, Big Data Applications with Apache Hadoop YARN Apache Hadoop is helping drive the Big Data revolution. Now, its data processing has been completely overhauled: Apache Hadoop YARN provides resource management at data center scale and easier ways to create distributed applications that process petabytes of data. And now in Apache Hadoop YARN, two Hadoop technical leaders show you how to develop new applications and adapt existing code to fully leverage these revolutionary advances. YARN project founder Arun Murthy and project lead Vinod Kumar Vavilapalli demonstrate how YARN increases scalability and cluster utilization, enables new programming models and services, and opens new options beyond Java and batch processing. They walk you through the entire YARN project lifecycle, from installation through deployment. Youll find many examples drawn from the authors cutting-edge experiencefirst as Hadoops earliest developers and implementers at Yahoo! and now as Hortonworks developers moving the platform forward and helping customers succeed with it.

Arun C. Murthy: author's other books

Who wrote Apache Hadoop YARN: Moving beyond MapReduce and Batch Processing with Apache Hadoop 2? Find out the surname, the name of the author of the book and a list of all author's works by series.

Apache Hadoop YARN: Moving beyond MapReduce and Batch Processing with Apache Hadoop 2 — read online for free the complete book (whole text) full work

Below is the text of the book, divided by pages. System saving the place of the last page read, allows you to conveniently read the book "Apache Hadoop YARN: Moving beyond MapReduce and Batch Processing with Apache Hadoop 2" online for free, without having to search again every time where you left off. Put a bookmark, and you can go to the page where you finished reading at any time.

Light

Font size:

↓

↑

Reset

Interval:

↓

↑

Bookmark:

Make

About This eBook

ePUB is an open, industry-standard format for eBooks. However, support of ePUB and its many features varies across reading devices and applications. Use your device or app settings to customize the presentation to your liking. Settings that you can customize often include font, font size, single or double column, landscape or portrait mode, and figures that you can click or tap to enlarge. For additional information about the settings and features on your reading device or app, visit the device manufacturers Web site.

Many titles include programming code or configuration examples. To optimize the presentation of these elements, view the eBook in single-column, landscape mode and adjust the font size to the smallest setting. In addition to presenting code and configurations in the reflowable text format, we have included images of the code that mimic the presentation found in the print book; therefore, where the reflowable format may compromise the presentation of the code listing, you will see a Click here to view code image link. Click the link to view the print-fidelity code image. To return to the previous page viewed, click the Back button on your device or app.

Apache Hadoop YARN

Moving beyond MapReduce and Batch Processing with Apache Hadoop 2

Arun C. Murthy
Vinod Kumar Vavilapalli

Doug Eadline
Joseph Niemiec
Jeff Markham

Apache Hadoop YARN Moving beyond MapReduce and Batch Processing with Apache Hadoop 2 - image 1

Upper Saddle River, NJ Boston Indianapolis San Francisco
New York Toronto Montreal London Munich Paris Madrid
Capetown Sydney Tokyo Singapore Mexico City

Many of the designations used by manufacturers and sellers to distinguish their products are claimed as trademarks. Where those designations appear in this book, and the publisher was aware of a trademark claim, the designations have been printed with initial capital letters or in all capitals.

The authors and publisher have taken care in the preparation of this book, but make no expressed or implied warranty of any kind and assume no responsibility for errors or omissions. No liability is assumed for incidental or consequential damages in connection with or arising out of the use of the information or programs contained herein.

For information about buying this title in bulk quantities, or for special sales opportunities (which may include electronic versions; custom cover designs; and content particular to your business, training goals, marketing focus, or branding interests), please contact our corporate sales department at or (800) 382-3419.

For government sales inquiries, please contact .

For questions about sales outside the United States, please contact .

Visit us on the Web: informit.com/aw

Library of Congress Cataloging-in-Publication Data

Murthy, Arun C.

Apache Hadoop YARN : moving beyond MapReduce and batch processing with Apache Hadoop 2
/ Arun C. Murthy, Vinod Kumar Vavilapalli, Doug Eadline, Joseph Niemiec, Jeff Markham.

pages cm

Includes index.

ISBN 978-0-321-93450-5 (pbk. : alk. paper)

1. Apache Hadoop. 2. Electronic data processingDistributed processing. I. Title.

QA76.9.D5M97 2014

004'.36dc23

2014003391

Apache, Apache Hadoop, Hadoop, and the Hadoop elephant logo are trademarks of The Apache Software Foundation. Used with permission. No endorsement by The Apache Software Foundation is implied by the use of these marks.

Hortonworks is a trademark of Hortonworks, Inc., registered in the U.S. and other countries.

All rights reserved. Printed in the United States of America. This publication is protected by copyright, and permission must be obtained from the publisher prior to any prohibited reproduction, storage in a retrieval system, or transmission in any form or by any means, electronic, mechanical, photocopying, recording, or likewise. To obtain permission to use material from this work, please submit a written request to Pearson Education, Inc., Permissions Department, One Lake Street, Upper Saddle River, New Jersey 07458, or you may fax your request to (201) 236-3290.

ISBN-13: 978-0-321-93450-5
ISBN-10: 0-321-93450-4
Text printed in the United States on recycled paper at RR Donnelley in Crawfordsville, Indiana.
First printing, March 2014

Foreword by Raymie Stata

William Gibson was fond of saying: The future is already hereits just not very evenly distributed. Those of us who have been in the web search industry have had the privilegeand the curseof living in the future of Big Data when it wasnt distributed at all. What did we learn? We learned to measure everything. We learned to experiment. We learned to mine signals out of unstructured data. We learned to drive business value through data science. And we learned that, to do these things, we needed a new data-processing platform fundamentally different from the business intelligence systems being developed at the time.

The future of Big Data is rapidly arriving for almost all industries. This is driven in part by widespread instrumentation of the physical worldvehicles, buildings, and even people are spitting out log streams not unlike the weblogs we know and love in cyberspace. Less obviously, digital recordssuch as digitized government records, digitized insurance policies, and digital medical recordsare creating a trove of information not unlike the webpages crawled and parsed by search engines. Its no surprise, then, that the tools and techniques pioneered first in the world of web search are finding currency in more and more industries. And the leading such tool, of course, is Apache Hadoop.

But Hadoop is close to ten years old. Computing infrastructure has advanced significantly in this decade. If Hadoop was to maintain its relevance in the modern Big Data world, it needed to advance as well. YARN represents just the advancement needed to keep Hadoop relevant.

As described in the historical overview provided in this book, for the majority of Hadoops existence, it supported a single computing paradigm: MapReduce. On the compute servers we had at the time, horizontal scalingthrowing more server nodes at a problemwas the only way the web search industry could hope to keep pace with the growth of the web. The MapReduce paradigm is particularly well suited for horizontal scaling, so it was the natural paradigm to keep investing in.

With faster networks, higher core counts, solid-state storage, and (especially) larger memories, new paradigms of parallel computing are becoming practical at large scales. YARN will allow Hadoop users to move beyond MapReduce and adopt these emerging paradigms. MapReduce will not go awayits a good fit for many problems, and it still scales better than anything else currently developed. But, increasingly, MapReduce will be just one tool in a much larger tool chesta tool chest named YARN.

In short, the era of Big Data is just starting. Thanks to YARN, Hadoop will continue to play a pivotal role in Big Data processing across all industries. Given this, I was pleased to learn that YARN project founder Arun Murthy and project lead Vinod Kumar Vavilapalli have teamed up with Doug Eadline, Joseph Niemiec, and Jeff Markham to write a volume sharing the history and goals of the YARN project, describing how to deploy and operate YARN, and providing a tutorial on how to get the most out of it at the application level.

This book is a critically needed resource for the newly released Apache Hadoop 2.0, highlighting YARN as the significant breakthrough that broadens Hadoop beyond the MapReduce paradigm.

Light

Font size:

↓

↑

Reset

Interval:

↓

↑

Bookmark:

Make

Similar books «Apache Hadoop YARN: Moving beyond MapReduce and Batch Processing with Apache Hadoop 2»

Look at similar books to Apache Hadoop YARN: Moving beyond MapReduce and Batch Processing with Apache Hadoop 2. We have selected literature similar in name and meaning in the hope of providing readers with more options to find new, interesting, not yet read works.

Roos

Hadoop for dummies: [Understand the value of big data and how Hadoop can help manage it ; navigate the Hadoop 2 ecosystem and create clusters ; use applications for data mining, problem-solving,

White

Hadoop

Parsian

Data algorithms recipes for scaling up with Hadoop and Spark

Achari

Hadoop essentials: delve into the key concepts of Hadoop and get a thorough understanding of the Hadoop ecosystem

Venkat Ankam

Big Data Analytics

Sam R. Alapati

Expert Hadoop Administration: Managing, Tuning, and Securing Spark, YARN, and HDFS

Grover Mark

Hadoop Application Architectures: Designing Real-World Big Data Applications

Mark Grover

Hadoop Application Architectures

Shiva Achari

Hadoop Essentials

Tom White

Hadoop: The Definitive Guide

Garry Turkington

Learning Hadoop 2

Dirk deRoos

Hadoop For Dummies

Reviews about «Apache Hadoop YARN: Moving beyond MapReduce and Batch Processing with Apache Hadoop 2»

Discussion, reviews of the book Apache Hadoop YARN: Moving beyond MapReduce and Batch Processing with Apache Hadoop 2 and just readers' own opinions. Leave your comments, write what you think about the work, its meaning or the main characters. Specify what exactly you liked and what you didn't like, and why you think so.