• Complain

Shrey Mehrotra - Apache Spark Quick Start Guide: Quickly learn the art of writing efficient big data applications with Apache Spark

Here you can read online Shrey Mehrotra - Apache Spark Quick Start Guide: Quickly learn the art of writing efficient big data applications with Apache Spark full text of the book (entire story) in english for free. Download pdf and epub, get meaning, cover and reviews about this ebook. year: 2019, publisher: Packt Publishing, genre: Home and family. Description of the work, (preface) as well as reviews are available. Best literature library LitArk.com created for fans of good reading and offers a wide selection of genres:

Romance novel Science fiction Adventure Detective Science History Home and family Prose Art Politics Computer Non-fiction Religion Business Children Humor

Choose a favorite category and find really read worthwhile books. Enjoy immersion in the world of imagination, feel the emotions of the characters or learn something new for yourself, make an fascinating discovery.

No cover
  • Book:
    Apache Spark Quick Start Guide: Quickly learn the art of writing efficient big data applications with Apache Spark
  • Author:
  • Publisher:
    Packt Publishing
  • Genre:
  • Year:
    2019
  • Rating:
    4 / 5
  • Favourites:
    Add to favourites
  • Your mark:
    • 80
    • 1
    • 2
    • 3
    • 4
    • 5

Apache Spark Quick Start Guide: Quickly learn the art of writing efficient big data applications with Apache Spark: summary, description and annotation

We offer to read an annotation, description, summary or preface (depends on what the author of the book "Apache Spark Quick Start Guide: Quickly learn the art of writing efficient big data applications with Apache Spark" wrote himself). If you haven't found the necessary information about the book — write in the comments, we will try to find it.

A practical guide for solving complex data processing challenges by applying the best optimizations techniques in Apache Spark.

Key Features
  • Learn about the core concepts and the latest developments in Apache Spark
  • Master writing efficient big data applications with Spark's built-in modules for SQL, Streaming, Machine Learning and Graph analysis
  • Get introduced to a variety of optimizations based on the actual experience
  • Book Description

    Apache Spark is a flexible framework that allows processing of batch and real-time data. Its unified engine has made it quite popular for big data use cases. This book will help you to get started with Apache Spark 2.0 and write big data applications for a variety of use cases.

    It will also introduce you to Apache Spark one of the most popular Big Data processing frameworks. Although this book is intended to help you get started with Apache Spark, but it also focuses on explaining the core concepts.

    This practical guide provides a quick start to the Spark 2.0 architecture and its components. It teaches you how to set up Spark on your local machine. As we move ahead, you will be introduced to resilient distributed datasets (RDDs) and DataFrame APIs, and their corresponding transformations and actions. Then, we move on to the life cycle of a Spark application and learn about the techniques used to debug slow-running applications. You will also go through Spark's built-in modules for SQL, streaming, machine learning, and graph analysis.

    Finally, the book will lay out the best practices and optimization techniques that are key for writing efficient Spark applications. By the end of this book, you will have a sound fundamental understanding of the Apache Spark framework and you will be able to write and optimize Spark applications.

    What you will learn
  • Learn core concepts such as RDDs, DataFrames, transformations, and more
  • Set up a Spark development environment
  • Choose the right APIs for your applications
  • Understand Spark's architecture and the execution flow of a Spark application
  • Explore built-in modules for SQL, streaming, ML, and graph analysis
  • Optimize your Spark job for better performance
  • Who this book is for

    If you are a big data enthusiast and love processing huge amount of data, this book is for you. If you are data engineer and looking for the best optimization techniques for your Spark applications, then you will find this book helpful. This book also helps data scientists who want to implement their machine learning algorithms in Spark. You need to have a basic understanding of any one of the programming languages such as Scala, Python or Java.

    Shrey Mehrotra: author's other books


    Who wrote Apache Spark Quick Start Guide: Quickly learn the art of writing efficient big data applications with Apache Spark? Find out the surname, the name of the author of the book and a list of all author's works by series.

    Apache Spark Quick Start Guide: Quickly learn the art of writing efficient big data applications with Apache Spark — read online for free the complete book (whole text) full work

    Below is the text of the book, divided by pages. System saving the place of the last page read, allows you to conveniently read the book "Apache Spark Quick Start Guide: Quickly learn the art of writing efficient big data applications with Apache Spark" online for free, without having to search again every time where you left off. Put a bookmark, and you can go to the page where you finished reading at any time.

    Light

    Font size:

    Reset

    Interval:

    Bookmark:

    Make
    Apache Spark Quick Start Guide Quickly learn the art of writing efficient big - photo 1
    Apache Spark Quick Start Guide
    Quickly learn the art of writing efficient big data applications with Apache Spark
    Shrey Mehrotra
    Akash Grade
    BIRMINGHAM - MUMBAI Apache Spark Quick Start Guide Copyright 2019 Packt - photo 2
    BIRMINGHAM - MUMBAI
    Apache Spark Quick Start Guide

    Copyright 2019 Packt Publishing

    All rights reserved. No part of this book may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, without the prior written permission of the publisher, except in the case of brief quotations embedded in critical articles or reviews.

    Every effort has been made in the preparation of this book to ensure the accuracy of the information presented. However, the information contained in this book is sold without warranty, either express or implied. Neither the authors, nor Packt Publishing or its dealers and distributors, will be held liable for any damages caused or alleged to have been caused directly or indirectly by this book.

    Packt Publishing has endeavored to provide trademark information about all of the companies and products mentioned in this book by the appropriate use of capitals. However, Packt Publishing cannot guarantee the accuracy of this information.

    Commissioning Editor: Amey Varangaonkar
    Acquisition Editor: Siddharth Mandal
    Content Development Editor: Smit Carvalho
    Technical Editor: Aishwarya More
    Copy Editor: Safis Editing
    Project Coordinator: Pragati Shukla
    Proofreader: Safis Editing
    Indexer: Pratik Shirodkar
    Graphics: Alishon Mendonsa
    Production Coordinator: Deepika Naik

    First published: January 2019

    Production reference: 1310119

    Published by Packt Publishing Ltd.
    Livery Place
    35 Livery Street
    Birmingham
    B3 2PB, UK.

    ISBN 978-1-78934-910-8

    www.packtpub.com

    maptio Mapt is an online digital library that gives you full access to over - photo 3
    mapt.io

    Mapt is an online digital library that gives you full access to over 5,000 books and videos, as well as industry leading tools to help you plan your personal development and advance your career. For more information, please visit our website.

    Why subscribe?
    • Spend less time learning and more time coding with practical eBooks and videos from over 4,000 industry professionals

    • Improve your learning with Skill Plans built especially for you

    • Get a free eBook or video every month

    • Mapt is fully searchable

    • Copy and paste, print, and bookmark content

    Packt.com

    Did you know that Packt offers eBook versions of every book published, with PDF and ePub files available? You can upgrade to the eBook version at www.packt.com and as a print book customer, you are entitled to a discount on the eBook copy. Get in touch with us at customercare@packtpub.com for more details.

    At www.packt.com , you can also read a collection of free technical articles, sign up for a range of free newsletters, and receive exclusive discounts and offers on Packt books and eBooks.

    Contributors
    About the authors

    Shrey Mehrotra has over 8 years of IT experience and, for the past 6 years, has been designing the architecture of cloud and big-data solutions for the finance, media, and governance sectors. Having worked on research and development with big-data labs and been part of Risk Technologies, he has gained insights into Hadoop, with a focus on Spark, HBase, and Hive. His technical strengths also include Elasticsearch, Kafka, Java, YARN, Sqoop, and Flume. He likes spending time performing research and development on different big-data technologies. He is the coauthor of the books Learning YARN and Hive Cookbook, a certified Hadoop developer, and he has also written various technical papers.

    Akash Grade is a data engineer living in New Delhi, India. Akash graduated with a BSc in computer science from the University of Delhi in 2011, and later earned an MSc in software engineering from BITS Pilani. He spends most of his time designing highly scalable data pipeline using big-data solutions such as Apache Spark, Hive, and Kafka. Akash is also a Databricks-certified Spark developer. He has been working on Apache Spark for the last five years, and enjoys writing applications in Python, Go, and SQL.

    About the reviewer

    Nisith Kumar Nanda is a passionate big data consultant who loves to find solutions to complex data problems. He has around 10 years of IT experience working on multiple technologies with various clients globally. His core expertise involves working with open source big data technologies such as Apache Spark, Kafka, Cassandra, HBase, to build critical next generation real-time and batch applications. He is very proficient in various programming languages such as Java, Scala, and Python. He is passionate about AI, machine learning, and NLP.

    I would like to thank my family and especially my wife, Samita, for their support. I will also take this opportunity to thank my friends and colleagues who helped me to grow professionally.
    Packt is searching for authors like you

    If you're interested in becoming an author for Packt, please visit authors.packtpub.com and apply today. We have worked with thousands of developers and tech professionals, just like you, to help them share their insight with the global tech community. You can make a general application, apply for a specific hot topic that we are recruiting an author for, or submit your own idea.

    Preface

    Apache Spark is a flexible in-memory framework that allows the processing of both batch and real-time data in a distributed way. Its unified engine has made it quite popular for big data use cases.

    This book will help you to quickly get started with Apache Spark 2.x and help you write efficient big data applications for a variety of use cases. You will get to grip with the low-level details as well as core concepts of Apache Spark, and the way they can be used to solve big data problems . You will be introduced to RDD and DataFrame APIs, and their corresponding transformations and actions.

    This book will help you learn Spark's components for machine learning, stream processing, and graph analysis. At the end of the book, you'll learn different optimization techniques for writing efficient Spark code.

    Who this book is for

    If you are a big data enthusiast and love processing huge amounts of data, this book is for you. If you are a data engineer and looking for the best optimization techniques for your Spark applications, then you will find this book helpful. This book will also help data scientists who want to implement their machine learning algorithms in Spark. You need to have a basic understanding of programming languages such as Scala, Python, or Java.

    What this book covers

    , Introduction to Apache Spark , provides an introduction to Spark 2.0. It provides a brief description of different Spark components, including Spark Core, Spark SQL, Spark Streaming, machine learning, and graph processing. It also discusses the advantages of Spark compared to other similar frameworks.

    Next page
    Light

    Font size:

    Reset

    Interval:

    Bookmark:

    Make

    Similar books «Apache Spark Quick Start Guide: Quickly learn the art of writing efficient big data applications with Apache Spark»

    Look at similar books to Apache Spark Quick Start Guide: Quickly learn the art of writing efficient big data applications with Apache Spark. We have selected literature similar in name and meaning in the hope of providing readers with more options to find new, interesting, not yet read works.


    Reviews about «Apache Spark Quick Start Guide: Quickly learn the art of writing efficient big data applications with Apache Spark»

    Discussion, reviews of the book Apache Spark Quick Start Guide: Quickly learn the art of writing efficient big data applications with Apache Spark and just readers' own opinions. Leave your comments, write what you think about the work, its meaning or the main characters. Specify what exactly you liked and what you didn't like, and why you think so.