• Complain

Pascal Bugnion - Scala: Guide for Data Science Professionals

Here you can read online Pascal Bugnion - Scala: Guide for Data Science Professionals full text of the book (entire story) in english for free. Download pdf and epub, get meaning, cover and reviews about this ebook. year: 2017, publisher: Packt Publishing, genre: Computer. Description of the work, (preface) as well as reviews are available. Best literature library LitArk.com created for fans of good reading and offers a wide selection of genres:

Romance novel Science fiction Adventure Detective Science History Home and family Prose Art Politics Computer Non-fiction Religion Business Children Humor

Choose a favorite category and find really read worthwhile books. Enjoy immersion in the world of imagination, feel the emotions of the characters or learn something new for yourself, make an fascinating discovery.

Pascal Bugnion Scala: Guide for Data Science Professionals

Scala: Guide for Data Science Professionals: summary, description and annotation

We offer to read an annotation, description, summary or preface (depends on what the author of the book "Scala: Guide for Data Science Professionals" wrote himself). If you haven't found the necessary information about the book — write in the comments, we will try to find it.

Scala will be a valuable tool to have on hand during your data science journey for everything from data cleaning to cutting-edge machine learning

About This Book
  • Build data science and data engineering solutions with ease
  • An in-depth look at each stage of the data analysis process from reading and collecting data to distributed analytics
  • Explore a broad variety of data processing, machine learning, and genetic algorithms through diagrams, mathematical formulations, and source code
Who This Book Is For

This learning path is perfect for those who are comfortable with Scala programming and now want to enter the field of data science. Some knowledge of statistics is expected.

What You Will Learn
  • Transfer and filter tabular data to extract features for machine learning
  • Read, clean, transform, and write data to both SQL and NoSQL databases
  • Create Scala web applications that couple with JavaScript libraries such as D3 to create compelling interactive visualizations
  • Load data from HDFS and HIVE with ease
  • Run streaming and graph analytics in Spark for exploratory analysis
  • Bundle and scale up Spark jobs by deploying them into a variety of cluster managers
  • Build dynamic workflows for scientific computing
  • Leverage open source libraries to extract patterns from time series
  • Master probabilistic models for sequential data
In Detail

Scala is especially good for analyzing large sets of data as the scale of the task doesnt have any significant impact on performance. Scalas powerful functional libraries can interact with databases and build scalable frameworks resulting in the creation of robust data pipelines.

The first module introduces you to Scala libraries to ingest, store, manipulate, process, and visualize data. Using real world examples, you will learn how to design scalable architecture to process and model data starting from simple concurrency constructs and progressing to actor systems and Apache Spark. After this, you will also learn how to build interactive visualizations with web frameworks.

Once you have become familiar with all the tasks involved in data science, you will explore data analytics with Scala in the second module. Youll see how Scala can be used to make sense of data through easy to follow recipes. You will learn about Bokeh bindings for exploratory data analysis and quintessential machine learning with algorithms with Spark ML library. Youll get a sufficient understanding of Spark streaming, machine learning for streaming data, and Spark graphX.

Armed with a firm understanding of data analysis, you will be ready to explore the most cutting-edge aspect of data science machine learning. The final module teaches you the A to Z of machine learning with Scala. Youll explore Scala for dependency injections and implicits, which are used to write machine learning algorithms. Youll also explore machine learning topics such as clustering, dimentionality reduction, Naive Bayes, Regression models, SVMs, neural networks, and more.

This learning path combines some of the best that Packt has to offer into one complete, curated package. It includes content from the following Packt products:

  • Scala for Data Science, Pascal Bugnion
  • Scala Data Analysis Cookbook, Arun Manivannan
  • Scala for Machine Learning, Patrick R. Nicolas
Style and approach

A complete package with all the information necessary to start building useful data engineering and data science solutions straight away. It contains a diverse set of recipes that cover the full spectrum of interesting data analysis tasks and will help you revolutionize your data analysis skills using Scala.

Pascal Bugnion: author's other books


Who wrote Scala: Guide for Data Science Professionals? Find out the surname, the name of the author of the book and a list of all author's works by series.

Scala: Guide for Data Science Professionals — read online for free the complete book (whole text) full work

Below is the text of the book, divided by pages. System saving the place of the last page read, allows you to conveniently read the book "Scala: Guide for Data Science Professionals" online for free, without having to search again every time where you left off. Put a bookmark, and you can go to the page where you finished reading at any time.

Light

Font size:

Reset

Interval:

Bookmark:

Make
Scala: Guide for Data Science Professionals

Table of Contents
Scala: Guide for Data Science Professionals

Scala: Guide for Data Science Professionals

Scala will be a valuable tool to have on hand during your data science journey for everything from data cleaning to cutting-edge machine learning

A course in three modules

BIRMINGHAM - MUMBAI Scala Guide for Data Science Professionals Copyright 2017 - photo 1

BIRMINGHAM - MUMBAI

Scala: Guide for Data Science Professionals

Copyright 2017 Packt Publishing

All rights reserved. No part of this course may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, without the prior written permission of the publisher, except in the case of brief quotations embedded in critical articles or reviews.

Every effort has been made in the preparation of this course to ensure the accuracy of the information presented. However, the information contained in this course is sold without warranty, either express or implied. Neither the authors, nor Packt Publishing, and its dealers and distributors will be held liable for any damages caused or alleged to be caused directly or indirectly by this course.

Packt Publishing has endeavored to provide trademark information about all of the companies and products mentioned in this course by the appropriate use of capitals. However, Packt Publishing cannot guarantee the accuracy of this information.

Published on: January 2017

Published by Packt Publishing Ltd.

Livery Place

35 Livery Street

Birmingham B3 2PB, UK.

ISBN 978-1-78728-285-8

www.packtpub.com

Credits

Authors

Pascal Bugnion

Arun Manivannan

Patrick R. Nicolas

Reviewers

Umanga Bista

Radek Ostrowski

Yuanhang Wang

Amir Hajian

Shams Mahmood Imam

Gerald Loeffler

Subhajit Datta

Rui Gonalves

Patricia Hoffman, PhD

Md Zahidul Islam

Content Development Editor

Trusha Shriyan

Graphics

Kirk D'Penha

Production Coordinator

Shantanu N. Zagade

Preface

Scala is a popular language for data science. By emphasizing immutability and functional constructs, Scala lends itself well to the construction of robust libraries for concurrency and big data analysis. A rich ecosystem of tools for data science has therefore developed around Scala, including libraries for accessing SQL and NoSQL databases, frameworks for building distributed applications like Apache Spark and libraries for linear algebra and numerical algorithms. We will explore this rich and growing ecosystem in this learning path.

What this learning path covers

, Scala for Data Science, will introduce you to the libraries for ingesting, storing, manipulating, processing, and visualizing data in Scala. Packed with real-world examples and interesting data sets, this module will teach you to ingest data from flat files and web APIs and store it in a SQL or NoSQL database. It will show you how to design scalable architectures to process and modeling your data, starting from simple concurrency constructs such as parallel collections and futures, through to actor systems and Apache Spark. As well as Scala's emphasis on functional structures and immutability, you will learn how to use the right parallel construct for the job at hand, minimizing development time without compromising scalability. Finally, you will learn how to build beautiful interactive visualizations using web frameworks. This module gives tutorials on some of the most common Scala libraries for data science, allowing you to quickly get up to speed with building data science and data engineering solutions.

, Scala Data Analysis Cookbook, will introduce you to the most popular Scala tools, libraries, and frameworks through practical recipes around loading, manipulating, and preparing your data. It will also help you explore and make sense of your data using stunning and insightful visualizations, and machine learning toolkits.Starting with introductory recipes on utilizing the Breeze and Spark libraries, get to grips with how to import data from a host of possible sources and how to pre-process numerical, string, and date data. Next, you'll get an understanding of concepts that will help you visualize data using the Apache Zeppelin and Bokeh bindings in Scala, enabling exploratory data analysis. Discover how to program quintessential machine learning algorithms using Spark ML library. Work through steps to scale your machine learning models and deploy them into a standalone cluster, EC2, YARN, and Mesos. Finally dip into the powerful options presented by Spark Streaming, and machine learning for streaming data, as well as utilizing Spark GraphX.

, Scala for Machine Learning, will introduce you to the functional capabilities of the Scala programming language that are critical to the creation of machine learning algorithms such as dependency injection and implicits.Your learning journey starts with data pre-processing and filtering techniques, then move on to clustering and dimension reduction, Nave Bayes, regression models, sequential data, regularization and kernelization, support vector machines, Neural networks, generic algorithms and re-enforcement learning. The review of the Akka framework and Apache Spark clusters concludes the tutorial. Techniques throughout the module is applied to the analysis, recommendation, classification, and prediction of financial markets.

This module will guide you through the process of building AI applications with diagrams, formal mathematical notation, source code snippets and useful tips.

What you need for this learning path

The examples provided in this learning path require that you have a working Scala installation and SBT, the Simple Build Tool, a command line utility for compiling and running Scala code. We will walk you through how to install these in the next sections. We do not require a specific IDE. The code examples can be written in your favorite text editor or IDE.

Who this learning path is for

This learning path is perfect for those who are comfortable with Scala programming and now want to enter the field of data science. Some knowledge of statistics is expected.

Reader feedback

Feedback from our readers is always welcome. Let us know what you think about this coursewhat you liked or disliked. Reader feedback is important for us as it helps us develop titles that you will really get the most out of.

To send us general feedback, simply e-mail <>, and mention the course's title in the subject of your message.

If there is a topic that you have expertise in and you are interested in either writing or contributing to a book, see our author guide at www.packtpub.com/authors.

Customer support

Now that you are the proud owner of a Packt course, we have a number of things to help you to get the most from your purchase.

Downloading the example code

You can download the example code files for this course from your account at http://www.packtpub.com. If you purchased this course elsewhere, you can visit http://www.packtpub.com/support and register to have the files e-mailed directly to you.

You can download the code files by following these steps:

  1. Log in or register to our website using your e-mail address and password.
Next page
Light

Font size:

Reset

Interval:

Bookmark:

Make

Similar books «Scala: Guide for Data Science Professionals»

Look at similar books to Scala: Guide for Data Science Professionals. We have selected literature similar in name and meaning in the hope of providing readers with more options to find new, interesting, not yet read works.


Reviews about «Scala: Guide for Data Science Professionals»

Discussion, reviews of the book Scala: Guide for Data Science Professionals and just readers' own opinions. Leave your comments, write what you think about the work, its meaning or the main characters. Specify what exactly you liked and what you didn't like, and why you think so.