Jupyter for Data Science
Exploratory analysis, statistical modeling, machine learning, and data visualization with Jupyter
Dan Toomey
BIRMINGHAM - MUMBAI
Jupyter for Data Science
Copyright 2017 Packt Publishing
All rights reserved. No part of this book may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, without the prior written permission of the publisher, except in the case of brief quotations embedded in critical articles or reviews.
Every effort has been made in the preparation of this book to ensure the accuracy of the information presented. However, the information contained in this book is sold without warranty, either express or implied. Neither the author, nor Packt Publishing, and its dealers and distributors will be held liable for any damages caused or alleged to be caused directly or indirectly by this book.
Packt Publishing has endeavored to provide trademark information about all of the companies and products mentioned in this book by the appropriate use of capitals. However, Packt Publishing cannot guarantee the accuracy of this information.
First published: October 2017
Production reference: 1171017
Published by Packt Publishing Ltd.
Livery Place
35 Livery Street
Birmingham
B3 2PB, UK.
ISBN 978-1-78588-007-0
www.packtpub.com
Credits
Author Dan Toomey | Copy Editor Safis Editing |
Reviewers Jesse Bacon Ruben Oliva Ramos | Project Coordinator Nidhi Joshi |
Commissioning Editor Amey Varangaonkar | Proofreader Safis Editing |
Acquisition Editor Amey Varangaonkar | Indexer Tejal Daruwale Soni |
Content Development Editor Aishwarya Pandere | Graphics Tania Dutta |
Technical Editor Prasad Ramesh | Production Coordinator Aparna Bhagat |
About the Author
Dan Toomey has been developing applications for over 20 years. He has worked in a variety of industries and companies of all sizes, in roles from sole contributor to VP/CTO level. For the last 10 years or so, he has been contracting companies in the eastern Massachusetts area under Dan Toomey Software Corp. Dan has also written R for Data Science and Learning Jupyter with Packt Publishing.
About the Reviewers
Jesse Bacon is a hobbyist programmer and technologist in the Washington D.C. metro area. In his free time, he mostly studies a new and interesting technology or visits the gym pursuing fitness goals. Jesse values the opinions of the development community and looks forward to a new generation of programmers with all the gifts of today's computing environments.
Ruben Oliva Ramos is a computer systems engineer from Tecnologico of Len Institute, with a master's degree in computer and electronic systems engineering, teleinformatics, and networking specialization from the University of Salle Bajio in Leon, Guanajuato Mexico. He has more than 5 years of experience in developing web applications to control and monitor devices connected with Arduino and Raspberry Pi using web frameworks and cloud services to build Internet of Things applications.
He is a mechatronics teacher at the University of Salle Bajio and teaches students of master's in design and engineering of mechatronics systems. He also works at Centro de Bachillerato Tecnologico Industrial 225 in Leon, teaching electronics, robotics and control, automation, and microcontrollers at Mechatronics Technician Career.
Ruben is a consultant and developer projects in areas such as monitoring systems and datalogger data using technologies such as Android, iOS, Windows Phone, HTML5, PHP, CSS, Ajax, JavaScript, Angular, ASP .NET databases (SQlite, mongoDB, and MySQL), web servers (Node.js and IIS), hardware programming (Arduino, Raspberry pi, Ethernet Shield, GPS and GSM/GPRS, ESP8266), and control and monitor systems for data acquisition and programming.
He is the author of these two books for Packt Publishing:
- Internet of Things Programming with JavaScript
- Advanced Analytics with R and Tableau
Also Monitoring, controlling and acquisition of data with Arduino and Visual Basic .NET for Alfaomega.
I would like to thank my savior and lord, Jesus Christ, for giving me the strength and courage to pursue this project; to my dearest wife, Mayte; our two lovely sons, Ruben and Dario; my dear father, Ruben; my dearest mom Rosalia; my brother, Juan Tomas; and my sister, Rosalia, whom I love. This is for all their support while reviewing this book, for allowing me to pursue my dream and tolerating not being with them after my busy day. I'm very grateful to Packt Publishing for giving the opportunity to collaborate as an author and reviewer and join to this honest and professional team.
www.PacktPub.com
For support files and downloads related to your book, please visit www.PacktPub.com . Did you know that Packt offers eBook versions of every book published, with PDF and ePub files available? You can upgrade to the eBook version at www.PacktPub.com and as a print book customer, you are entitled to a discount on the eBook copy. Get in touch with us at service@packtpub.com for more details. At www.PacktPub.com , you can also read a collection of free technical articles, sign up for a range of free newsletters and receive exclusive discounts and offers on Packt books and eBooks.
https://www.packtpub.com/mapt
Get the most in-demand software skills with Mapt. Mapt gives you full access to all Packt books and video courses, as well as industry-leading tools to help you plan your personal development and advance your career.
Why subscribe?
- Fully searchable across every book published by Packt
- Copy and paste, print, and bookmark content
- On demand and accessible via a web browser
Customer Feedback
Thanks for purchasing this Packt book. At Packt, quality is at the heart of our editorial process. To help us improve, please leave us an honest review on this book's Amazon page at https://www.amazon.com/dp/1785880071.
If you'd like to join our team of regular reviewers, you can email us at customerreviews@packtpub.com. We award our regular reviewers with free eBooks and videos in exchange for their valuable feedback. Help us be relentless in improving our products!
Table of Contents
Preface
Jupyter is an open platform that is growing in use. Jupyter can have programs written in a variety of languages. Many of these languages are geared towards data science, such as R and Python. In this book, we'll look at solving data science problems using a variety of languages on the Jupyter platform.
We will start by looking into some of the basics of Jupyter. Then we will use Jupyter as the platform for our data analysis and visualizations. We'll look into data mining, data wrangling, and machine learning, all under the auspices of the Jupyter framework.