Copyright 2020 by AI Publishing
All rights reserved.
First Printing, 2020
Edited by AI Publishing
eBook Converted and Cover by Gazler Studio
Published by AI Publishing LLC
ISBN-13: 978-1-7347901-5-3
The contents of this book may not be copied, reproduced, duplicated, or transmitted without the direct written permission of the author. Under no circumstances whatsoever will any legal liability or blame be held against the publisher for any compensation, damages, or monetary loss due to the information contained herein, either directly or indirectly.
Legal Notice:
You are not permitted to amend, use, distribute, sell, quote, or paraphrase any part of the content within this book without the specific consent of the author.
Disclaimer Notice:
Kindly note that the information contained within this document is solely for educational and entertainment purposes. No warranties of any kind are indicated or expressed. Readers accept that the author is not providing any legal, professional, financial, or medical advice. Kindly consult a licensed professional before trying out any techniques explained in this book.
By reading this document, the reader consents that under no circumstances is the author liable for any losses, direct or indirect, that are incurred as a consequence of the use of the information contained within this document, including, but not restricted to, errors, omissions, or inaccuracies.
How to Contact Us
If you have any feedback, please let us know by sending an email to .
Your feedback is immensely valued, and we look forward to hearing from you.
It will be beneficial for us to improve the quality of our books.
To get the Python codes and materials used in this book, please click the link below:
https://www.aispublishing.net/book-pmlds
The order number is required.
About the Publisher
At AI Publishing Company, we have established an international learning platform specifically for young students, beginners, small enterprises, startups, and managers who are new to data science and artificial intelligence.
Through our interactive, coherent, and practical books and courses, we help beginners learn skills that are crucial to developing AI and data science projects.
Our courses and books range from basic introduction courses to language programming and data science to advanced courses for machine learning, deep learning, computer vision, big data, and much more. The programming languages used include Python, R, and some data science and AI software.
AI Publishings core focus is to enable our learners to create and try proactive solutions for digital problems by leveraging the power of AI and data science to the maximum extent.
Moreover, we offer specialized assistance in the form of our online content and eBooks, providing up-to-date and useful insight into AI practices and data science subjects, along with eliminating the doubts and misconceptions about AI and programming.
Our experts have cautiously developed our contents and kept them concise, short, and comprehensive so that you can understand everything clearly and effectively and start practicing the applications right away.
We also offer consultancy and corporate training in AI and data science for enterprises so that their staff can navigate through the workflow efficiently.
With AI Publishing, you can always stay closer to the innovative world of AI and data science.
If you are eager to learn the A to Z of AI and data science but have no clue where to start, AI Publishing is the finest place to go.
Please contact us by email at
AI Publishing is Looking for Authors Like You
Interested in becoming an author for AI Publishing? Please contact us at
We are working with developers and AI tech professionals just like you to help them share their insights with the global AI and Data Science lovers. You can share all your knowledge about hot topics in AI and Data Science.
Table of Contents
Preface
Thank you for your decision on purchasing this book. I can assure you that you will not regret your decision. The saying data is the new oil is no longer a mere cliche. Data is actually powering the industries of today. Organizations and companies need to improve their growth, which depends on correct decision making. Accurate decision making requires facts and figures and statistical analysis of data. Data science does exactly that. With data and machine learning, you can extract and visualize data in detail and create statistical models, which, in turn, help you in decision making. In this book, you will learn all these concepts. So, buckle up for a journey that may give you your career break!
Book Approach
The book follows a very simple approach. It is divided into 10 chapters. The first five chapters of the book are dedicated to data analysis and visualization, while the last five chapters are based on machine learning and statistical models for data science. Chapter 1 provides a very brief introduction to data science and machine learning and provides a roadmap for step by step learning approach to data science and machine learning. The process for environment setup, including the software needed to run scripts in this book, is also explained in this chapter.
Chapter 2 contains a crash course on Python for beginners. If you are already familiar with Python, you can skip Chapter 2. Chapter 3 and chapter 4 explain the use of NumPy and Pandas libraries, respectively, for data analysis. Chapter 5 explains the process of data visualization using Pythons data visualization libraries such as Matplotlib, Seaborn, and Pandas.
Chapters 6 and 7 provide an introduction to supervised machine learning approaches like regression and classification with the help of the Scikit Learn library. Chapter 8 explains unsupervised machine learning, where you study different clustering approaches for machine learning. Chapter 9 details the introduction to deep learning with TensorFlow 2.0 library, where you will study densely connected neural networks, recurrent neural networks, and convolutional neural networks. Finally, dimensionality reduction approaches have been discussed in the 10 th chapter of this book.
In each chapter, an explanation of theoretical concepts is followed by practical examples. Each chapter also contains exercises that students can use to evaluate their understanding of the concepts explained in the chapter. The Python notebook for each chapter is provided in the Source Codes folder in the GitHub repository. It is advised that instead of copying the code, you write the code yourself, and in case of an error, you match your code with the corresponding Python notebook, find and then correct the error. You can download the datasets used in this book either at runtime or in the Datasets folder in the GitHub repository.
Who Is This Book For?
This book explains different data science and machine learning concepts with the help of examples using various Python libraries. The book is aimed ideally at absolute beginners to data science and machine learning. Though a background in the Python programming language and feature engineering can help speed up learning, the book contains a crash course on Python programming language in the first chapter. Therefore, the only prerequisites to efficiently using this book are access to a computer with internet and basic knowledge of linear algebra and calculus. All the codes and datasets have been provided. However, to download data preparation libraries, you will need the internet.