• Complain

Danish Haroon - Python Machine Learning Case Studies: Five Case Studies for the Data Scientist

Here you can read online Danish Haroon - Python Machine Learning Case Studies: Five Case Studies for the Data Scientist full text of the book (entire story) in english for free. Download pdf and epub, get meaning, cover and reviews about this ebook. year: 2017, publisher: Apress, genre: Home and family. Description of the work, (preface) as well as reviews are available. Best literature library LitArk.com created for fans of good reading and offers a wide selection of genres:

Romance novel Science fiction Adventure Detective Science History Home and family Prose Art Politics Computer Non-fiction Religion Business Children Humor

Choose a favorite category and find really read worthwhile books. Enjoy immersion in the world of imagination, feel the emotions of the characters or learn something new for yourself, make an fascinating discovery.

Danish Haroon Python Machine Learning Case Studies: Five Case Studies for the Data Scientist
  • Book:
    Python Machine Learning Case Studies: Five Case Studies for the Data Scientist
  • Author:
  • Publisher:
    Apress
  • Genre:
  • Year:
    2017
  • Rating:
    4 / 5
  • Favourites:
    Add to favourites
  • Your mark:
    • 80
    • 1
    • 2
    • 3
    • 4
    • 5

Python Machine Learning Case Studies: Five Case Studies for the Data Scientist: summary, description and annotation

We offer to read an annotation, description, summary or preface (depends on what the author of the book "Python Machine Learning Case Studies: Five Case Studies for the Data Scientist" wrote himself). If you haven't found the necessary information about the book — write in the comments, we will try to find it.

Embrace machine learning approaches and Python to enable automatic rendering of rich insights and solve business problems. The book uses a hands-on case study-based approach to crack real-world applications to which machine learning concepts can be applied. These smarter machines will enable your business processes to achieve efficiencies on minimal time and resources.

Python Machine Learning Case Studies takes you through the steps to improve business processes and determine the pivotal points that frame strategies. Youll see machine learning techniques that you can use to support your products and services. Moreover youll learn the pros and cons of each of the machine learning concepts to help you decide which one best suits your needs.

By taking a step-by-step approach to coding in Python youll be able to understand

the rationale behind model selection and decisions within the machine learning process. The book is equipped with practical examples along with code snippets to ensure that you understand the data science approach to solving real-world problems.

What You Will Learn

  • Gain insights into machine learning concepts

  • Work on real-world applications of machine learning

  • Learn concepts of model selection and optimization
  • Get a hands-on overview of Python from a machine learning point of view

Who This Book Is For

Data scientists, data analysts, artificial intelligence engineers, big data enthusiasts, computer scientists, computer sciences students, and capital market analysts.

Danish Haroon: author's other books


Who wrote Python Machine Learning Case Studies: Five Case Studies for the Data Scientist? Find out the surname, the name of the author of the book and a list of all author's works by series.

Python Machine Learning Case Studies: Five Case Studies for the Data Scientist — read online for free the complete book (whole text) full work

Below is the text of the book, divided by pages. System saving the place of the last page read, allows you to conveniently read the book "Python Machine Learning Case Studies: Five Case Studies for the Data Scientist" online for free, without having to search again every time where you left off. Put a bookmark, and you can go to the page where you finished reading at any time.

Light

Font size:

Reset

Interval:

Bookmark:

Make
Danish Haroon 2017
Danish Haroon Python Machine Learning Case Studies
1. Statistics and Probability
Danish Haroon 1
(1)
Karachi, Pakistan
The purpose of this chapter is to instill in you the basic concepts of traditional statistics and probability. Certainly many of you might be wondering what it has to do with machine learning. Well, in order to apply a best fit model to your data, the most important prerequisite is for you to understand the data in the first place. This will enable you to find out distributions within data, measure the goodness of data, and run some basic tests to understand if some form of relationship exists between dependant and independent variables. Lets dive in.
Note
This book incorporates Python 2.7.11 as the de facto standard for coding examples. Moreover, you are required to have it installed it for the Exercises as well.
So why do I prefer Python 2.7.11 over Python 3x? Following are some of the reasons:
  • Third-party library support for Python 2x is relatively better than support for Python 3x. This means that there are a considerable number of libraries in Python 2x that lack support in Python 3x.
  • Some current Linux distributions and macOS provide Python 2x by default. The objective is to let readers, regardless of their OS version, apply the code examples on their systems, and thus this is the choice to go forward with.
  • The above-mentioned facts are the reason why companies prefer to work with Python 2x or why they decide not to migrate their code base from Python 2x to Python 3x.
Case Study: Cycle Sharing SchemeDetermining Brand Persona
Nancy and Eric were assigned with the huge task of determining the brand persona for a new cycle share scheme. They had to present their results at this years annual board meeting in order to lay out a strong marketing plan for reaching out to potential customers.
The cycle sharing scheme provides means for the people of the city to commute using a convenient, cheap, and green transportation alternative. The service has 500 bikes at 50 stations across Seattle. Each of the stations has a dock locking system (where all bikes are parked); kiosks (so customers can get a membership key or pay for a trip); and a helmet rental service. A person can choose between purchasing a membership key or short-term pass. A membership key entitles an annual membership, and the key can be obtained from a kiosk. Advantages for members include quick retrieval of bikes and unlimited 45-minute rentals. Short-term passes offer access to bikes for a 24-hour or 3-day time interval. Riders can avail and return the bikes at any of the 50 stations citywide.
Jason started this service in May 2014 and since then had been focusing on increasing the number of bikes as well as docking stations in order to increase convenience and accessibility for his customers. Despite this expansion, customer retention remained an issue. As Jason recalled, We had planned to put in the investment for a year to lay out the infrastructure necessary for the customers to start using it. We had a strategy to make sure that the retention levels remain high to make this model self-sustainable. However, it worked otherwise (i.e., the customer base didnt catch up with the rate of the infrastructure expansion).
A private service would have had three alternatives to curb this problem: get sponsors on board, increase service charges, or expand the pool of customers. Price hikes were not an option for Jason as this was a publicly sponsored initiative with the goal of providing affordable transportation to all. As for increasing the customer base, they had to decide upon a marketing channel that guarantees broad reach on low cost incurred.
Nancy, a marketer who had worked in the corporate sector for ten years, and Eric, a data analyst, were explicitly hired to find a way to make things work around this problem. The advantage on their side was that they were provided with the dataset of transaction history and thus they didnt had to go through the hassle of conducting marketing research to gather data.
Nancy realized that attracting recurring customers on a minimal budget required understanding the customers in the first place (i.e., persona). As she stated, Understanding the persona of your brand is essential, as it helps you reach a targeted audience which is likely to convert at a higher probability. Moreover, this also helps in reaching out to sponsors who target a similar persona. This two-fold approach can make our bottom line positive.
As Nancy and Eric contemplated the problem at hand, they had questions like the following: Which attribute correlates the best with trip duration and number of trips? Which age generation adapts the most to our service?
Following is the data dictionary of the Trips dataset that was provided to Nancy and Eric:
Table 1-1.
Data Dictionary for the Trips Data from Cycles Share Dataset
Feature name
Description
trip_id
Unique ID assigned to each trip
Starttime
Day and time when the trip started, in PST
Stoptime
Day and time when the trip ended, in PST
Bikeid
ID attached to each bike
Tripduration
Time of trip in seconds
from_station_name
Name of station where the trip originated
to_station_name
Name of station where the trip terminated
from_station_id
ID of station where trip originated
to_station_id
ID of station where trip terminated
Usertype
Value can include either of the following: short-term pass holder or member
Gender
Gender of the rider
Birthyear
Birth year of the rider
Exercises for this chapter required Eric to install the packages shown in Listing . He preferred to import all of them upfront to avoid bottlenecks while implementing the code snippets on your local machine.
However, for Eric to import these packages in his code, he needed to install them in the first place. He did so as follows :
  1. Opened terminal/shell
  2. Navigated to his code directory using terminal/shell
  3. Installed pip:
    python get-pip.py
  4. Installed each package separately, for example:
    pip install pandas
%matplotlib inline
import random
import datetime
import pandas as pd
import matplotlib.pyplot as plt
import statistics
import numpy as np
import scipy
from scipy import stats
import seaborn
Listing 1-1.
Importing Packages Required for This Chapter
Performing Exploratory Data Analysis
Eric recalled to have explained Exploratory Data Analysis in the following words:
What do I mean by exploratory data analysis ( EDA) ? Well, by this I mean to see the data visually. Why do we need to see the data visually? Well, considering that you have 1 million observations in your dataset then it wont be easy for you to understand the data just by looking at it, so it would be better to plot it visually. But dont you think its a waste of time? No not at all, because understanding the data lets us understand the importance of features and their limitations.
Next page
Light

Font size:

Reset

Interval:

Bookmark:

Make

Similar books «Python Machine Learning Case Studies: Five Case Studies for the Data Scientist»

Look at similar books to Python Machine Learning Case Studies: Five Case Studies for the Data Scientist. We have selected literature similar in name and meaning in the hope of providing readers with more options to find new, interesting, not yet read works.


Reviews about «Python Machine Learning Case Studies: Five Case Studies for the Data Scientist»

Discussion, reviews of the book Python Machine Learning Case Studies: Five Case Studies for the Data Scientist and just readers' own opinions. Leave your comments, write what you think about the work, its meaning or the main characters. Specify what exactly you liked and what you didn't like, and why you think so.