• Complain

Kassambara - Machine Learning Essentials: Practical Guide in R

Here you can read online Kassambara - Machine Learning Essentials: Practical Guide in R full text of the book (entire story) in english for free. Download pdf and epub, get meaning, cover and reviews about this ebook. year: 2018, genre: Home and family. Description of the work, (preface) as well as reviews are available. Best literature library LitArk.com created for fans of good reading and offers a wide selection of genres:

Romance novel Science fiction Adventure Detective Science History Home and family Prose Art Politics Computer Non-fiction Religion Business Children Humor

Choose a favorite category and find really read worthwhile books. Enjoy immersion in the world of imagination, feel the emotions of the characters or learn something new for yourself, make an fascinating discovery.

No cover

Machine Learning Essentials: Practical Guide in R: summary, description and annotation

We offer to read an annotation, description, summary or preface (depends on what the author of the book "Machine Learning Essentials: Practical Guide in R" wrote himself). If you haven't found the necessary information about the book — write in the comments, we will try to find it.

Kassambara: author's other books


Who wrote Machine Learning Essentials: Practical Guide in R? Find out the surname, the name of the author of the book and a list of all author's works by series.

Machine Learning Essentials: Practical Guide in R — read online for free the complete book (whole text) full work

Below is the text of the book, divided by pages. System saving the place of the last page read, allows you to conveniently read the book "Machine Learning Essentials: Practical Guide in R" online for free, without having to search again every time where you left off. Put a bookmark, and you can go to the page where you finished reading at any time.

Light

Font size:

Reset

Interval:

Bookmark:

Make
Machine Learning Essentials

Alboukadel KASSAMBARA

Machine Learning Essentials
Preface
0.1 What you will learn

Large amount of data are recorded every day in different fields, including marketing, bio-medical and security. To discover knowledge from these data, you need machine learning techniques, which are classified into two categories:

  1. Unsupervised machine learning methods :

These include mainly clustering and principal component analysis methods. The goal of clustering is to identify pattern or groups of similar objects within a data set of interest. Principal component methods consist of summarizing and visualizing the most important information contained in a multivariate data set.

These methods are "unsupervised" because we are not guided by a priori ideas of which variables or samples belong in which clusters or groups. The machine algorithm "learns" how to cluster or summarize the data.

  1. Supervised machine learning methods :

Supervised learning consists of building mathematical models for predicting the outcome of future observations. Predictive models can be classified into two main groups:

  • regression analysis for predicting a continuous variable. For example, you might want to predict life expectancy based on socio-economic indicators.

  • Classification for predicting the class (or group) of individuals. For example, you might want to predict the probability of being diabetes-positive based on the glucose concentration in the plasma of patients.

These methods are supervised because we build the model based on known outcome values. That is, the machine learns from known observation outcomes in order to predict the outcome of future cases.

In this book, we present a practical guide to machine learning methods for exploring data sets, as well as, for building predictive models.

You'll learn the basic ideas of each method and reproducible R codes for easily computing a large number of machine learning techniques.

0.2 Key features of this book

Our goal was to write a practical guide to machine learning for every one.

The main parts of the book include:

  • Unsupervised learning methods , to explore and discover knowledge from a large multivariate data set using clustering and principal component methods. You will learn hierarchical clustering, k-means, principal component analysis and correspondence analysis methods.
  • Regression analysis , to predict a quantitative outcome value using linear regression and non-linear regression strategies.
  • Classification techniques , to predict a qualitative outcome value using logistic regression, discriminant analysis, naive bayes classifier and support vector machines.
  • Advanced machine learning methods , to build robust regression and classification models using k-nearest neighbors methods, decision tree models, ensemble methods (bagging, random forest and boosting)
  • Model selection methods , to select automatically the best combination of predictor variables for building an optimal predictive model. These include, best subsets selection methods, stepwise regression and penalized regression (ridge, lasso and elastic net regression models). We also present principal component-based regression methods, which are useful when the data contain multiple correlated predictor variables.
  • Model validation and evaluation techniques for measuring the performance of a predictive model.
  • Model diagnostics for detecting and fixing a potential problems in a predictive model.

The book presents the basic principles of these tasks and provide many examples in R. This book offers solid guidance in data mining for students and researchers.

Key features:

  • Covers machine learning algorithm and implementation
  • Key mathematical concepts are presented
  • Short, self-contained chapters with practical examples. This means that, you don't need to read the different chapters in sequence.

At the end of each chapter, we present R lab sections in which we systematically work through applications of the various methods discussed in that chapter.

0.3 Book website

http://www.sthda.com/english

About the author

Alboukadel Kassambara is a PhD in Bioinformatics and Cancer Biology. He works since many years on genomic data analysis and visualization (read more: http://www.alboukadel.com/ ).

He has work experiences in statistical and computational methods to identify prognostic and predictive biomarker signatures through integrative analysis of large-scale genomic and clinical data sets.

He created a bioinformatics web-tool named GenomicScape (www.genomicscape.com) which is an easy-to-use web tool for gene expression data analysis and visualization.

He developed also a training website on data science, named STHDA (Statistical Tools for High-throughput Data Analysis, www.sthda.com/english), which contains many tutorials on data analysis and visualization using R software and packages.

He is the author of many popular R packages for:

  • multivariate data analysis (factoextra , http://www.sthda.com/english/rpkgs/factoextra ),
  • survival analysis (survminer , http://www.sthda.com/english/rpkgs/survminer/ ),
  • correlation analysis (ggcorrplot , http://www.sthda.com/english/wiki/ggcorrplot-visualization-of-a-correlation-matrix-using-ggplot2 ),
  • creating publication ready plots in R (ggpubr , http://www.sthda.com/english/rpkgs/ggpubr ).

Recently, he published several books on data analysis and visualization:

  1. Practical Guide to Cluster Analysis in R (https://goo.gl/yhhpXh )
  2. Practical Guide To Principal Component Methods in R (https://goo.gl/d4Doz9 )
  3. R Graphics Essentials for Great Data Visualization (https://goo.gl/oT8Ra6 )
  4. Network Analysis and Visulization in R (https://goo.gl/WBdn4n )
Introduction to R

R is a free and powerful statistical software for analyzing and visualizing data.

In this chapter, you'll learn how to install R and required packages, as well as, how to import your data into R.

2.1 Install R and RStudio

RStudio is an integrated development environment for R that makes using R easier. R and RStudio can be installed on Windows, MAC OSX and Linux platforms.

  1. R can be downloaded and installed from the Comprehensive R Archive Network (CRAN) webpage (http://cran.r-project.org/ )
  2. After installing R software, install also the RStudio software available at: http://www.rstudio.com/products/RStudio/ .
  3. Launch RStudio and start use R inside R studio.
2.2 Install and load required R packages

An R package is a collection of functionalities that extends the capabilities of base R. To use the R code provide in this book, you should install the following R packages:

  • tidyverse for easy data manipulation and visualization.
  • caret package for easy machine learning workflow.
  1. Installing packages :
mypkgs <- c ( "tidyverse" , "caret" ) install.packages (mypkgs)
  1. Load required packages . After installation, you must first load the package for using the functions in the package. The function library() is used for this task. For example, type this:
Next page
Light

Font size:

Reset

Interval:

Bookmark:

Make

Similar books «Machine Learning Essentials: Practical Guide in R»

Look at similar books to Machine Learning Essentials: Practical Guide in R. We have selected literature similar in name and meaning in the hope of providing readers with more options to find new, interesting, not yet read works.


Reviews about «Machine Learning Essentials: Practical Guide in R»

Discussion, reviews of the book Machine Learning Essentials: Practical Guide in R and just readers' own opinions. Leave your comments, write what you think about the work, its meaning or the main characters. Specify what exactly you liked and what you didn't like, and why you think so.