Machine Learning for Data Mining
Improve your data mining capabilities with advanced predictive modeling
Jesus Salcedo
BIRMINGHAM - MUMBAI
Machine Learning for Data Mining
Copyright 2019 Packt Publishing
All rights reserved. No part of this book may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, without the prior written permission of the publisher, except in the case of brief quotations embedded in critical articles or reviews.
Every effort has been made in the preparation of this book to ensure the accuracy of the information presented. However, the information contained in this book is sold without warranty, either express or implied. Neither the author, nor Packt Publishing or its dealers and distributors, will be held liable for any damages caused or alleged to have been caused directly or indirectly by this book.
Packt Publishing has endeavored to provide trademark information about all of the companies and products mentioned in this book by the appropriate use of capitals. However, Packt Publishing cannot guarantee the accuracy of this information.
Commissioning Editor: Sunith Shetty
Acquisition Editor: Devika Battike
Content Development Editor: Unnati Guha
Technical Editor: Dinesh Chaudhary
Copy Editor: Safis Editing
Project Coordinator: Manthan Patel
Proofreader: Safis Editing
Indexer: Pratik Shirodkar
Graphics: Jisha Chirayil
Production Coordinator: Arvindkumar Gupta
First published: April 2019
Production reference: 1300419
Published by Packt Publishing Ltd.
Livery Place
35 Livery Street
Birmingham
B3 2PB, UK.
ISBN 978-1-83882-897-4
www.packtpub.com
Contributors
About the author
Jesus Salcedo has a PhD in psychometrics from Fordham University. He is an independent statistical consultant and has been using SPSS products for over 20 years. He is a former SPSS Curriculum Team Lead and Senior Education Specialist who has written numerous SPSS training courses and trained thousands of users.
Packt is searching for authors like you
If you're interested in becoming an author for Packt, please visit authors.packtpub.com and apply today. We have worked with thousands of developers and tech professionals, just like you, to help them share their insight with the global tech community. You can make a general application, apply for a specific hot topic that we are recruiting an author for, or submit your own idea.
mapt.io
Mapt is an online digital library that gives you full access to over 5,000 books and videos, as well as industry leading tools to help you plan your personal development and advance your career. For more information, please visit our website.
Why subscribe?
Spend less time learning and more time coding with practical eBooks and Videos from over 4,000 industry professionals
Improve your learning with Skill Plans built especially for you
Get a free eBook or video every month
Mapt is fully searchable
Copy and paste, print, and bookmark content
Packt.com
Did you know that Packt offers eBook versions of every book published, with PDF and ePub files available? You can upgrade to the eBook version at www.packt.com and as a print book customer, you are entitled to a discount on the eBook copy. Get in touch with us at customercare@packtpub.com for more details.
At www.packt.com , you can also read a collection of free technical articles, sign up for a range of free newsletters, and receive exclusive discounts and offers on Packt books and eBooks.
Preface
30% of data mining vacancies also involve machine learning. And those that do are 30% better paid than the rest. If youre involved in data mining, you need to get on top of machine learning, before it gets on top of you.
Hands-On Machine Learning for Data Mining gives you everything you need to bring the power of machine learning into your data mining work. This book will enable you to pair the best algorithms with the right tools and processes. You will see how systems can learn from data, identify patterns, and make predictions on data, all with minimal human intervention.
Who this book is for
If you are a data mining professional who wishes to get a ticket to a 30% higher salary by adding machine learning to your skill set, then this is the ideal course for you. No prior knowledge in machine learning is assumed.
What this book covers
, Introducing Machine Learning Predictive Models , introduces you to the theory behind predictive models, looking at how they work and providing an insight into types of predictive modeling, such as the neural network model, which is explained in brief in this chapter.
, Getting Started with Machine Learning , introduces you to the implementation of a neural network model, and gives an insight into the implementation of Support Vector Machines (SVMs) as well.
, Understanding Models , explains different types of models and the situations in which each of them should ideally be used.
, Improving Individual Models , shows you different ways in which we can improve our models. This chapter will show you four methods to improve the accuracy of your model.
, Advanced Ways of Improving Models , focuses on combining different models in different ways to get increasingly better results. In this chapter, we will see how a certain part of a dataset, which doesn't contribute much to the results of a neural network model, performs very well on the CHAID and C5.0 decision tree models. We will also see how to model the errors to prepare our models.
To get the most out of this book
- Some knowledge on what data mining is, and the basic concepts of machine learning, will act as starting points for this book.
- Familiarity with any machine learning modeler, specifically the SPSS Modeler provided by IBM, will be a plus, but isn't necessary.
Download the example code files
You can download the example code files for this book from your account at www.packt.com. If you purchased this book elsewhere, you can visit www.packt.com/support and register to have the files emailed directly to you.
You can download the code files by following these steps:
- Log in or register at www.packt.com.
- Select the SUPPORT tab.
- Click on Code Downloads & Errata .
- Enter the name of the book in the Search box and follow the onscreen instructions.
Once the file is downloaded, please make sure that you unzip or extract the folder using the latest version of:
- WinRAR/7-Zip for Windows
- Zipeg/iZip/UnRarX for Mac
- 7-Zip/PeaZip for Linux
Next page