Hands-On Recommendation Systems with Python
Start building powerful and personalized, recommendation engines with Python
Rounak Banik
BIRMINGHAM - MUMBAI
Hands-On Recommendation Systems with Python
Copyright 2018 Packt Publishing
All rights reserved. No part of this book may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, without the prior written permission of the publisher, except in the case of brief quotations embedded in critical articles or reviews.
Every effort has been made in the preparation of this book to ensure the accuracy of the information presented. However, the information contained in this book is sold without warranty, either express or implied. Neither the author, nor Packt Publishing or its dealers and distributors, will be held liable for any damages caused or alleged to have been caused directly or indirectly by this book.
Packt Publishing has endeavored to provide trademark information about all of the companies and products mentioned in this book by the appropriate use of capitals. However, Packt Publishing cannot guarantee the accuracy of this information.
Commissioning Editor: Pravin Dhandre
Acquisition Editor: Noyonika Das
Content Development Editor: Mohammed Yusuf Imaratwale
Technical Editor: Jinesh Topiwala
Copy Editor: Safis Editing
Project Coordinator: Hardik Bhinde
Proofreader: Safis Editing
Indexer: Priyanka Dhadke
Graphics: Jason Monteiro
Production Coordinator: Nilesh Mohite
First published: July 2018
Production reference: 2310718
Published by Packt Publishing Ltd.
Livery Place
35 Livery Street
Birmingham
B3 2PB, UK.
ISBN 978-1-78899-375-3
www.packtpub.com
To my mentor, Baran Toppare, and my friend, Utkarsh Bajpai, for their encouragement and support.
Rounak Banik
mapt.io
Mapt is an online digital library that gives you full access to over 5,000 books and videos, as well as industry leading tools to help you plan your personal development and advance your career. For more information, please visit our website.
Why subscribe?
Spend less time learning and more time coding with practical eBooks and Videos from over 4,000 industry professionals
Improve your learning with Skill Plans built especially for you
Get a free eBook or video every month
Mapt is fully searchable
Copy and paste, print, and bookmark content
PacktPub.com
Did you know that Packt offers eBook versions of every book published, with PDF and ePub files available? You can upgrade to the eBook version at www.PacktPub.com and as a print book customer, you are entitled to a discount on the eBook copy. Get in touch with us at service@packtpub.com for more details.
At www.PacktPub.com , you can also read a collection of free technical articles, sign up for a range of free newsletters, and receive exclusive discounts and offers on Packt books and eBooks.
Contributors
About the author
Rounak Banik is a Young India Fellow and an ECE graduate from IIT Roorkee. He has worked as a software engineer at Parceed, a New York start-up, and Springboard, an EdTech start-up based in San Francisco and Bangalore. He has also served as a backend development instructor at Acadview, teaching Python and Django to around 35 college students from Delhi and Dehradun.
He is an alumni of Springboard's data science career track. He has given talks at the SciPy India Conference and published popular tutorials on Kaggle and DataCamp.
I would like to thank Baran Toppare, my mentor for introducing and guiding me through the wonderfully exciting world of data science (and recommendation systems). I am also grateful to Parul Gupta and Indrajit Rajtilak, my former colleagues at Springboard. Without their support, I would not have pursued the Data Science Career Track and therefore, this book would never have been a reality.
Finally, I'd like to thank my family and all my friends in college for being a constant source of motivation, inspiration and support that enabled me to complete this book.
About the reviewer
Dr. S. Gowrishankar is currently working as associate professor in the department of computer science and engineering at Dr. Ambedkar Institute of Technology, Bengaluru, Karnataka, India. His current research interests are mainly focused on data science, including its technical aspects as well as its applications and implications. Specifically, he is interested in the applications of machine learning, data mining, and big data analytics in healthcare.
Image credits
Some of the images in the book are for illustrative purposes only. For more information refer to the links as follows:
- Support Vector Machines Explained, Tristan Fletcher (https://static1.squarespace.com/static/58851af9ebbd1a30e98fb283/t/58902fbae4fcb5398aeb7505/1485844411772/SVM+Explained.pdf
- How Are Principal Component Analysis And Singular Value Decomposition Related?, Andre Perunicic (https://intoli.com/blog/pca-and-svd/
- Random Forest Classifier Machine Learning, Balazs Holczer (http://www.globalsoftwaresupport.com/random-forest-classifier-bagging-machine-learning/)
- Machine Learning for Recommendation, Dr. Rana Forsati (http://www.cse.msu.edu/~forsati/)
- Approaches to analyse and interpret biological profile data, Matthias Scholz (http://phdthesis-bioinformatics-maxplanckinstitute-molecularplantphys.matthias-scholz.de/)
- K Means, Image Courtesy of Michael Jordan (http://stanford.edu/~cpiech/cs221/handouts/kmeans.html)
Packt is searching for authors like you
If you're interested in becoming an author for Packt, please visit authors.packtpub.com and apply today. We have worked with thousands of developers and tech professionals, just like you, to help them share their insight with the global tech community. You can make a general application, apply for a specific hot topic that we are recruiting an author for, or submit your own idea.
Preface
Recommendation systems are at the heart of almost every internet business today, from Facebook to Netflix to Amazon. Providing good recommendations, whether it's friends, movies, or groceries, goes a long way in defining user experience and enticing your customers to use and buy from your platform.
This book shows you how to do just that. You will learn about different kinds of recommenders used in the industry and see how to build them from scratch using Python. No need to wade through tons of linear algebra and machine learning theory, you'll get started with building and learning about recommenders as quickly as possible.
In this book, you will build an IMDB Top 250 clone, a content-based engine that works on movie metadata, collaborative filters that make use of customer behavior data, and a hybrid recommender that incorporates content-based and collaborative filtering techniques.
Next page