• Complain

Pramod Singh - Machine Learning with PySpark: With Natural Language Processing and Recommender Systems

Here you can read online Pramod Singh - Machine Learning with PySpark: With Natural Language Processing and Recommender Systems full text of the book (entire story) in english for free. Download pdf and epub, get meaning, cover and reviews about this ebook. year: 2021, publisher: Apress, genre: Computer. Description of the work, (preface) as well as reviews are available. Best literature library LitArk.com created for fans of good reading and offers a wide selection of genres:

Romance novel Science fiction Adventure Detective Science History Home and family Prose Art Politics Computer Non-fiction Religion Business Children Humor

Choose a favorite category and find really read worthwhile books. Enjoy immersion in the world of imagination, feel the emotions of the characters or learn something new for yourself, make an fascinating discovery.

Pramod Singh Machine Learning with PySpark: With Natural Language Processing and Recommender Systems
  • Book:
    Machine Learning with PySpark: With Natural Language Processing and Recommender Systems
  • Author:
  • Publisher:
    Apress
  • Genre:
  • Year:
    2021
  • Rating:
    4 / 5
  • Favourites:
    Add to favourites
  • Your mark:
    • 80
    • 1
    • 2
    • 3
    • 4
    • 5

Machine Learning with PySpark: With Natural Language Processing and Recommender Systems: summary, description and annotation

We offer to read an annotation, description, summary or preface (depends on what the author of the book "Machine Learning with PySpark: With Natural Language Processing and Recommender Systems" wrote himself). If you haven't found the necessary information about the book — write in the comments, we will try to find it.

Master the new features in PySpark 3.1 to develop data-driven, intelligent applications. This updated edition covers topics ranging from building scalable machine learning models, to natural language processing, to recommender systems.

Machine Learning with PySpark, Second Edition begins with the fundamentals of Apache Spark, including the latest updates to the framework. Next, you will learn the full spectrum of traditional machine learning algorithm implementations, along with natural language processing and recommender systems. Youll gain familiarity with the critical process of selecting machine learning algorithms, data ingestion, and data processing to solve business problems. Youll see a demonstration of how to build supervised machine learning models such as linear regression, logistic regression, decision trees, and random forests. Youll also learn how to automate the steps using Spark pipelines, followed by unsupervised models such as K-means and hierarchical clustering. A section on Natural Language Processing (NLP) covers text processing, text mining, and embeddings for classification. This new edition also introduces Koalas in Spark and how to automate data workflow using Airflow and PySparks latest ML library.

After completing this book, you will understand how to use PySparks machine learning library to build and train various machine learning models, along with related components such as data ingestion, processing and visualization to develop data-driven intelligent applications

What you will learn:

  • Build a spectrum of supervised and unsupervised machine learning algorithms
  • Use PySparks machine learning library to implement machine learning and recommender systems
  • Leverage the new features in PySparks machine learning library
  • Understand data processing using Koalas in Spark
  • Handle issues around feature engineering, class balance, bias and variance, and cross validation to build optimally fit models

Who This Book Is For

Data science and machine learning professionals.

Pramod Singh: author's other books


Who wrote Machine Learning with PySpark: With Natural Language Processing and Recommender Systems? Find out the surname, the name of the author of the book and a list of all author's works by series.

Machine Learning with PySpark: With Natural Language Processing and Recommender Systems — read online for free the complete book (whole text) full work

Below is the text of the book, divided by pages. System saving the place of the last page read, allows you to conveniently read the book "Machine Learning with PySpark: With Natural Language Processing and Recommender Systems" online for free, without having to search again every time where you left off. Put a bookmark, and you can go to the page where you finished reading at any time.

Light

Font size:

Reset

Interval:

Bookmark:

Make
Contents
Landmarks
Book cover of Machine Learning with PySpark Pramod Singh Machine Learning - photo 1
Book cover of Machine Learning with PySpark
Pramod Singh
Machine Learning with PySpark
With Natural Language Processing and Recommender Systems
2nd ed.
Logo of the publisher Pramod Singh Bangalore Karnataka India ISBN - photo 2
Logo of the publisher
Pramod Singh
Bangalore, Karnataka, India
ISBN 978-1-4842-7776-8 e-ISBN 978-1-4842-7777-5
https://doi.org/10.1007/978-1-4842-7777-5
Pramod Singh 2022
Apress Standard
The use of general descriptive names, registered names, trademarks, service marks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use.
The publisher, the authors and the editors are safe to assume that the advice and information in this book are believed to be true and accurate at the date of publication. Neither the publisher nor the authors or the editors give a warranty, express or implied, with respect to the material contained herein or for any errors or omissions that may have been made. The publisher remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This Apress imprint is published by the registered company APress Media, LLC part of Springer Nature.

The registered company address is: 1 New York Plaza, New York, NY 10004, U.S.A.

I dedicate this book to my wife Neha, my son Ziaan, and my parents. Without you guys, this book wouldnt have been possible. You complete my world and are the source of my strength.

Foreword

Businesses are swimming in data, yet many organizations are struggling to remain afloat in a rapidly expanding sea of data. Ever-increasing connectivity of data-transmitting devices is driving growth in how much and how fast data is being generated. This explosion in digitalization has been accompanied by a proliferation of applications and vendors. Companies often use multiple vendors for the same use cases and store data across multiple systems. Digital data lives in more and more formats and across fragmented data architecture layers.

In a world where data lives everywhere, many organizations are on a race to be more data-driven in their decision-making to get and stay ahead of the competition. The winners in this race proactively manage their data needs and leverage their data and analytics capabilities to drive tangible business outcomes.

The starting point for using data and analytics as a strategic tool is having good data. Take a B2B company that was looking to improve how it forecasted next months sales, for example. Each month this company would aggregate individual sales reps hot leads into a sales forecast that might be wide of the mark. Improving the underlying data quality required four changes to how the company captured customer insights from the front line. First, the company clearly defined at which stage in the sales funnel a lead should be tagged to the sales forecast. Second, it reduced the granularity of information that the sales force had to enter along with each sales lead. Third, it gave more time to the sales reps to complete the data entry. Fourth, it secured commitment from the sales force to improve the data quality. Along with ensuring a common understanding of the changes, the company was able to improve how it translated frontline customer insights into better data.

With better data in their arsenals, businesses need to orchestrate the data in a complex web of systems, processes, and tools. Despite having a state-of-the art CRM (customer relationship management) system, numerous spreadsheets were metronomically circulated across the organization to syndicate the companys sales forecast. The company improved how it orchestrated the sales forecast data in four ways. First, it consolidated disparate planning and tracking tools into a consistent view in the CRM system. Second, it secured commitment from the organization to use the CRM forecast as one source of truth. Third, it configured the access for all relevant stakeholders to view the sales forecast. Fourth, it compressed the organizational layers that could make adjustments as fewer layers brought fewer agendas in managing the sales forecast.

The effective use of data and analytics can help businesses make better decisions to drive successful outcomes. For instance, the company mentioned previously developed a system of sanity checks to assure the sales forecast. It used machine learning to classify individual sales reps into groups of optimistic and conservative forecasters. Machine learning helped predict expected conversion rates at an individual level and at an aggregate basis that the company could use to sense-check deviations from the forecast envelope, either to confirm adjustments to the sales forecast or trigger a review of specific forecast data. Better visibility on the sales forecast enabled the supply chain function to proactively move the companys products to the branches, where sales was forecasted to increase, and place replenishment orders with its suppliers. As a result, the company incurred fewer lost sales episodes due to stock-outs and achieved a more optimal balance between product availability and working capital requirements.

Building data and analytics capabilities is a journey that comes in different waves. Going back to the example, the company first concentrated on a few use cases such as improved sales forecasting that could result in better visibility, more valuable insights, and automation of work. It defined and aligned on changes to its operating model and data governance. It set up a center of excellence to help accelerate use cases and capability building. It built a road map for further use case deployment and capability development upskilling current employees, acquiring new talent, and exploring data and analytics partnerships to get the full value of its proprietary data assets.

Great data and analytics capabilities can eventually lead to data monetization. Companies should start with use cases that address a raw customer need. The most successful use cases typically center on improving customer experience (as well as supplier and frontline worker experience). Making the plunge to build data and analytics capabilities is hard; yet successful companies are able to muster the courage to ride the waves. They start with small wins and then scale up and amplify use cases to capture value and drive tangible business outcomes through data and analytics.

Introduction I am going to be very honest with you When I signed the contract - photo 3
Introduction

I am going to be very honest with you. When I signed the contract to write this second edition, I thought it would be a bit easier to write, but I couldn't have been more wrong about this assumption. It has taken me quite a significant amount of time to complete the chapters. What I have come to realize is that it's never easy to break down a thought process and put it on paper in the most convincing manner. There are so many retrials in that process, but what helped was the foundation block or the blueprint that was already established in the first edition of this book. The main challenge was to figure out how I could make this book more relevant and useful for the readers. I mean there are literally thousands of books on this subject already that this might just end up as another book on the shelf.

Next page
Light

Font size:

Reset

Interval:

Bookmark:

Make

Similar books «Machine Learning with PySpark: With Natural Language Processing and Recommender Systems»

Look at similar books to Machine Learning with PySpark: With Natural Language Processing and Recommender Systems. We have selected literature similar in name and meaning in the hope of providing readers with more options to find new, interesting, not yet read works.


Reviews about «Machine Learning with PySpark: With Natural Language Processing and Recommender Systems»

Discussion, reviews of the book Machine Learning with PySpark: With Natural Language Processing and Recommender Systems and just readers' own opinions. Leave your comments, write what you think about the work, its meaning or the main characters. Specify what exactly you liked and what you didn't like, and why you think so.