LitArk » Books » Home and family

Tirthajyoti Sarkar - Productive and Efficient Data Science with Python: With Modularizing, Memory profiles, and Parallel/GPU Processing

Here you can read online Tirthajyoti Sarkar - Productive and Efficient Data Science with Python: With Modularizing, Memory profiles, and Parallel/GPU Processing full text of the book (entire story) in english for free. Download pdf and epub, get meaning, cover and reviews about this ebook. year: 2022, publisher: Apress, genre: Home and family. Description of the work, (preface) as well as reviews are available. Best literature library LitArk.com created for fans of good reading and offers a wide selection of genres:

Romance novel Science fiction Adventure Detective Science History Home and family Prose Art Politics Computer Non-fiction Religion Business Children Humor

Choose a favorite category and find really read worthwhile books. Enjoy immersion in the world of imagination, feel the emotions of the characters or learn something new for yourself, make an fascinating discovery.

Book:
Productive and Efficient Data Science with Python: With Modularizing, Memory profiles, and Parallel/GPU Processing
Author:
Tirthajyoti Sarkar
Publisher:
Apress
Genre:
Books / Home and family
Year:
2022
Rating:
5 / 5
Favourites:
Add to favourites
Your mark:
- 100
- 1
- 2
- 3
- 4
- 5

Description
Author's other books
Similar books

Productive and Efficient Data Science with Python: With Modularizing, Memory profiles, and Parallel/GPU Processing: summary, description and annotation

We offer to read an annotation, description, summary or preface (depends on what the author of the book "Productive and Efficient Data Science with Python: With Modularizing, Memory profiles, and Parallel/GPU Processing" wrote himself). If you haven't found the necessary information about the book — write in the comments, we will try to find it.

This book focuses on the Python-based tools and techniques to help you become highly productive at all aspects of typical data science stacks such as statistical analysis, visualization, model selection, and feature engineering.

Youll review the inefficiencies and bottlenecks lurking in the daily business process and solve them with practical solutions. Automation of repetitive data science tasks is a key mindset that is promoted throughout the book. Youll learn how to extend the existing coding practice to handle larger datasets with high efficiency with the help of advanced libraries and packages that already exist in the Python ecosystem.

The book focuses on topics such as how to measure the memory footprint and execution speed of machine learning models, quality test a data science pipelines, and modularizing a data science pipeline for app development. Youll review Python libraries which come in very handy for automating and speeding up the day-to-day tasks.

In the end, youll understand and perform data science and machine learning tasks beyond the traditional methods and utilize the full spectrum of the Python data science ecosystem to increase productivity.

What Youll Learn

Write fast and efficient code for data science and machine learning
Build robust and expressive data science pipelines
Measure memory and CPU profile for machine learning methods
Utilize the full potential of GPU for data science tasks
Handle large and complex data sets efficiently

Who This Book Is For

Data scientists, data analysts, machine learning engineers, Artificial intelligence practitioners, statisticians who want to take full advantage of Python ecosystem.

Tirthajyoti Sarkar: author's other books

Who wrote Productive and Efficient Data Science with Python: With Modularizing, Memory profiles, and Parallel/GPU Processing? Find out the surname, the name of the author of the book and a list of all author's works by series.

Productive and Efficient Data Science with Python: With Modularizing, Memory profiles, and Parallel/GPU Processing — read online for free the complete book (whole text) full work

Below is the text of the book, divided by pages. System saving the place of the last page read, allows you to conveniently read the book "Productive and Efficient Data Science with Python: With Modularizing, Memory profiles, and Parallel/GPU Processing" online for free, without having to search again every time where you left off. Put a bookmark, and you can go to the page where you finished reading at any time.

Light

Font size:

↓

↑

Reset

Interval:

↓

↑

Bookmark:

Make

Contents

Landmarks

Book cover of Productive and Efficient Data Science with Python

Tirthajyoti Sarkar

Productive and Efficient Data Science with Python

With Modularizing, Memory profiles, and Parallel/GPU Processing

Logo of the publisher

Dr. Tirthajyoti Sarkar

Fremont, CA, USA

ISBN 978-1-4842-8120-8 e-ISBN 978-1-4842-8121-5

https://doi.org/10.1007/978-1-4842-8121-5

Dr. Tirthajyoti Sarkar 2022

Apress Standard

The use of general descriptive names, registered names, trademarks, service marks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use.

The publisher, the authors and the editors are safe to assume that the advice and information in this book are believed to be true and accurate at the date of publication. Neither the publisher nor the authors or the editors give a warranty, expressed or implied, with respect to the material contained herein or for any errors or omissions that may have been made. The publisher remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This Apress imprint is published by the registered company APress Media, LLC, part of Springer Nature.

The registered company address is: 1 New York Plaza, New York, NY 10004, U.S.A.

Dedicated to the memory of my loving parents, Jyotirindra Nath Sarkar and Sarmistha Sarkar, who instilled in me the quest for knowledge and taught me the most valuable lessons of life

Introduction

Data science and machine learning can be practiced with various degrees of efficiency and productivity. This book focuses specifically on Python-based tools and techniques to help data scientists, beginners and seasoned professionals alike, become highly productive at all aspects of typical data science tasks.

This book is specifically intended for those who wish to leapfrog beyond the standard way of performing data science and machine learning tasks, and utilize the full spectrum of the Python data science ecosystem for a much higher level of productivity. You will be taught how to look out for inefficiencies and bottlenecks in the standard process and how to think beyond the box. Automation of repetitive data science tasks is a key mindset that you will develop from reading this book. In many cases, you will also learn how to extend existing coding practices to handle larger datasets, with high efficiency, with the help of advanced software tools that already exist in the Python ecosystem but are not taught in any standard data science book.

This is not a regular Python cookbook that teaches standard libraries like NumPy or Pandas. Rather, it focuses on useful techniques such as how to measure the memory footprint and execution speed of ML models, modularize a data science or deep learning task, write object-oriented code for a data science library or web app development, and so on. It also covers Python libraries, which come in handy for automating and speeding up the day-to-day tasks of any data scientist. Furthermore, it touches upon tools and packages that help a data scientist tackle large and complex datasets in a far more optimal way than what would have been possible by following standard Python data science technology wisdom.

If you take away a mentality of probing and measuring inefficiency in your data science code, and you learn tricks to discover effective solutions for those productivity issues, I will consider this book to be successful. This will be an immense reward for me.

Source Code

All source code used in this books examples can be downloaded from https://github.com/Apress/Sarkar_Productive-and-Efficient-Data-Science-with-Python

Any source code or other supplementary material referenced by the author in this book is available to readers on GitHub (https://github.com/Apress/Sarkar_Productive-and-Efficient-Data-Science-with-Python). For more detailed information, please visit www.apress.com/source-code.

Acknowledgments

This book has been a great journey for me, and it will not be complete without acknowledging some of the people who helped me in this quest.

First, I would like to thank my editor, Aditee Mirashi, who guided me patiently on the authoring process and its specifics as this is my first collaboration with Apress. She has been unfailingly helpful and understanding while I navigated through the chapters and technical reviews.

I would like to acknowledge some of the open-source developers and data science communicators whose work I have cited or used in various chapters, with their kind permissions. Khyuen Tran has contributed greatly to the community by publishing efficient data science tricks (with Python) and I have had the pleasure of discussing these ideas with her. Her work is cited in Chapter .

My wife, Chitrita Chakravarti, an accomplished DataOps Solutions Architect herself, has provided support both professionally and personally while I was working on this book. She deserves my sincere gratitude.

Lastly, I am eternally grateful to all my friends and professional connections, especially on LinkedIn, who always had kind and encouraging words for me when I described the painstaking process of working through this project. Their support and words have been a primary source of motivation.

Table of Contents

About the Author

Tirthajyoti Sarkar

lives and works in the San Francisco Bay area California He currently serves - photo 3

lives and works in the San Francisco Bay area, California. He currently serves as the Senior Director, AI/ML Platform at Rhombus Power Inc. where he builds solutions for problems of vital national and global importance using AI, data, and mathematics.

Most recently, he worked as a Data Science Manager at a startup developing an edge-computing platform for the semiconductor manufacturing industry. Before that, he spent more than a decade in the semiconductor and electronics industries where he developed power semiconductor technology and applied artificial intelligence and machine learning techniques for design automation and product innovation. Dr. Sarkar regularly publishes AI and data science articles on top online platforms and teaches machine learning in various workshops and forums. He has published 30+ papers in IEEE and holds multiple US patents. He has authored two data science books. Dr. Sarkar is a Senior Member of IEEE, a former Chair of the Semiconductor Committee of the PSMA (the worlds largest power supply organization consortium), and an Industry Advisory Member for ValleyML, a non-profit AI/ML organization. He holds a Ph.D. in Electrical Engineering from the University of Illinois at Chicago and an MS in Data Analytics from Georgia Tech.

About the Technical Reviewer

Joos Korstanje

is a data scientist with over five years of industry experience in developing machine learning tools, a large part of which are forecasting models. He currently works at Disneyland Paris where he develops machine learning for a variety of tools.

Light

Font size:

↓

↑

Reset

Interval:

↓

↑

Bookmark:

Make

Similar books «Productive and Efficient Data Science with Python: With Modularizing, Memory profiles, and Parallel/GPU Processing»

Look at similar books to Productive and Efficient Data Science with Python: With Modularizing, Memory profiles, and Parallel/GPU Processing. We have selected literature similar in name and meaning in the hope of providing readers with more options to find new, interesting, not yet read works.

Nathan George

Practical Data Science with Python: Learn tools and techniques from hands-on examples to extract insights from data

Alex Campbell

Python for Data Science : Clear and Complete Guide to Data Science and Analysis with Python

Chopra Rohan

Data Science with Python: Combine Python with machine learning principles to discover hidden patterns in raw data

Madhavan

Mastering Python for Data Science

Jason Test

PYTHON FOR DATA SCIENCE: Guide to computer programming and web coding. Learn machine learning, artificial intelligence, NumPy and Pandas packages for data analysis. Step-by-step exercises included.

Steve Blair

Python Data Science: The Ultimate Handbook for Beginners on How to Explore NumPy for Numerical Data, Pandas for Data Analysis, IPython, Scikit-Learn and Tensorflow for Machine Learning and Business

Jesse C. Daniel

Data Science with Python and Dask

Dr. Ossama Embarak

Data Analysis and Visualization Using Python: Analyze Data to Create Visualizations for BI Systems

Luca Massaron

Python for Data Science For Dummies

Sibanjan Das

Hands-On Automated Machine Learning

Jake VanderPlas

Python Data Science Handbook: Essential Tools for Working with Data

Alberto Boschetti

Python Data Science Essentials - Learn the fundamentals of Data Science with Python

Reviews about «Productive and Efficient Data Science with Python: With Modularizing, Memory profiles, and Parallel/GPU Processing»

Discussion, reviews of the book Productive and Efficient Data Science with Python: With Modularizing, Memory profiles, and Parallel/GPU Processing and just readers' own opinions. Leave your comments, write what you think about the work, its meaning or the main characters. Specify what exactly you liked and what you didn't like, and why you think so.