Daniel - Data science at scale with python and dask
Here you can read online Daniel - Data science at scale with python and dask full text of the book (entire story) in english for free. Download pdf and epub, get meaning, cover and reviews about this ebook. year: 2019, publisher: Manning Publications; Oreilly Media, genre: Home and family. Description of the work, (preface) as well as reviews are available. Best literature library LitArk.com created for fans of good reading and offers a wide selection of genres:
Romance novel
Science fiction
Adventure
Detective
Science
History
Home and family
Prose
Art
Politics
Computer
Non-fiction
Religion
Business
Children
Humor
Choose a favorite category and find really read worthwhile books. Enjoy immersion in the world of imagination, feel the emotions of the characters or learn something new for yourself, make an fascinating discovery.
- Book:Data science at scale with python and dask
- Author:
- Publisher:Manning Publications; Oreilly Media
- Genre:
- Year:2019
- Rating:5 / 5
- Favourites:Add to favourites
- Your mark:
Data science at scale with python and dask: summary, description and annotation
We offer to read an annotation, description, summary or preface (depends on what the author of the book "Data science at scale with python and dask" wrote himself). If you haven't found the necessary information about the book — write in the comments, we will try to find it.
Dask is a native parallel analytics tool designed to integrate seamlessly with the libraries youre already using, including Pandas, NumPy, and Scikit-Learn. With Dask you can crunch and work with huge datasets, using the tools you already have. AndData Science with Python and Daskis your guide to using Dask for your data projects without changing the way you work!
Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications. Youll find registration instructions inside the print book.
About the Technology
An efficient data pipeline means everything for the success of a data science project. Dask is a flexible library for parallel computing in Python that makes it easy to build intuitive workflows for ingesting and analyzing large, distributed datasets. Dask provides dynamic task scheduling and parallel collections that extend the functionality of NumPy, Pandas, and Scikit-learn, enabling users to scale their code from a single laptop to a cluster of hundreds of machines with ease.
About the Book
Data Science with Python and Daskteaches you to build scalable projects that can handle massive datasets. After meeting the Dask framework, youll analyze data in the NYC Parking Ticket database and use DataFrames to streamline your process. Then, youll create machine learning models using Dask-ML, build interactive visualizations, and build clusters using AWS and Docker.
Whats inside
Working with large, structured and unstructured datasets
Visualization with Seaborn and Datashader
Implementing your own algorithms
Building distributed apps with Dask Distributed
Packaging and deploying Dask apps
About the Reader
For data scientists and developers with experience using Python and the PyData stack.
About the Author
Jesse Danielis an experienced Python developer. He taught Python for Data Science at the University of Denver and leads a team of data scientists at a Denver-based media technology company.
Table of Contents
PART 1 - The Building Blocks of scalable computing
Why scalable computing matters
Introducing Dask
PART 2 - Working with Structured Data using Dask DataFrames
Introducing Dask DataFrames
Loading data into DataFrames
Cleaning and transforming DataFrames
Summarizing and analyzing DataFrames
Visualizing DataFrames with Seaborn
Visualizing location data with Datashader
PART 3 - Extending and deploying Dask
Working with Bags and Arrays
Machine learning with Dask-ML
Scaling and deploying Dask
Daniel: author's other books
Who wrote Data science at scale with python and dask? Find out the surname, the name of the author of the book and a list of all author's works by series.