Dask is a native parallel analytics tool designed to integrate seamlessly with the libraries you're already using,
771 161 19MB
English Pages 296 Year 2019
Table of contents :
PART 1 - The Building Blocks of scalable computing
1. Why scalable computing matters
2. Introducing Dask
PART 2 - Working with Structured Data using Dask DataFrames
3. Introducing Dask DataFrames
4. Loading data into DataFrames
5. Cleaning and transforming DataFrames
6. Summarizing and analyzing DataFrames
7. Visualizing DataFrames with Seaborn
8. Visualizing location data with Datashader
PART 3 - Extending and deploying Dask
9. Working with Bags and Arrays
10. Machine learning with Dask-ML
11. Scaling and deploying Dask