blob: 947683667b2554e0eaca034f411895c27e64ded2 (
plain) (
blame)
1
2
3
4
5
6
7
8
9
|
Dask is a flexible library for parallel computing in Python.
Dask is composed of two parts:
- Dynamic task scheduling optimized for computation. This is similar to Airflow,
Luigi, Celery, or Make, but optimized for interactive computational workloads.
- "Big Data" collections like parallel arrays, dataframes, and lists that extend
common interfaces like NumPy, Pandas, or Python iterators to
larger-than-memory or distributed environments. These parallel collections run
on top of dynamic task schedulers.
|