summaryrefslogtreecommitdiff
path: root/devel/py-dask/pkg-descr
blob: 947683667b2554e0eaca034f411895c27e64ded2 (plain) (blame)
1
2
3
4
5
6
7
8
9
Dask is a flexible library for parallel computing in Python.

Dask is composed of two parts:
- Dynamic task scheduling optimized for computation. This is similar to Airflow,
  Luigi, Celery, or Make, but optimized for interactive computational workloads.
- "Big Data" collections like parallel arrays, dataframes, and lists that extend
  common interfaces like NumPy, Pandas, or Python iterators to
  larger-than-memory or distributed environments. These parallel collections run
  on top of dynamic task schedulers.