dask
Distributed computing for larger-than-RAM pandas/NumPy workflows. Use when you need to scale existing pandas/NumPy code beyond memory or across clusters. Best for parallel file processing, distributed ML, integration with existing pandas code. For out-of-core analytics on single machine use vaex; for in-memory speed use polars.
Details
- Path
- skills/dask
- License
- BSD-3-Clause license
- Allowed tools
- 1
- Dependencies
- 3
Allowed tools
Read Write Edit Bash