Scalable Machine Learning with Dask

Scikit-Learn, NumPy, and pandas form a great toolkit for single-machine, in-memory analytics, but scaling them to larger datasets can be difficult.

In this webinar, watch Anaconda Data Scientist Tom Augspurger demonstrate how dask enables analysis of large datasets in parallel, using all the cores of your laptop or all the machines in your cluster.

Tom will highlight dask-ml, a library for scalable machine learning, and show you how dask-ml can train estimators on large datasets.

Meet Our Speaker

  • Tom Augspurger, Data Scientist

    Tom is a Data Scientist and developer at Anaconda and works on open source projects including dask and pandas. Tom’s current focus is on scaling out Python’s machine learning ecosystem to larger datasets and larger models.
*Required. Your information will be processed according to Anaconda’s Privacy Policy.