Meetup #25

Python and Dask: Scaling the DataFrame

Python's most popular data science libraries—pandas, numpy, and scikit-learn—were designed to run on a single computer, and in some cases, using a single processor. Whether this computer is a laptop or a server with 96 cores, your compute and memory are constrained by the size of the biggest computer you have access to. In this course, you'll learn how to use Dask, a Python library for parallel and distributed computing, to bypass this constraint by scaling our compute and memory across multiple cores. Dask provides integrations with Python libraries like pandas, numpy, and scikit-learn so you can scale your computations without having to learn completely new libraries or significantly refactoring your code.


- Understand the options for installing and running Dask on your laptop or a cluster - When to use the different APIs provided by Dask - Accelerate data science and engineering workflows using Dask

In this episode

Daniel  Gerlanc

Daniel Gerlanc

President and Founder, Enplus Advisor's, Inc.

Daniel Gerlanc has worked as a data scientist for more than decade and written software professionally for 15 years. He spent 5 years as a quantitative analyst with two Boston hedge funds before starting Enplus Advisors. At Enplus, he works with clients on data science and custom software development with a particular focus on projects requiring expertise in both areas. He teaches data science and software development at introductory through advanced levels. He has co-authored several open source R packages, published in peer-reviewed journals, and is active in local predictive analytics groups.



Demetrios Brinkmann

Demetrios Brinkmann


Demetrios is one of the main organizers of the MLOps community and currently resides in a small town outside Frankfurt, Germany. He is an avid traveller who taught English as a second language to see the world and learn about new cultures. Demetrios fell into the Machine Learning Operations world, and since, has interviewed the leading names around MLOps, Data Science, and ML. Since diving into the nitty-gritty of Machine Learning Operations he felt a strong calling to explore the ethical issues surrounding ML. When he is not conducting interviews you can find him making stone stacking with his daughter in the woods or playing the ukulele by the campfire.