Software Engineering Daily

Software Engineering Daily


Modin: Pandas Scalability with Devin Petersohn

July 23, 2020

Pandas is a Python data analysis library, and an essential tool in data science. Pandas allows users to load large quantities of data into a data structure called a dataframe, over which the user can call mathematical operations. When the data fits entirely into memory this works well, but sometimes there is too much data