Parallelising Python on Spark: Options for concurrency with Pandas | Towards Data Science

Source: Towards Data Science

In my previous role, I spent some time working on an internal project to predict future disk storage usage for our Managed Services customers across thousands of disks. Each disk is subject to its own usage patterns, which means we need a separate machine […]