Introduction
Laymen explanation
Technical explanation
The trick in processing large datasets meaningfully is to slice and dice the dataset into meaningful subsets to work around system limits or allow you to divide and conquer with multiple people
Tips
when we slice and dice is, each list is completely disjoint. If we do not do that, we will end up with the same record in multiple lists.
It is important to decide on filtering criteria
There can be filtering criteria which looks disjoint, but can result in the duplicate entry in multiple list. It could be due to same entry fulfilling multiple criteria
Reference
https://www.linkedin.com/pulse/art-slicing-dicing-data-sets-anoop-john/