Handling Large training dataset in machine learning