machine learning and statistics for big data