This is diagram of the container. We put python library(numpy, pandas, sklearn), our dataset(malware analysis dataset) and python code into one single container. By doing this, we do not need to pay attention on configuration and uploading dataset.
After successful execution of that cell you should be able to see the same result like the following picture. Next step, click choose upload malware analysis dataset (top_1000_pe_imports.csv) into google colab.
This is the link for malware analysis dataset. You need to agree with the terms and conditions, and require a Gmail account.
https://www.kaggle.com/ang3loliveira/malware-analysis-datasets-top1000-pe-imports
It might take about 5-10 mins to upload this file because this dataset is large(284808 sample provided).
Dockerfile(Docker file)
lr1.py(python code for logistic regression)
We put all the things into one container and make it become an image