The dataset for this research was compiled from two separate sources. The data from the first source, Kaggle, will be used to train the model, while data from the second source, NewsAPI, will be used to test it. The training dataset is divided into two parts: true news and fake news. Combining these two files yields a complete dataset. The testing dataset provided via API is in JSON format, and it will be cleaned by deleting extraneous columns after converting it into csv format. The link to the original Kaggle dataset can be found here and here.