Now that you have worked through creating your own image processing algorithm, you are ready to begin working with machine learning image processing algorithms. This activity is walks you through two common machine learning models used for image processing so that you can familiarize yourself with the components of these models you have already experienced and how these components click together.
See the required components for two common ML image processing models: logistic regression model and ResNet50 transfer learning model.
Run two common ML image processing models on the same dataset (cat and dog classification) and understand the difference in performance between them.
Understand each of the components present within the logistic regression model and ResNet50 transfer learning model.
Access to a computer and large screen (if you want share with others)
A Google account to access Google Colab
Logistic Regression Model - A statistical model used in predicting the probability of a certain outcome based on a determined relationship between one or more variables, or predictors. In other words, the logistic regression model provides a categorical outcome based on a combination of input variables.
Transfer Learning Model - A machine learning model that uses a model which has been previously trained for a separate task to be used on a new, related task. In contrast to creating a completely new model, a transfer learning model has imported weights from its previous training and adapts these weights to better fit the new task at hand. Using a transfer learning model requires less training data and computational power.
ResNet50 Model - A well-known deep learning model composed of 50 layers that utilizes residual learning techniques to train its deep network efficiently. ResNet was created in 2015 by the Microsoft Research team. The ResNet50 model has been trained on the ImageNet image dataset for classification of more than 1,000 different objects.
ImageNet - An image dataset initially created by a group of researchers at Stanford and Princeton which has grown to contain over 14 million images representing about 80,000 different meaningful concepts, referred to as “synonym sets”.
Understanding what kind of machine learning model to use for what you are trying to accomplish is important to understand. Each type of machine learning model handles information differently and are specially designed to work well with certain tasks more than others, for example a CNN model works well with image classification as opposed to an LLM which primarily deals with language data and sentence structure suggestions and corrections.
In this section we will be looking at a transfer learning model, which uses another models training parameters integrated with its own specifications to vastly enhance the time and resources needed to train a similarly structured machine learning model, we will also look at a Logistic Regression Model which uses advanced math and statistical analysis to accurately predict something given two possible outcomes, in our case an object being a cat or a dog.
Understanding why these different models excel in what they are designed to do will help you understand what kind of machine learning model would best suite what information you have and what project are trying to accomplish. This will also allow you to understand what certain models are capable of and create a more achievable goal and better planned out project.
Take some time to read through the keywords and definitions above.
Read and work through the Logistic Regression Model - Colab Notebook, the ResNet50 Classification Model - Colab Notebook, and the corresponding Machine Learning Models Walkthrough - Exploration Guide.