A computer vision pipeline is a series of steps that most computer vision applications will go through. Many vision applications start off by acquiring images and data , then processing that data, performing some analysis and recognition steps, then finally performing an action. The general pipeline is pictured below:
Given sets of input images, computer vision techniques are used to process those images, identify images and characteristics of interest, and then use that data to recognize certain objects and interpret an image. In this section, we will be focusing on learning more about each individual step in this pipeline.
We will start by learning about how a single image is formed and digitally represented. Then we'll get some practicing on pre-processing techniques and selecting areas of interest in an image.
An image classifier is an algorithm that takes in an image as input and outputs a label or “class” that identifies that image. For example, a traffic sign classifier will look at different of roads and be able to identify whether that road contains humans, cars, bikes and so on. Distinguishing and classifying each image based on its contents.
There are many types of classifiers, used to recognize specific objects or even behaviors — like whether a person is walking or running — but they all involve a similar series of steps...