Back to Blog

Computer vision series: Introduction to CNN for Visual Recognition

Computer vision series: Introduction to CNN for Visual Recognition

History

Benchmark Datasets

  • Pascal VOC
  • Imagenet : 22k categories, 14M images

Imagenet Large scale visual recognition challenge 1000 Object Classes, Large number of images 1.4M images in an experiment error rate has decreased a lot. That's a lot of progress made

In 2012, error rate decreased significantly due to CNN (Convolutional Neural Network) a deep learning

Most important problem of Visual recognition - image classification

In image classification - your algorithim looks at an image, and then picks from among some fixed set of categories to classify the image. This relatively basic tool of image classification is super useful and can be applied all over the place for many different applications.

There is a number of visual recognition problems that are related to image classification, such as object detection, image captioning.

The setup in object detection is a little bit different. Rather than classifying an entire image as cat or dog or whatsnot. Instead we want to go in and draw bounding boxes and say there is a dog here, and a cat here and a car over in the background and draw these boxes describing where objects are in the image.

Image Captioning: ...

CNNs aka ConvNets

CNN have become an important tool for object recognition. learn more about CNN (Deep dive)

Layers of network. AlexNet - 7 to 8 layer 2014 GoogleNet VGG - 19 layers

The more layer, the better. i.e 200 layers but you will run out of GPU (discuss later)

**Main takeway: ** CNN really had this breakthrough moment in 2012, since then there's been a lot of effort focued in tuning and tweaking this algorithims to make them perform better and better on this problem of image classification. We will dive deep exactly how these different models work.

CNNs were not invented overnight. Why it's popular in recent Years? Faster and faster computers, GPUs (Thanks to moore's law). It allowed researcher to explore larger architecture and models. It's super important in the history of CNN.

Other keywords: Style Transfer