[Lesson 4] Going Further with CNNs (1)

- In the previous lesson, we built and trained a CNN to classify small grayscale images of articles of clothing from the fashon MNIST dataset.

- One of the great advantage of CNN's is that they can also work with color images.

- Later in the lesson, I'll build and train a convolutional neural network, that can classify color images of cats and dogs.

- Along the way, I'll also learn different techniques that can be used to manage a common problem with neural networks called overfitting.

- In real applications, we usually have to deal with high resolution color image of different sizes.

- In order to decrease the training time, we will only use a small subset of images to train our CNNs.

- When working with this dataset, we will face two main challenges.

The first chllenge will be working with images of different sizes.
The second chllenge will be working with color images.

- Since all of the images from the Fashion MNIST dataset have the same size, they all get flattened to 1D arrarys of the same size.

- Since neural networks need a fixed size input, just flattening the images won't work. Because flattening the images will give rise to one-dimensional arrays of different sizes.

- When doing image classification, we always solve this problem by resizing all the images to the same size.

- By resizing all images to the same size, this will guarantee that when we flatten the images, they'll result in 1D arrays of the same size.

- Computers interpret color images as three-dimensional arrays. The width and height will be determined by the height and width of the image, and the depth will by determined by the number of color channels.

- Most color images can be represented by three color channels namely red, green, blue. In RGB images, each color channel is represented by its own two-dimensional array.

- Now, Since our input image is going to be three-dimensional, we have to modify our code accordingly.

ex) input_shape=(150,150,3) <- the third number in the input shape parameter refers to the number of color channels. first two numbers refer to height and width of the input image.

저작자표시 (새창열림)

'인공지능(AI) > Udacity tensorflow 강의' 카테고리의 다른 글

[Lesson 4] Going Further with CNNs (3) (0)	2020.09.27
[Lesson 4] Going Further with CNNs (2) (0)	2020.09.27
[Lesson 3] Introduction to CNNs (2) (0)	2020.09.26
[Lesson 3] Introduction to CNNs (1) (0)	2020.09.26
[Lesson 2] Your First Model: Fashion MNIST (2) (0)	2020.09.25

Supremo

[Lesson 4] Going Further with CNNs (1)

'인공지능(AI) > Udacity tensorflow 강의' 카테고리의 다른 글

티스토리툴바

[Lesson 4] Going Further with CNNs (1)

'인공지능(AI) > Udacity tensorflow 강의' 카테고리의 다른 글

'인공지능(AI)/Udacity tensorflow 강의' Related Articles

티스토리툴바