In this post we will see how to use transfer learning (where we use the information/patterns that a model has learned in one task to solve another similar task) .

We will use a pretrained ResNet model trained on ImageNet dataset to learn and classify images in the CIFAR-100 dataset.

We will use a ResNet34 pretrained model from https://github.com/qubvel/classification_models

We will use Resnet34 model to try and achieve 80% validation accuracy . Since pretrained weights are only available for imagenet and models expect a 224x224 image size , we will resize the cifar100 images to 224x224 while training .

In the pretrained model we will remove the top prediction layers and freeze the last 11 layers . We will add a GlobalAveragepooling2D layer , a dense layer and a softmax activation to form our prediction layer for cifar100. The first part will be to train with the frozen layers in base model . After training for about 30 epochs , we will unfreeze the layers and train further .

Install the required files from qubvel keras applications project in order to get the pretrained ResNet model

!pip install git+https://github.com/qubvel/classification_models.git

Import necessary keras modules , numpy and matplotlib

from keras import backend as K
import time
import matplotlib.pyplot as plt
import numpy as np
% matplotlib inline
np.random.seed(2017) 
#from keras.models import Sequential
from keras.layers import Conv2D, MaxPooling2D, GlobalAveragePooling2D
from keras.layers import Activation, Flatten, Dropout
from keras.layers import BatchNormalization
from keras.utils import np_utils

import os

import ResNet34 and image preprocessing from the project we installed earlier

import keras
import cv2
#from classification_models.resnet import ResNet34, preprocess_input
from classification_models.keras import Classifiers
ResNet34, preprocess_input = Classifiers.get('resnet34')

get cifar100 dataset from keras datasets

from keras.datasets import cifar100
(train_features, train_labels), (test_features, test_labels) = cifar100.load_data()
num_train, img_channels, img_rows, img_cols =  train_features.shape
num_test, _, _, _ =  test_features.shape
num_classes = len(np.unique(train_labels))

Downloading data from https://www.cs.toronto.edu/~kriz/cifar-100-python.tar.gz
169009152/169001437 [==============================] - 4s 0us/step

preprocess the images to make sure that they are in the format required by the pretrained model

train_features = preprocess_input(train_features)

test_features = preprocess_input(test_features)

print max and min pixel values in the images which we can use in the ramdom-erase/cutout augmentation later

print(np.max(train_features),np.min(train_features))

255 0

Store cifar100 train and test images in a local data folder. We will load these images using an imagedatagenerator and resize to 224x224 which is default size for Resnet-imagenet models

!rm -R ./data/  # remove old data direrctory to clean up

sub_dir='train'
data_dir='./data'
if not os.path.exists(data_dir):
  os.mkdir(data_dir)
image_dir='./data/'+sub_dir+'/'
if not os.path.exists(image_dir):
  os.mkdir(image_dir)

def save_img(images,sub_dir):
  c=0
  os.chdir('/content/')
  curr_dir = os.getcwd()
  image_dir='./data/'+sub_dir+'/'
  if not os.path.exists(image_dir):

    os.mkdir(image_dir)
  os.chdir(image_dir)
  print('current working directory is '+os.getcwd())
  for img in images:
    c +=1
    filename=str(c)+'.jpg'
    
    
    cv2.imwrite(filename,img)
  print("files resized and saved to "+image_dir)
  os.chdir(curr_dir)
  print('current working directory is '+os.getcwd())

save_img(train_features,'train')

current working directory is /content/data/train
files resized and saved to ./data/train/
current working directory is /content

save_img(test_features,'test')

current working directory is /content/data/test
files resized and saved to ./data/test/
current working directory is /content

!ls ./data

test  train

Mount google drive to save best model while training

from google.colab import drive 
drive.mount('/gdrive',force_remount=True)

Import pandas and create a dataframe with image files and labels information. We will use this dataframe with Keras imagedatagenerator to load images for training and testing and calculate loss using the corresponding label values

import pandas as pd

def form_df(label_type='train'):
  if label_type=='train':
    labels=train_labels
  else:
    labels=test_labels  

  file_name=[]
  class_label=[]
  for i in range(len(labels)):
    filename=str(i+1)+'.jpg'
    file_name.append(filename)
    class_label.append(str(labels[i][0]))

  df=pd.DataFrame({'File':file_name,'Class':class_label})  
  return df

train_df=form_df('train')
print(train_df.head())
train_df.info()

    File Class
0  1.jpg    19
1  2.jpg    29
2  3.jpg     0
3  4.jpg    11
4  5.jpg     1
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 50000 entries, 0 to 49999
Data columns (total 2 columns):
File     50000 non-null object
Class    50000 non-null object
dtypes: object(2)
memory usage: 781.4+ KB

train_df.tail()

test_df=form_df('test')
print(test_df.head())
print(test_df.tail())
print(test_df.info())

    File Class
0  1.jpg    49
1  2.jpg    33
2  3.jpg    72
3  4.jpg    51
4  5.jpg    71
           File Class
9995   9996.jpg    83
9996   9997.jpg    14
9997   9998.jpg    51
9998   9999.jpg    42
9999  10000.jpg    70
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 10000 entries, 0 to 9999
Data columns (total 2 columns):
File     10000 non-null object
Class    10000 non-null object
dtypes: object(2)
memory usage: 156.4+ KB
None

Custom function for random-pad-crop augmentation

def pad4(img):
  pad_size=img.shape[1]//8
  img=np.pad(img, [ (pad_size, pad_size), (pad_size, pad_size), (0, 0)], mode='reflect')  
  return img 


def random_pad_crop_img(img,crop_size=224):
  crop_size=img.shape[1]
  img=pad4(img)
  pad=img.shape[1]-crop_size
  x1=np.random.randint(pad)
  x2=x1+crop_size
  y1=np.random.randint(pad)
  y2=y1+crop_size
  img=img[x1:x2,y1:y2,:]
  return img

We will now get the ResNet34 model weights for imagenet (Cifar is not available in this library).

input shape set to 224,224,3

Add GlobalAveragePooling to convert these to 1D inputs suitable for the softmax prediction layer

Add a Dense Layer instead of the one we removed from the pretrained model

Add softmax prediction

for the first train run we will freeze the all layers of the pretrained model except the last 11 layers

# build model
from keras.layers import GlobalAveragePooling2D, Add, Lambda, Dense, GlobalMaxPooling2D

#base modek from REsnet34 
base_model = ResNet34(input_shape=(224,224,3), weights='imagenet', include_top=False)

#Freeze all but last 11 layers 
for layer in base_model.layers[:-11]:
  layer.trainable=False
for layer in base_model.layers:
    print(layer, layer.trainable) 

#Add our own Top/Prediction layers 
x = GlobalAveragePooling2D()(base_model.output)



x= Dense(num_classes,use_bias=False)(x)

output = keras.layers.Activation('softmax')(x)

model = keras.models.Model(inputs=[base_model.input], outputs=[output])

Compile the model using Stochastic Gradient descent optimizer with momentum of 0.9 and lr of 0.015

from keras.optimizers import SGD
opt=SGD(lr=0.015,  momentum=0.9, nesterov=True)
model.compile(optimizer=opt, loss='categorical_crossentropy', metrics=['accuracy'])

WARNING:tensorflow:From /usr/local/lib/python3.6/dist-packages/keras/optimizers.py:793: The name tf.train.Optimizer is deprecated. Please use tf.compat.v1.train.Optimizer instead.

WARNING:tensorflow:From /usr/local/lib/python3.6/dist-packages/keras/backend/tensorflow_backend.py:3576: The name tf.log is deprecated. Please use tf.math.log instead.

we want to get the model with best validation accuracy for the prediction task and so we will save the best model from the various epochs in Google Drive using ModelCheckpoint callback available in Keras

define a Modelcheckpoint to save the best Model

from keras.callbacks import ModelCheckpoint

model_save_path='/gdrive/My Drive/EVA/session20/best_model2.h5'

chkpoint_model=ModelCheckpoint(model_save_path, monitor='val_acc', verbose=1, save_best_only=True, save_weights_only=False, mode='max')

Cutout Augmentation

Cutout was first presented as an effective augmentation technique in these two papers :

Improved Regularization of Convolutional Neural Networks with Cutout and Random Erasing Data Augmentation

The idea is to randomly cut away patches of information from images that a model is training on to force it to learn from more parts of the image. This would help the model learn more features about a class instead of depending on some simple assumptions using smaller areas within the image . This helps the model generalize better and make better predictions .

We will use python code for random erasing found at https://github.com/yu4u/cutout-random-erasing

#get code for random erasing from https://github.com/yu4u/cutout-random-erasing
!wget https://raw.githubusercontent.com/yu4u/cutout-random-erasing/master/random_eraser.py

Train the model for 100 epochs using a batch size of 128 . We will use a ImageDataGenerator to apply image augmentation of random-pad-crop, horizontal Flip and CutOut augmentation for the training

from random_eraser import get_random_eraser
eraser = get_random_eraser(p=0.8, s_l=0.15, s_h=0.25,r_1=0.5, r_2=1/0.5,v_l=0,v_h=255,pixel_level=False)
def img_aug1(img):
  
  
  
  img=random_pad_crop_img(img)
  img=eraser(img)
  return img

def scheduler(epoch):
  if epoch < 30:
    return 0.01
  elif 30 < epoch < 50: 
    return 0.008 
  else:
    return 0.008 * tensorflow.math.exp(0.1 * (50 - epoch))

lr_callback = keras.callbacks.LearningRateScheduler(scheduler)

from tensorflow.keras.preprocessing.image import ImageDataGenerator

EPOCHS=100
batch_size=128

train_datagen=ImageDataGenerator(
    
        
        
        preprocessing_function=img_aug1,
        horizontal_flip=True
    
)

val_datagen= ImageDataGenerator(
    
        
)



training_generator = train_datagen.flow_from_dataframe(train_df, directory='./data/train/', 
                                                         x_col='File', y_col='Class', target_size=(224, 224),
                                                    color_mode='rgb', interpolation='bicubic',
                                                    class_mode='categorical', 
                                                    batch_size=batch_size, shuffle=True, seed=42)
validation_generator = val_datagen.flow_from_dataframe(test_df, directory='./data/test/',
                                                         x_col='File', y_col='Class', 
                                                         target_size=(224, 224),interpolation='bicubic',
                                                    color_mode='rgb', class_mode='categorical', 
                                                    batch_size=batch_size, shuffle=True, seed=42)

Found 50000 validated image filenames belonging to 100 classes.
Found 10000 validated image filenames belonging to 100 classes.

def scheduler(epoch):
  if epoch < 5:
    return 0.02
  elif 5 < epoch < 12: 
    return 0.015 
  elif 12 < epoch < 20: 
    return 0.010
  elif 20 < epoch < 25: 
    return 0.007      
  else:
    return 0.003

lr_callback = keras.callbacks.LearningRateScheduler(scheduler)

model.fit_generator(training_generator, epochs=30, 
                        steps_per_epoch=np.ceil(train_features.shape[0]/batch_size), 
                    validation_steps=np.ceil(test_features.shape[0]/batch_size), 
                    validation_data=validation_generator,
                                 shuffle=True,
                                callbacks=[chkpoint_model,lr_callback],
                                 verbose=1)

After the initial 30 epochs of training the last few layers , now unfreeze all the layers and train again for 100 epochs

Unfreeze all layers in base model

for layer in model.layers:
  layer.trainable=True

opt=SGD(lr=0.01,  momentum=0.9, nesterov=True)
model.compile(optimizer=opt, loss='categorical_crossentropy', metrics=['accuracy'])

import math

def scheduler1(epoch):
  if epoch < 15:
    return 0.01
  elif 15 < epoch < 30: 
    return 0.008 
  else:
    return 0.008 * math.exp(0.1 * (30 - epoch))

lr_callback = keras.callbacks.LearningRateScheduler(scheduler1)

model.fit_generator(training_generator, epochs=EPOCHS, 
                        steps_per_epoch=np.ceil(train_features.shape[0]/batch_size), 
                    validation_steps=np.ceil(test_features.shape[0]/batch_size), 
                    validation_data=validation_generator,
                                 shuffle=True,
                                callbacks=[chkpoint_model,lr_callback],
                                 verbose=1)

Val accuracy reached 80.23 at the end of 35th epoch and 81.31 at the end of 100 epochs .We have aleady reached our target of 80% val accuracy . Let us train another 100 epochs to see how much further we can push this validation accuracy

def scheduler2(epoch):
  if epoch < 15:
    return 0.002
  elif 15 < epoch < 30: 
    return 0.001 
  elif 13 < epoch < 50: 
    return 0.0005   
  else:
    return 0.0005 * math.exp(0.5 * (50 - epoch))

lr_callback = keras.callbacks.LearningRateScheduler(scheduler2)

opt=SGD(lr=0.002,  momentum=0.9, nesterov=True)
model.compile(optimizer=opt, loss='categorical_crossentropy', metrics=['accuracy'])

train_datagen=ImageDataGenerator(
    
        
        
        #preprocessing_function=img_aug2,
        horizontal_flip=True,width_shift_range=0.05, height_shift_range=0.05
    
)

val_datagen= ImageDataGenerator(
    
        
)



training_generator = train_datagen.flow_from_dataframe(train_df, directory='./data/train/', 
                                                         x_col='File', y_col='Class', target_size=(224, 224),
                                                    color_mode='rgb', interpolation='bicubic',
                                                    class_mode='categorical', 
                                                    batch_size=batch_size, shuffle=True, seed=42)
validation_generator = val_datagen.flow_from_dataframe(test_df, directory='./data/test/',
                                                         x_col='File', y_col='Class', 
                                                         target_size=(224, 224),interpolation='bicubic',
                                                    color_mode='rgb', class_mode='categorical', 
                                                    batch_size=batch_size, shuffle=True, seed=42)

Found 50000 validated image filenames belonging to 100 classes.
Found 10000 validated image filenames belonging to 100 classes.

model.fit_generator(training_generator, epochs=EPOCHS, 
                        steps_per_epoch=np.ceil(train_features.shape[0]/batch_size), 
                    validation_steps=np.ceil(test_features.shape[0]/batch_size), 
                    validation_data=validation_generator,
                                 shuffle=True,
                                callbacks=[chkpoint_model,lr_callback],
                                 verbose=1)

Runtime disconnected after 27 epochs . Val accuracy has reached 81.52 . We will stop here although we could load the model again and train for more epochs to see how much farther we could go.

Load the model saved best model from google drive

model= keras.models.load_model('/gdrive/My Drive/EVA/session20/best_model2.h5')

Evaluate and print validation loss and validation accuracy

score=model.evaluate_generator(validation_generator)

print('validation loss =',score[0] , ', Validation accuracy =',score[1])

validation loss = 0.7847665718078614 , Validation accuracy = 0.8152

We used the technique of Transfer Learning and fine-tuned a pre-trained a ResNet34 model with Imagenet weights to classify images in the CIFAR100 dataset. In order to achieve this we added our own prediction layer on top of the base model and trained it to achieve 81.52 max validation accuracy .

	File	Class
49995	49996.jpg	80
49996	49997.jpg	7
49997	49998.jpg	3
49998	49999.jpg	7
49999	50000.jpg	73