2021-22
This lab started out its life as a Keras practical developed for the VLC group based on Jason Brownlee’s “Handwritten Digit Recognition using Convolutional Neural Networks in Python with Keras” tutorial. It subsequently evolved as it was used in tutorials run for external organisations including Ordnance Survey, Lloyds Register and DISCNet.
This version is refocussed on using PyTorch with Torchbearer. A number of changes have been made to ensure that it better fits our format and I’ve also added additional information and exercises throughout.
Now we’ve seen how we can use PyTorch to work towards the solution of a handwriting recognition problem, we’ll turn our focus to data that is more realistic, using deep-learning models that are much closer to state of the art. The problem with using better models is that we need massive amounts of labelled data to train these models from scratch, and also large amounts of time (typically days of training, even using multiple GPUs). Rather than training from scratch we’ll explore using transfer learning and fine-tuning using pre-trained models. The pre-trained models that we’ll play with were trained using the ImageNet dataset, which consists of about 1.3 million images in 1000 classes.
Through this part of the tutorial you’ll learn how to:
To work through this lab you’ll use the Python 3 language in a Jupyter Notebook environment, with the pytorch
tensor library, the torchvision
utility library and the torchbearer
model training library. We’ll also use the matplotlib
, scikit-learn
and numpy
packages. We will primarily be using Google Colab to run the notebooks as this gives you access to an environment with all the tools required. If you wish to run the notebooks locally, see the information in the section below.
There is only a single (long) notebook this week to simplify the use of external datasets on Colab. Use the link to open directly in Google Colab (once opened you should immediately save a copy in your Google Drive otherwise anything you do will be lost once the browser closes), or to download locally.
6.1 Transfer Learning and Fine Tuning | preview | download |
You’ll need access to a computer with the following installed:
Python
(>= 3.6)notebook
(>=5.4.1)pytorch
(>= 0.4)torchvision
(>=0.2.1)torchbearer
(>= 0.2.6)NumPy
(>= 1.12.1)SciPy
(>= 1.0.1)scikit-learn
(>= 0.19.1)matplotlib
(>= 2.2.2)If you want to work on your own machine we recommend using the Anaconda python distribution. Running conda install pytorch torchvision -c pytorch
(see https://pytorch.org/get-started/locally/ for more options) will install both pytorch and torchvision. Torchbearer can be installed with pip install torchbearer
.