Udacity AI Programming with Python Nanodegree, DeepLearning Classifier
Project code for Udacity’s AI Programming with Python Nanodegree program. This code is used for classifying images in the predict.py
file with the model
you trained with resnet, alexnet or vgg previously on the train.py
file. In addition, there’s a trainning, testing and plotting exercise developed into a
notebook at the Image_Classifier_Proyect.ipynb
.
For downloading and testing out this proyect, you’ll have to make sure you’ve installed the following python packages:
Open the train.py
file in the terminal using the following sintax: (Just 1 line of code)python train.py --save_dir (path to the checkpoints.pth) --arch (choose the CNN model you want to use between "resnet)
--learning_rate (learning rate value) --hidden_units (hidden units number) --epochs (number of epochs)
--gpu (if --gpu is written, enables gpu usage)
Open the predict.py
file in the terminal using the following sintax: (Just 1 line of code)python predict.py input_image_pathfile checkpoint.pth_filepath --top_k (int of how many probability results are shown)
--category_names (json filepath to convert from categories index to labels/names) --gpu (if --gpu is written, enables gpu usage)
When openning the Image_Classifier_Proyect.ipynb
file, run every cell one-by-one.
For Image_Classifier_Proyect.ipynb
you should get this kind of results:
Training using resnet50:
Output units: 102
Validation accuracy result: 83.8%
Trainning loss result: 1.157
Testing accuracy result: 84.1%
Training using vgg16:
Output units: 102
Validation accuracy result: 80.6%
Trainning loss result: 1.786
Testing accuracy result: 74.4%
Training using alexnet:
Output units: 102
Validation accuracy result: 82.4%
Trainning loss result: 1.365
Testing accuracy result: 82.1%
nn.NLLLoss()
(Negative Log Likelihood Loss) criterion due to the fact that it is one of the best functions