In this project, I use PyTorch to implement an automatic image captioning system. It is the second project in the Udacity Computer Vision Nanodegree: Automatic Captioning of Images.
Install the dependencies:
pip install -r requirements.txt
The project implements the architecture introduced in the paper "Show and Tell: A Neural Image Caption Generator".
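To make the encoder-decoder idea concrete, here is a minimal PyTorch sketch of how image features can be fed to an LSTM that predicts caption tokens. It is an illustration under stated assumptions, not this repository's code: the class names `TinyEncoder` and `DecoderRNN`, the helper `demo_forward`, and all layer sizes are hypothetical, and the small convolutional encoder is only a stand-in for the pretrained CNN that the paper uses.

```python
import torch
import torch.nn as nn


class TinyEncoder(nn.Module):
    """Hypothetical stand-in for a pretrained CNN image encoder."""

    def __init__(self, embed_size):
        super().__init__()
        self.conv = nn.Conv2d(3, 8, kernel_size=3, padding=1)
        self.pool = nn.AdaptiveAvgPool2d(1)
        self.fc = nn.Linear(8, embed_size)

    def forward(self, images):
        x = torch.relu(self.conv(images))  # (batch, 8, H, W)
        x = self.pool(x).flatten(1)        # (batch, 8)
        return self.fc(x)                  # (batch, embed_size)


class DecoderRNN(nn.Module):
    """LSTM decoder that takes the image feature as its first input step."""

    def __init__(self, embed_size, hidden_size, vocab_size):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_size)
        self.lstm = nn.LSTM(embed_size, hidden_size, batch_first=True)
        self.fc = nn.Linear(hidden_size, vocab_size)

    def forward(self, features, captions):
        # Drop the last token so the output length matches the caption length
        # once the image feature is prepended as the first time step.
        embeddings = self.embed(captions[:, :-1])                       # (batch, T-1, embed)
        inputs = torch.cat([features.unsqueeze(1), embeddings], dim=1)  # (batch, T, embed)
        hiddens, _ = self.lstm(inputs)                                  # (batch, T, hidden)
        return self.fc(hiddens)                                         # (batch, T, vocab)


def demo_forward(batch_size=2, caption_len=5, vocab_size=20):
    """Run one forward pass on random data and return the token scores."""
    encoder = TinyEncoder(embed_size=16)
    decoder = DecoderRNN(embed_size=16, hidden_size=32, vocab_size=vocab_size)
    images = torch.randn(batch_size, 3, 32, 32)
    captions = torch.randint(0, vocab_size, (batch_size, caption_len))
    return decoder(encoder(images), captions)
```

Calling `demo_forward()` returns a tensor of shape `(batch_size, caption_len, vocab_size)`, one score per vocabulary word at each caption position, which is the form a cross-entropy loss over caption tokens would expect.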
Clone this repo: https://github.com/cocodataset/cocoapi
git clone https://github.com/cocodataset/cocoapi.git
Set up the COCO API (also described in the readme here):
cd cocoapi/PythonAPI
make
cd ..
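After the build, a quick sanity check can confirm that Python is able to find the COCO API package. The sketch below assumes the package is importable as `pycocotools` from your environment (for example, when run from `cocoapi/PythonAPI` or with that directory on the Python path); the helper name `is_module_available` is hypothetical.

```python
import importlib.util


def is_module_available(name):
    """Return True if the top-level module `name` can be found by Python."""
    return importlib.util.find_spec(name) is not None


if __name__ == "__main__":
    if is_module_available("pycocotools"):
        print("pycocotools is available")
    else:
        print("pycocotools was not found; check the build and your Python path")
```

This only checks that the module can be located; it does not import it or verify that the compiled extension works.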
Download the required data from http://cocodataset.org/#download (described below):
Under Annotations, download:
Under Images, download: