Deep Cross-Modal Projection Learning for Image-Text Matching
This is a Pytorch implmentation for the paper Deep Cross-Modal Projection Learning for Image-Text Matching.
The official implementation in TensorFlow can be found here.
data/processed
folder. Or you can use the file dataset/preprocess.py
to prepare your own data.pretrained_models
folder.You should firstly change the param model_path
to your current directory.
sh scripts/run.sh
You can directly run the code instead of performing training and testing seperately.
Or training:
sh scripts/train.sh
Or testing:
sh scripts/test.sh