Baseline model for nocaps benchmark, ICCV 2019 paper "nocaps: novel object captioning at scale".
nocaps
Baseline model for nocaps
benchmark, a re-implementation based on the
UpDown image captioning model trained on the COCO dataset (only),
and with added support of decoding using Constrained Beam Search.
If you find this code useful, please consider citing our paper, the paper which proposed original model, and EvalAI — the platform which hosts our evaluation server. All bibtex available in CITATION.md.
Extensive documentation available at nocaps.org/updown-baseline.
Use it as an API reference to navigate through and build on top of our code.
Pre-trained checkpoints with the provided configs in (configs
directory) are available to download:
.pth
file): updown.pthnocaps val
: updown_nocaps_val.jsonNote: While CBS is inference-only technique, it cannot be used on this checkpoint. CBS
requires models to have 300-dimensional froze GloVe embeddings, this checkpoint has 1000-
dimensional word embeddings which are learned during training.
in-domain | near-domain | out-of-domain | overall | ||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
CIDEr | SPICE | CIDEr | SPICE | CIDEr | SPICE | BLEU1 | BLEU4 | METEOR | ROUGE | CIDEr | SPICE |
78.1 | 11.6 | 57.7 | 10.3 | 31.3 | 8.3 | 73.7 | 18.3 | 22.7 | 50.4 | 55.3 | 10.1 |
.pth
file): updown_plus_cbs.pthNote: Since CBS is inference-only technique, this particular checkpoint can be used
without CBS decoding. It yields similar results to the UpDown Captioner trained using
learned word embeddings during training.
nocaps val
: updown_plus_cbs_nocaps_val_with_cbs.jsonin-domain | near-domain | out-of-domain | overall | ||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
CIDEr | SPICE | CIDEr | SPICE | CIDEr | SPICE | BLEU1 | BLEU4 | METEOR | ROUGE | CIDEr | SPICE |
78.6 | 12.1 | 73.5 | 11.5 | 68.8 | 9.8 | 75.8 | 17.5 | 22.7 | 51.1 | 73.3 | 11.3 |
nocaps val
: updown_plus_cbs_nocaps_val_without_cbs.jsonin-domain | near-domain | out-of-domain | overall | ||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
CIDEr | SPICE | CIDEr | SPICE | CIDEr | SPICE | BLEU1 | BLEU4 | METEOR | ROUGE | CIDEr | SPICE |
75.7 | 11.7 | 58.0 | 10.3 | 32.9 | 8.2 | 73.1 | 18.0 | 22.7 | 50.2 | 55.4 | 10.1 |