##Information
name: LSTM image captioning model based on CVPR 2015 paper "Show and tell: A neural image caption generator" and code from Karpathy's NeuralTalk.
model_file:
model_weights:
license:
neon_version:
neon_commit:
gist_id:
##Description
The LSTM model is trained on the Flickr8k dataset using precomputed VGG features from http://cs.stanford.edu/people/karpathy/deepimagesent/. Model details can be found in the following CVPR 2015 paper:

Show and tell: A neural image caption generator.
O. Vinyals, A. Toshev, S. Bengio, and D. Erhan.
CVPR, 2015 (arXiv:1411.4555)
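
To make the decoding idea concrete, here is a minimal sketch of greedy caption generation in the style of the paper: the image feature is projected into the word-embedding space and fed to the LSTM once as its first input, after which the model emits one word per step until an end token. This is an illustration only, not the neon implementation. All names (`greedy_caption`, `make_random_params`, `W_img`, `W_embed`, `W_lstm`, `W_out`), the `<start>`/`<end>` tokens, the gate ordering, and the omission of bias terms are assumptions made for brevity, and the weights are random rather than trained.

```python
import math
import random


def matvec(W, x):
    """Multiply a matrix (list of rows) by a vector (list of floats)."""
    return [sum(w * v for w, v in zip(row, x)) for row in W]


def rand_matrix(rows, cols, rng):
    """Small random matrix standing in for trained weights."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def lstm_step(W, x, h, c):
    """One LSTM step without biases. W has shape (4*H, len(x) + H);
    the gate order (input, forget, output, candidate) is an assumption."""
    H = len(h)
    z = matvec(W, x + h)  # x + h concatenates the two lists
    i = [sigmoid(v) for v in z[0:H]]
    f = [sigmoid(v) for v in z[H:2 * H]]
    o = [sigmoid(v) for v in z[2 * H:3 * H]]
    g = [math.tanh(v) for v in z[3 * H:4 * H]]
    c_new = [fk * ck + ik * gk for fk, ck, ik, gk in zip(f, c, i, g)]
    h_new = [ok * math.tanh(ck) for ok, ck in zip(o, c_new)]
    return h_new, c_new


def make_random_params(feat_dim, embed_dim, hidden, vocab_size, seed=0):
    """Hypothetical parameter container with untrained weights."""
    rng = random.Random(seed)
    return {
        "hidden": hidden,
        "W_img": rand_matrix(embed_dim, feat_dim, rng),    # image -> embedding
        "W_embed": rand_matrix(vocab_size, embed_dim, rng),  # one row per word
        "W_lstm": rand_matrix(4 * hidden, embed_dim + hidden, rng),
        "W_out": rand_matrix(vocab_size, hidden, rng),     # hidden -> word scores
    }


def greedy_caption(image_feat, params, vocab, max_len=20):
    """Feed the image once, then pick the highest-scoring word at each step."""
    H = params["hidden"]
    h, c = [0.0] * H, [0.0] * H
    # The projected image feature is the first (and only) image input.
    h, c = lstm_step(params["W_lstm"], matvec(params["W_img"], image_feat), h, c)
    word = vocab.index("<start>")
    caption = []
    for _ in range(max_len):
        h, c = lstm_step(params["W_lstm"], params["W_embed"][word], h, c)
        scores = matvec(params["W_out"], h)
        word = max(range(len(vocab)), key=scores.__getitem__)
        if vocab[word] == "<end>":
            break
        caption.append(vocab[word])
    return caption


if __name__ == "__main__":
    vocab = ["<start>", "<end>", "a", "dog", "runs", "on", "grass"]
    params = make_random_params(feat_dim=8, embed_dim=6, hidden=5,
                                vocab_size=len(vocab))
    fake_feature = [0.5] * 8  # stand-in for a precomputed VGG feature vector
    print(greedy_caption(fake_feature, params, vocab))
```

With random weights the output is meaningless; the point is only the order of operations. A trained model would load real weights and real VGG features, and a beam search could replace the greedy choice.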
##Performance
Using the BLEU score evaluation script from https://raw.githubusercontent.com/karpathy/neuraltalk/master/eval/ and evaluating against 5 reference sentences per image, the results are:

| BLEU | Score |
|---|---|
| B-1 | 54.2 |
| B-2 | 32.6 |
| B-3 | 19.3 |
| B-4 | 12.3 |
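
For readers unfamiliar with the metric, the following is a minimal sketch of corpus-level cumulative BLEU-n with multiple references: clipped n-gram precisions are combined by a geometric mean and multiplied by a brevity penalty. It is not the NeuralTalk evaluation script and will not necessarily reproduce the numbers in the table, since tokenization and other details can differ. The function names `ngrams` and `corpus_bleu` are hypothetical, and inputs are assumed to be pre-tokenized word lists.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Count all n-grams of length n in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def corpus_bleu(candidates, references, max_n=4):
    """Return [B-1, ..., B-max_n] on a 0-100 scale.

    candidates: list of token lists, one per image.
    references: list (one per image) of lists of reference token lists.
    """
    matched = [0] * max_n
    total = [0] * max_n
    cand_len = 0
    ref_len = 0
    for cand, refs in zip(candidates, references):
        cand_len += len(cand)
        # Use the reference whose length is closest to the candidate's
        # (ties go to the shorter reference).
        ref_len += min((abs(len(r) - len(cand)), len(r)) for r in refs)[1]
        for n in range(1, max_n + 1):
            cand_counts = ngrams(cand, n)
            # Each n-gram may be credited at most as often as it appears
            # in any single reference ("clipping").
            max_ref = Counter()
            for r in refs:
                for gram, count in ngrams(r, n).items():
                    max_ref[gram] = max(max_ref[gram], count)
            matched[n - 1] += sum(min(count, max_ref[gram])
                                  for gram, count in cand_counts.items())
            total[n - 1] += sum(cand_counts.values())
    if cand_len == 0:
        return [0.0] * max_n
    # Brevity penalty: penalize candidates shorter than the references.
    bp = 1.0 if cand_len > ref_len else math.exp(1 - ref_len / cand_len)
    scores = []
    log_sum = 0.0
    for n in range(max_n):
        if matched[n] == 0:
            # A zero precision makes this and all higher-order scores zero.
            scores.extend([0.0] * (max_n - n))
            break
        log_sum += math.log(matched[n] / total[n])
        scores.append(100 * bp * math.exp(log_sum / (n + 1)))
    return scores


if __name__ == "__main__":
    cands = ["a dog runs on the grass".split()]
    refs = [["a dog runs on the grass".split(),
             "a brown dog is running outside".split()]]
    print(corpus_bleu(cands, refs))
```

The sketch applies no smoothing, so a corpus with no matching 4-grams gets a B-4 of zero; evaluation scripts differ on how they handle that case.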