Skip to content

Instantly share code, notes, and snippets.

@jcoreyes
Last active January 21, 2018 10:41
Show Gist options
  • Save jcoreyes/7e76e90664f935c6f65d to your computer and use it in GitHub Desktop.
Save jcoreyes/7e76e90664f935c6f65d to your computer and use it in GitHub Desktop.
Image Captioning LSTM

##Information

name: LSTM image captioning model based on CVPR 2015 paper "Show and tell: A neural image caption generator" and code from Karpathy's NeuralTalk.

model_file:

model_weights:

license:

neon_version:

neon_commit:

gist_id:

##Description The LSTM model is trained on the flickr8k dataset using precomputed VGG features from http://cs.stanford.edu/people/karpathy/deepimagesent/. Model details can be found in the following CVPR-2015 paper:

Show and tell: A neural image caption generator.
O. Vinyals, A. Toshev, S. Bengio, and D. Erhan.  
CVPR, 2015 (arXiv ref. cs1411.4555)

##Performance Using the bleu score evaluation script from https://raw.githubusercontent.com/karpathy/neuraltalk/master/eval/ and evaluating against 5 reference sentences the results are:

BLEU Score
B-1 54.2
B-2 32.6
B-3 19.3
B-4 12.3
@nickzuck
Copy link

Access Denied on the model file

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment