LG220: Deep Visual-Semantic Alignments for Generating Image Descriptions

Date: 
Friday, February 27, 2015 - 12:00
Speaker: 
Angel Cruz-Roa, PhD & Andrew Jancowitz, PhD

Also presenting...

Andrej Karpathy, Li Fei-Fei. Deep Visual-Semantic Alignments for Generating Image Descriptions. (Standford)

Vinyals, O., & Toshev, A. (n.d.). Show and Tell: A Neural Image Caption Generator. (Google)

Mao, J., Xu, W., Yang, Y., Wang, J., & Yuille, A. L. (2014). Explain Images with Multimodal Recurrent Neural Networks, 1–9.  (Baidu/UCLA)

Donahue, J., Saenko, K., Darrell, T., Austin, U. T., Lowell, U., & Berkeley, U. C. (n.d.). V. Long-term Recurrent Convolutional Networks for Visual Recognition and Description. (Berkeley)

Kiros, R., Salakhutdinov, R., & Zemel, R. S. (n.d.). Unifying Visual-Semantic Embeddings with Multimodal Neural Language Models.  (University of Toronto)