Conceptual Captions: A Cleaned, Hypernymed, Image Alt-text Dataset For Automatic Image Captioning

Piyush Sharma(Google (United States)), Nan Ding(Google (United States)), Sebastian Goodman(Google (United States)), Radu Soricut(Google (United States))
Unknown
January 1, 2018
Cited by 1,795Open Access
Full Text

Abstract

We present a new dataset of image caption annotations, Conceptual Captions, which contains an order of magnitude more images than the MS-COCO dataset We achieve this by extracting and filtering image caption annotations from billions of webpages. We also present quantitative evaluations of a number of image captioning models and show that a model architecture based on Inception- ResNet-v2 (Szegedy et al., 2016) for image-feature extraction and Transformer


Related Papers

No related papers found

Powered by citation graph analysis