Conference Proceeding

The REAL corpus: A crowd-sourced Corpus of human generated and evaluated spatial references to real-world urban scenes

Details

Citation

Bartie P, Mackaness W, Gkatzia D & Rieser V (2016) The REAL corpus: A crowd-sourced Corpus of human generated and evaluated spatial references to real-world urban scenes. In: Calzolari N, Choukri K, Mazo H, Moreno A, Declerck T T, Goggi S, Grobelnik M, Odijk J, Piperidis S, Maegaard B & Mariani J (eds.) Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016. 10th International Conference on Language Resources and Evaluation, LREC 2016, Portoroz, Slovenia, 23.05.2016-28.05.2016. Paris: European Language Resources Association (ELRA), pp. 2153-2155. http://www.lrec-conf.org/proceedings/lrec2016/pdf/1035_Paper.pdf

Abstract
We present a newly crowd-sourced data set of natural language references to objects anchored in complex urban scenes (in short: the REAL Corpus – Referring Expressions Anchored Language). The REAL corpus contains a collection of images of real-world urban scenes together with verbal descriptions of target objects generated by humans, paired with data on how successfully other people were able to identify the same object based on these descriptions. In total, the corpus contains 32 images, with on average 27 descriptions per image and 3 verifications for each description. In addition, the corpus is annotated with a variety of linguistically motivated features. The paper highlights issues posed by collecting data using crowd-sourcing with an unrestricted input format, as well as by using real-world urban scenes. The corpus will be released via the ELRA repository as part of this submission.

Keywords
Image Descriptions; Spatial Referring Expressions; Urban Scenes; Vision and Language

Status: Published
Publication date: 31/12/2016
Publication date online: 31/05/2016
URL: http://hdl.handle.net/1893/26431
Related URLs: http://hdl.handle.net/…rec-conf.org/en/
Publisher: European Language Resources Association (ELRA)
Publisher URL: http://www.lrec-conf.org/…f/1035_Paper.pdf
Place of publication: Paris
ISBN: 978-295174089-1
Conference: 10th International Conference on Language Resources and Evaluation, LREC 2016
Conference location: Portoroz, Slovenia
Dates