Revision 14 as of 2022-02-28 10:56:14
 * Alina Wróblewska. Polish Corpus of Annotated Descriptions of Images. Accepted for LREC 2018.

AIDe – Corpus of Annotated Image Descriptions

AIDe is a corpus of image descriptions in Polish. It consists of 2K natural language descriptions of 1K images. The descriptions are morphosyntactically analysed (part-of-speech tagged and dependency parsed), and pairs of descriptions are annotated for semantic relatedness and entailment. All annotations were provided by people with a strong linguistic background.
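As a rough illustration of how pair-level annotations of this kind might be used, the sketch below scores a system's predictions against gold relatedness scores and entailment labels. It is only a sketch under assumptions: the score range, the label names, and the choice of Pearson correlation and accuracy as metrics are hypothetical and are not taken from the AIDe documentation, and the data is made up for the example.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation between two equally long lists of numbers."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two lists of equal length with at least 2 items")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / sqrt(var_x * var_y)


def accuracy(gold, predicted):
    """Fraction of positions where the predicted label equals the gold label."""
    if len(gold) != len(predicted) or not gold:
        raise ValueError("need two non-empty lists of equal length")
    return sum(g == p for g, p in zip(gold, predicted)) / len(gold)


# Toy data only: hypothetical relatedness scores and entailment labels.
gold_relatedness = [1.0, 2.5, 4.0, 5.0]
system_relatedness = [1.2, 2.0, 3.8, 4.9]
gold_entailment = ["entailment", "neutral", "contradiction", "neutral"]
system_entailment = ["entailment", "neutral", "neutral", "neutral"]

print("relatedness (Pearson r):", round(pearson(gold_relatedness, system_relatedness), 3))
print("entailment accuracy:", accuracy(gold_entailment, system_entailment))
```

Replacing the toy lists with values read from the actual dataset files would require adapting the loading step to the format in which AIDe is distributed, which is not described here.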

The dataset can be used to evaluate various systems that integrate language and vision. It is suitable for evaluating systems designed to generate images from provided descriptions (text-to-image generation) or to generate captions from images (image-to-text generation). Furthermore, as the selected images are split into thematic groups based on WordNet, the dataset is also useful for validating image classification approaches.

Download

  • Dataset of annotated image descriptions: AIDe

  • Images: To obtain the images, please contact alina <at> ipipan.waw.pl (replace <at> with @).

Publication

List of publications

Licence

The resource is distributed under the CC BY-SA 4.0 licence.

Contact

To contact Alina Wróblewska, please write to alina <at> ipipan.waw.pl.

Acknowledgments

The development of the resource was funded by SONATA 8 grant no. 2014/15/D/HS2/03486 from the National Science Centre, Poland, and by the Polish Ministry of Science and Higher Education as part of the investment in the CLARIN-PL research infrastructure.