Con-Text: Fine-Grained Object Classification Using Scene Text

Dataset

The Con-Text dataset is built from sub-categories of the ImageNet "building" and "place of business" sets to evaluate fine-grained classification. It consists of 28 categories with 24,255 images in total. Note that the dataset was not built specifically for text recognition, so not all images contain text. Moreover, the high variability of text size, location, resolution and style, together with uncontrolled environmental settings (illumination), makes text recognition on this dataset harder.

Images

Related Publications

This list will be updated. Please cite [1] if you use the dataset.
1. S. Karaoglu, R. Tao, J. C. van Gemert and T. Gevers, Con-Text: Text Detection for Fine-grained Object Classification, IEEE Transactions on Image Processing (TIP), 2017 [pdf]
2. S. Karaoglu, R. Tao, T. Gevers and A. W. M. Smeulders, Words Matter: Scene Text for Image Classification and Retrieval, IEEE Transactions on Multimedia (TMM), 2017 [pdf] [code]
3. S. Karaoglu, J. C. van Gemert and T. Gevers, Con-Text: Text Detection Using Background Connectivity for Fine-Grained Object Classification, ACM Multimedia (ACM MM), 2013 [pdf] [poster] [presentation]

BibTeX

@Article{Karaoglu17,
author    = "Sezer Karaoglu and Ran Tao and Jan van Gemert and Theo Gevers",
title     = "Con-Text: Text Detection for Fine-grained Object Classification",
journal   = "IEEE Transactions on Image Processing (TIP)",
year      = "2017"
}

Results

People

Sezer Karaoglu
Jan C. van Gemert
Theo Gevers

Acknowledgements

Sezer Karaoglu is supported by the Dutch national program COMMIT.

Contact

For questions about the dataset, please contact Sezer Karaoglu (s.karaoglu[at]uva.nl) or Jan C. van Gemert (j.c.vangemert[at]uva.nl).