Try to recognize and segment as many object categories as you can. Training images correspond to outdoor pictures taken in different cities of Spain.
Characteristics of the dataset:
Images | Objects | Cars | Person | Building | Road | Sidewalk | Sky | Tree | |
---|---|---|---|---|---|---|---|---|---|
Training Set | 2920 | 32164 | 4441 | 2524 | 3004 | 1321 | 1272 | 1009 | 2652 |
Test Set | 1133 | 32853 | 2265 | 2119 | 2117 | 739 | 1107 | 823 | 1652 |
Challenges:
Release October 22, 2008:
training.tar.gz (5.8 Gbytes)
|
thumbnails
|
list of training categories
test.tar.gz (1.8 Gbytes)
|
thumbnails
|
list of test categories
Use the LabelMe toolbox to read the annotations and to extract segmentation masks.
Send us your comments .
Citation: LabelMe: a database and web-based tool for image annotation . B. Russell, A. Torralba, K. Murphy, W. T. Freeman. International Journal of Computer Vision, 2007.
Try to recognize and segment as many object categories as you can. Use 100 images for training from each scene category (this will give you a total of 800 training images), and the rest for testing. Report performances for each object separatelly. Not all the objects have the same amount of training data available. But this reflects the fact that for some objects it is easier to gather data than for others.
Download datasets, code and paper
Citation: Modeling the shape of the scene: a holistic representation of the spatial envelope. A. Oliva, A. Torralba. International Journal of Computer Vision, Vol. 42(3): 145-175, 2001.
(c) MIT, Computer Science and Artificial Intelligence Laboratory. Accessibility