The PASCAL Object Recognition Database Collection,
2006.
Dataset, Objects.
HTML Version. Various datasets for object recognition. Pointers to some of the
others.
MSR VTT Dataset,
A Large Video Description Dataset for Bridging Video and Language.
WWW Link.
Dataset, Visual Question Answering.
Video Objects: A Test Database for Video Object Recognition,
2006.
Dataset, Objects.
HTML Version. 180 videos of 15 objects.
Animals with Attributes: A dataset for Attribute Based Classification,
2006.
Dataset, Objects.
WWW Link. 30,000+ images, 40 animal classes.
ImageNet,
2014.
WWW Link.
Dataset, Objects. Large set of images (or sets of datasets) for recognition.
Related to ImageNet Challenges for recognition.
14 million+ images.
Links to Stanford
See also Stanford University, Computer Science Department. and Princeton.
See also Princeton.
Washington Ground Truth Image Database,
CBIR dataset.
Online2004
WWW Link.
Dataset.
Dataset, Retrieval.
BibRef
0400
LHI Object Datasets,
Includes hand segmentations, and annotations.
Online2004
HTML Version.
Dataset.
Dataset, Object Recognition.
Transportation images, Animals, Aerial Images, Objects,
Dataset also includes other data.
See also Lotus Hill Institute.
BibRef
0400
NEC Animal Dataset,
Online2009
WWW Link.
Dataset.
Dataset, Object Recognition. It consists of about 5000 high-quality images
of 60 toy animals taken in different poses against a plain
background.
BibRef
0900
Xcavator.Net,
Online2007
WWW Link.
Dataset, Object Recognition. Photo search for professional use. Searches stock databases; you then
purchase the image for use.
Part of CogniSign LLC.
BibRef
0700
Geusebroek, J.M.[Jan-Mark],
Burghouts, G.J.[Gertjan J.],
Smeulders, A.W.M.[Arnold W.M.],
The Amsterdam Library of Object Images,
IJCV(61), No. 1, January 2005, pp. 103-112.
DOI Link
0410
WWW Link.
Dataset, Objects. 1000 objects, over 100 images per object.
BibRef
Torralba, A.B.[Antonio B.],
Fergus, R.[Rob],
Freeman, W.T.[William T.],
80 Million Tiny Images: A Large Data Set for Nonparametric Object and
Scene Recognition,
PAMI(30), No. 11, November 2008, pp. 1958-1970.
IEEE DOI
WWW Link.
0809
BibRef
And:
CSAIL-TR-2007-024, 2007.
Dataset, Retrieval. Images from the WWW, associated with a noun. Large comprehensive dataset.
Dataset with segmentations.
BibRef
Gong, Y.C.[Yun-Chao],
Pawlowski, M.[Marcin],
Yang, F.[Fei],
Brandy, L.[Louis],
Bourdev, L.[Lubomir],
Fergus, R.[Rob],
Web scale photo hash clustering on a single machine,
CVPR15(19-27)
IEEE DOI
1510
BibRef
Russell, B.[Bryan],
Torralba, A.B.[Antonio B.],
Freeman, W.T.[William T.],
LabelMe: The Open Annotation Tool,
Online2010.
WWW Link.
1108
Dataset, Retrieval.
Code, Annotation. The site for the annotation tool, also the video version.
BibRef
Zhou, B.[Bolei],
Lapedriza, A.[Agata],
Khosla, A.[Aditya],
Oliva, A.[Aude],
Torralba, A.B.[Antonio B.],
Places: A 10 Million Image Database for Scene Recognition,
PAMI(40), No. 6, June 2018, pp. 1452-1464.
IEEE DOI
1805
Dataset, Retrieval. Context, Databases, Image recognition, Semantics, Sun, Training,
Visualization, Scene classification, deep feature, deep learning,
visual recognition
BibRef
Escalante, H.J.[Hugo Jair],
Hernandez, C.A.[Carlos A.],
Gonzalez, J.A.[Jesus A.],
Lopez-Lopez, A.,
Montes-y-Gomez, M.[Manuel],
Morales, E.F.[Eduardo F.],
Sucar, L.E.[L. Enrique],
Villasenor, L.[Luis],
Grubinger, M.[Michael],
The segmented and annotated IAPR TC-12 benchmark,
CVIU(114), No. 4, April 2010, pp. 419-428.
Elsevier DOI
1003
Dataset, Retrieval. Data set creation; Ground truth collection; Evaluation metrics;
Automatic image annotation; Image retrieval
BibRef
Russakovsky, O.[Olga],
Deng, J.[Jia],
Su, H.[Hao],
Krause, J.[Jonathan],
Satheesh, S.[Sanjeev],
Ma, S.[Sean],
Huang, Z.H.[Zhi-Heng],
Karpathy, A.[Andrej],
Khosla, A.[Aditya],
Bernstein, M.[Michael],
Berg, A.C.[Alexander C.],
Fei-Fei, L.[Li],
ImageNet Large Scale Visual Recognition Challenge,
IJCV(115), No. 3, December 2015, pp. 211-252.
Springer DOI
1512
Dataset, Object Category. Object category classification and detection on hundreds of object
categories and millions of images.
BibRef
Loh, Y.P.[Yuen Peng],
Chan, C.S.[Chee Seng],
Getting to know low-light images with the Exclusively Dark dataset,
CVIU(178), 2019, pp. 30-42.
Elsevier DOI
1812
Dataset, Low Light.
BibRef
Rosu, R.A.[Radu Alexandru],
Quenzel, J.[Jan],
Behnke, S.[Sven],
Semi-supervised Semantic Mapping Through Label Propagation with
Semantic Texture Meshes,
IJCV(128), No. 5, May 2020, pp. 1220-1238.
Springer DOI
2005
BibRef
Aizawa, K.,
Fujimoto, A.,
Otsubo, A.,
Ogawa, T.,
Matsui, Y.,
Tsubota, K.,
Ikuta, H.,
Building a Manga Dataset 'Manga109' With Annotations for Multimedia
Applications,
MultMedMag(27), No. 2, April 2020, pp. 8-18.
IEEE DOI
2006
Dataset, Manga. Machine learning, Visualization, Character recognition, Art,
Machine learning algorithms, Task analysis
BibRef
Kuznetsova, A.[Alina],
Rom, H.[Hassan],
Alldrin, N.[Neil],
Uijlings, J.[Jasper],
Krasin, I.[Ivan],
Pont-Tuset, J.[Jordi],
Kamali, S.[Shahab],
Popov, S.[Stefan],
Malloci, M.[Matteo],
Kolesnikov, A.[Alexander],
Duerig, T.[Tom],
Ferrari, V.[Vittorio],
The Open Images Dataset V4,
IJCV(128), No. 7, July 2020, pp. 1956-1981.
Springer DOI
2007
Dataset, Object Detection. 9.2M images with unified annotations.
HTML Version.
BibRef
Maugey, T.,
Toni, L.,
Large Database Compression Based on Perceived Information,
SPLetters(27), 2020, pp. 1735-1739.
IEEE DOI
2010
Covariance matrices, Compression algorithms, Databases,
Measurement, Signal processing algorithms, Image coding, Entropy,
sampling
BibRef
He, Y.[Yue],
Shen, Z.Y.[Zhe-Yan],
Cui, P.[Peng],
Towards Non-I.I.D. image classification: A dataset and baselines,
PR(110), 2021, pp. 107383.
Elsevier DOI
2011
Non-I.I.D, Dataset, Context, Bias, ConvNet, Batch balancing
BibRef
Pang, Y.,
Cao, J.,
Li, Y.,
Xie, J.,
Sun, H.,
Gong, J.,
TJU-DHD: A Diverse High-Resolution Dataset for Object Detection,
IP(30), 2021, pp. 207-219.
IEEE DOI
2011
Object detection, Feature extraction, Image resolution,
Face recognition, Proposals, Training, Face detection, Dataset,
large scale
BibRef
Xu, X.W.[Xiao-Wei],
Zhang, X.Y.[Xin-Yi],
Yu, B.[Bei],
Hu, X.B.S.[Xiao-Bo Sharon],
Rowen, C.[Christopher],
Hu, J.T.[Jing-Tong],
Shi, Y.Y.[Yi-Yu],
DAC-SDC Low Power Object Detection Challenge for UAV Applications,
PAMI(43), No. 2, February 2021, pp. 392-403.
IEEE DOI
2101
More for detection, but generally a dataset and evaluation paper.
This paper presents in detail the dataset and evaluation procedure. It
further discusses the methods developed by some of the entries as well
as representative results.
Object detection, Graphics processing units,
Field programmable gate arrays, Task analysis, low power
BibRef
Duan, J.,
Yu, S.,
Tan, H.L.,
Tan, C.,
Actionet: An Interactive End-To-End Platform For Task-Based Data
Collection And Augmentation In 3D Environment,
ICIP20(1566-1570)
IEEE DOI
2011
Collecting the data.
Task analysis, Videos,
Graphical user interfaces, Planning, Data collection, Robots,
3D environment
BibRef
Zhang, Y.,
Zhang, L.,
Hamidouche, W.,
Deforges, O.,
A Fixation-Based 360° Benchmark Dataset For Salient Object Detection,
ICIP20(3458-3462)
IEEE DOI
2011
Benchmark testing, Visualization,
Object detection, Head, Measurement, Training, VR,
benchmark
BibRef
Hsu, T.M.H.[Tzu-Ming Harry],
Qi, H.[Hang],
Brown, M.[Matthew],
Federated Visual Classification with Real-World Data Distribution,
ECCV20(X:76-92).
Springer DOI
2011
Species and landmark classification.
BibRef
Zheng, J.[Jia],
Zhang, J.F.[Jun-Fei],
Li, J.[Jing],
Tang, R.[Rui],
Gao, S.H.[Sheng-Hua],
Zhou, Z.[Zihan],
Structured3D:
A Large Photo-realistic Dataset for Structured 3D Modeling,
ECCV20(IX:519-535).
Springer DOI
2011
BibRef
Song, J.M.[Jia-Ming],
Dauphin, Y.[Yann],
Auli, M.[Michael],
Ma, T.Y.[Teng-Yu],
Robust and On-the-Fly Dataset Denoising for Image Classification,
ECCV20(XXIX: 556-572).
Springer DOI
2010
BibRef
Wang, X.,
Zhang, X.,
Zhu, Y.,
Guo, Y.,
Yuan, X.,
Xiang, L.,
Wang, Z.,
Ding, G.,
Brady, D.,
Dai, Q.,
Fang, L.,
PANDA: A Gigapixel-Level Human-Centric Video Dataset,
CVPR20(3265-3275)
IEEE DOI
2008
Task analysis, Spatial resolution, Trajectory, Cameras,
Benchmark testing, Visualization, Head
BibRef
Warburg, F.[Frederik],
Hauberg, S.[Søren],
López-Antequera, M.[Manuel],
Gargallo, P.[Pau],
Kuang, Y.[Yubin],
Civera, J.[Javier],
Mapillary Street-Level Sequences:
A Dataset for Lifelong Place Recognition,
CVPR20(2623-2632)
IEEE DOI
2008
Dataset, Mapillary mapping platform.
Urban areas, Cameras, Image recognition, Meteorology, Task analysis,
Image sequences, Benchmark testing
BibRef
Li, X.,
Wei, T.,
Chen, Y.P.,
Tai, Y.,
Tang, C.,
FSS-1000: A 1000-Class Dataset for Few-Shot Segmentation,
CVPR20(2866-2875)
IEEE DOI
2008
Image segmentation, Training, Animals, Task analysis, Semantics,
Computer vision, Tools
BibRef
Scheck, T.[Tobias],
Seidel, R.[Roman],
Hirtz, G.[Gangolf],
Learning from THEODORE: A Synthetic Omnidirectional Top-View Indoor
Dataset for Deep Transfer Learning,
WACV20(932-941)
IEEE DOI
2006
Dataset, Fisheye Images. Cameras, Image segmentation, Object detection,
Semantics, Solid modeling, Rendering (computer graphics)
BibRef
Chou, S.H.[Shih-Han],
Sun, C.[Cheng],
Chang, W.Y.[Wen-Yen],
Hsu, W.T.[Wan-Ting],
Sun, M.[Min],
Fu, J.L.[Jian-Long],
360-Indoor: Towards Learning Real-World Objects in 360° Indoor
Equirectangular Images,
WACV20(834-842)
IEEE DOI
2006
Object detection, Videos, Distortion, Automobiles,
Computer vision, Task analysis
BibRef
Behley, J.,
Garbade, M.,
Milioto, A.,
Quenzel, J.,
Behnke, S.,
Stachniss, C.,
Gall, J.,
SemanticKITTI:
A Dataset for Semantic Scene Understanding of LiDAR Sequences,
ICCV19(9296-9306)
IEEE DOI
2004
Dataset, LiDAR. distance measurement, image segmentation,
optical radar, stereo image processing, LiDAR sequences, Lasers
BibRef
Wang, X.,
Wu, J.,
Chen, J.,
Li, L.,
Wang, Y.,
Wang, W.Y.,
VaTeX: A Large-Scale, High-Quality Multilingual Dataset for
Video-and-Language Research,
ICCV19(4580-4590)
IEEE DOI
2004
language translation, linguistics, natural language processing,
video signal processing, unified multilingual model, Social network services
BibRef
Gu, S.,
Lugmayr, A.,
Danelljan, M.,
Fritsche, M.,
Lamour, J.,
Timofte, R.,
DIV8K: DIVerse 8K Resolution Image Dataset,
AIM19(3512-3516)
IEEE DOI
2004
Dataset, High Resolution. convolutional neural nets, image resolution,
learning (artificial intelligence), CNN, image processing
BibRef
Mauceri, C.[Cecilia],
Palmer, M.[Martha],
Heckman, C.[Christoffer],
SUN-Spot: An RGB-D Dataset With Spatial Referring Expressions,
CLVL19(1883-1886)
IEEE DOI
2004
Dataset, Recognition. image colour analysis, object detection, SLAM (robots),
spatial referring expressions, SUN-Spot, objects localization,
multimodal
BibRef
Sølund, T.[Thomas],
Buch, A.G.[Anders Glent],
Krüger, N.[Norbert],
Aanæs, H.[Henrik],
A Large-Scale 3D Object Recognition Dataset,
3DV16(73-82)
IEEE DOI
1701
Dataset, Object Recognition.
WWW Link. object recognition
BibRef
Hua, B.S.[Binh-Son],
Pham, Q.H.[Quang-Hieu],
Nguyen, D.T.[Duc Thanh],
Tran, M.K.[Minh-Khoi],
Yu, L.F.[Lap-Fai],
Yeung, S.K.[Sai-Kit],
SceneNN: A Scene Meshes Dataset with aNNotations,
3DV16(92-101)
IEEE DOI
1701
Dataset, RGB-D.
WWW Link. Cameras
BibRef
Rotman, D.[Daniel],
Gilboa, G.[Guy],
A Depth Restoration Occlusionless Temporal Dataset,
3DV16(176-184)
IEEE DOI
1701
Dataset, RGB-D.
BibRef
Zhang, J.J.[Jun-Jie],
Zhang, J.[Jian],
Lu, J.F.[Jian-Feng],
Shen, C.H.[Chun-Hua],
Curr, K.[Kate],
Phua, R.[Robin],
Neville, R.[Richard],
Edmonds, E.[Elise],
SLNSW-UTS:
A Historical Image Dataset for Image Multi-Labeling and Retrieval,
DICTA16(1-6)
IEEE DOI
1701
Dataset, Object Recognition. 29713 images, 119 labels.
BibRef
Xiang, Y.[Yu],
Kim, W.[Wonhui],
Chen, W.[Wei],
Ji, J.W.[Jing-Wei],
Choy, C.[Christopher],
Su, H.[Hao],
Mottaghi, R.[Roozbeh],
Guibas, L.J.[Leonidas J.],
Savarese, S.[Silvio],
ObjectNet3D: A Large Scale Database for 3D Object Recognition,
ECCV16(VIII: 160-176).
Springer DOI
1611
Dataset, Object Recognition.
WWW Link.
BibRef
Lin, T.Y.[Tsung-Yi],
Maire, M.[Michael],
Belongie, S.J.[Serge J.],
Hays, J.[James],
Perona, P.[Pietro],
Ramanan, D.[Deva],
Dollár, P.[Piotr],
Zitnick, C.L.[C. Lawrence],
Microsoft COCO: Common Objects in Context,
ECCV14(V: 740-755).
Springer DOI
1408
Dataset, Objects.
WWW Link.
BibRef
Flickr30k Dataset,
From image descriptions to visual denotations.
WWW Link.
Dataset, Visual Question Answering. Extension of Flickr 8k dataset.
Plummer, B.A.[Bryan A.],
Wang, L.[Liwei],
Cervantes, C.M.[Chris M.],
Caicedo, J.C.[Juan C.],
Hockenmaier, J.[Julia],
Lazebnik, S.[Svetlana],
Flickr30k Entities: Collecting Region-to-Phrase Correspondences for
Richer Image-to-Sentence Models,
IJCV(123), No. 1, May 2017, pp. 74-93.
Springer DOI
1705
BibRef
Earlier:
ICCV15(2641-2649)
IEEE DOI
1602
Dataset, Object Recognition. Benchmark testing
BibRef
Fanello, S.R.[Sean Ryan],
Ciliberto, C.[Carlo],
Santoro, M.[Matteo],
Natale, L.[Lorenzo],
Metta, G.[Giorgio],
Rosasco, L.[Lorenzo],
Odone, F.[Francesca],
iCub World: Friendly Robots Help Building Good Vision Data-Sets,
GT13(700-705)
IEEE DOI
1309
Dataset, Object Recognition. Human Robot Interaction; Object Categorization Dataset; iCub
BibRef
Ponomarenko, N.[Nikolay],
Ieremeiev, O.[Oleg],
Lukin, V.[Vladimir],
Jin, L.[Lina],
Egiazarian, K.O.[Karen O.],
A New Color Image Database TID2013: Innovations and Results,
ACIVS13(402-413).
Springer DOI
1311
Dataset, Color Images.
BibRef
Ponce, J.,
Berg, T.L.,
Everingham, M.R.,
Forsyth, D.A.,
Hebert, M.,
Lazebnik, S.[Svetlana],
Marszalek, M.,
Schmid, C.,
Russell, B.C.,
Torralba, A.,
Williams, C.K.I.,
Zhang, J.,
Zisserman, A.,
Dataset Issues in Object Recognition,
CLOR06(29-48).
Springer DOI
0711
Dataset, Discussion.
BibRef
Campbell, R., and
Flynn, P.J.,
A WWW-Accessible 3D Image and Model Database for
Computer Vision Research,
EEMCV98(148-154).
BibRef
9800
And:
EEMTV98(xx)
Dataset, 3-D Data.
HTML Version.
BibRef
Nene, S.A.,
Nayar, S.K.[Shree K.],
Murase, H.[Hiroshi],
Columbia Object Image Library (COIL-100),
Columbia Technical Report CUCS-006-96, February 1996.
PS File. Also:
WWW Link. Also the COIL-20 database.
WWW Link.
Dataset, Objects.
BibRef
9602