30 Jan 2015 | Olga Russakovsky* · Jia Deng* · Hao Su · Jonathan Krause · Sanjeev Satheesh · Sean Ma · Zhiheng Huang · Andrej Karpathy · Aditya Khosla · Michael Bernstein · Alexander C. Berg · Li Fei-Fei
The ImageNet Large Scale Visual Recognition Challenge (ILSVRC) is an annual benchmark for object category classification and detection, involving hundreds of object categories and millions of images. This paper discusses the creation of the ILSVRC dataset, the advancements in object recognition, and the challenges of collecting large-scale ground truth annotations. It highlights key breakthroughs in categorical object recognition, provides a detailed analysis of the current state of large-scale image classification and object detection, and compares state-of-the-art computer vision accuracy with human accuracy. The paper also outlines future directions and improvements based on lessons learned from the five years of the challenge. The ILSVRC dataset, which includes image-level and object-level annotations, has facilitated significant progress in object recognition algorithms and has been instrumental in developing new techniques. The paper covers the dataset construction process, including the selection of object categories, image collection, and annotation methods, and provides an overview of the evaluation criteria and methods used in the challenge.The ImageNet Large Scale Visual Recognition Challenge (ILSVRC) is an annual benchmark for object category classification and detection, involving hundreds of object categories and millions of images. This paper discusses the creation of the ILSVRC dataset, the advancements in object recognition, and the challenges of collecting large-scale ground truth annotations. It highlights key breakthroughs in categorical object recognition, provides a detailed analysis of the current state of large-scale image classification and object detection, and compares state-of-the-art computer vision accuracy with human accuracy. The paper also outlines future directions and improvements based on lessons learned from the five years of the challenge. The ILSVRC dataset, which includes image-level and object-level annotations, has facilitated significant progress in object recognition algorithms and has been instrumental in developing new techniques. The paper covers the dataset construction process, including the selection of object categories, image collection, and annotation methods, and provides an overview of the evaluation criteria and methods used in the challenge.