The article by D. Marr, "Early Processing of Visual Information," outlines a theory of how the visual system processes information at an early stage. The key idea is that the first step involves computing a "primal sketch," a rich but primitive description of the intensity changes in an image. This description is expressed using a vocabulary of intensity change types (edges, shading edges, extended edges, lines, blobs, etc.) and their parameters (position, orientation, termination points, contrast, size, and fuzziness).
Marr argues that the primal sketch is large and unwieldy, so the next step is to group its contents in a way that aids later recognition. He suggests that our ability to interpret simple drawings without semantic content implies the presence of symbolic processes that can define 'place-tokens' and group them according to certain rules. These grouping processes are not influenced by specific images but are based on general principles such as orientation, collinearity, and spatial proximity.
The article discusses various techniques for detecting and combining intensity changes, including the use of different mask sizes and orientations. It also explores the implications of these processes for neurophysiology and the role of higher-level knowledge and purpose in visual information processing. Marr emphasizes that the primal sketch and subsequent grouping operations are fundamental to early visual processing and that they are relatively independent of the specific image being processed.The article by D. Marr, "Early Processing of Visual Information," outlines a theory of how the visual system processes information at an early stage. The key idea is that the first step involves computing a "primal sketch," a rich but primitive description of the intensity changes in an image. This description is expressed using a vocabulary of intensity change types (edges, shading edges, extended edges, lines, blobs, etc.) and their parameters (position, orientation, termination points, contrast, size, and fuzziness).
Marr argues that the primal sketch is large and unwieldy, so the next step is to group its contents in a way that aids later recognition. He suggests that our ability to interpret simple drawings without semantic content implies the presence of symbolic processes that can define 'place-tokens' and group them according to certain rules. These grouping processes are not influenced by specific images but are based on general principles such as orientation, collinearity, and spatial proximity.
The article discusses various techniques for detecting and combining intensity changes, including the use of different mask sizes and orientations. It also explores the implications of these processes for neurophysiology and the role of higher-level knowledge and purpose in visual information processing. Marr emphasizes that the primal sketch and subsequent grouping operations are fundamental to early visual processing and that they are relatively independent of the specific image being processed.