The utility of image descriptions in the initial stages of vision: A case study of printed text

Vision research has made very substantial progress towards understanding how we see. It is one area of psychology where the three‐way thrust of behavioural measurements (psychophysics), brain imaging, and computational studies have been combined quite routinely for some years. The purpose of this pa...

Full description

Saved in:

Bibliographic Details
Published in	The British journal of psychology Vol. 101; no. 1; pp. 1 - 26
Main Authors	Watt, Roger J., Dakin, Steven C.
Format	Journal Article
Language	English
Published	Oxford, UK Blackwell Publishing Ltd 01.02.2010 British Psychological Society
Subjects	Biological and medical sciences Computer Simulation Eyes & eyesight Fundamental and applied biological sciences. Psychology Humans Image Enhancement Image Processing, Computer-Assisted Language Models, Biological Pattern Recognition, Automated - methods Perception Photic Stimulation Production and perception of written language Psychology Psychology. Psychoanalysis. Psychiatry Psychology. Psychophysiology Sensory perception Signal Processing, Computer-Assisted Studies Vision Vision, Ocular - physiology Visual Perception Human Reading Description Language Vision Perception Text Review Image
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Vision research has made very substantial progress towards understanding how we see. It is one area of psychology where the three‐way thrust of behavioural measurements (psychophysics), brain imaging, and computational studies have been combined quite routinely for some years. The purpose of this paper is to demonstrate a relatively unusual form of computational modelling that we characterise as involving image descriptions. Image descriptions are statements about structures in images and relationships between structures. Most modelling in vision is either conceived in fairly terms, or is done at the level of images. Neither is entirely satisfactory, and image descriptions are a simple formulation of age‐old ideas about a Vocabulary of image features that are detected and parameterized from actual digital images. For our example, we use the domain of the visual perception of printed text. This is an area that has been characterized by thorough, robust psychophysical experiments. The fundamental requirements of visual processing in this domain are: grouping of some parts if the image into words; at the same time segmenting words from each other. We show how these are readily understood in terms of our model of image descriptions, and show quantitatively that typographical practice, refined over centuries, is about optimum for the visual system at least as represented by our model. In addition, we show that the same notion of image descriptions could, in principle, support word recognition in certain circumstances.
Bibliography:	istex:424791C71C2870E16C287447F884AB258DAAB37E ark:/67375/WNG-WXMD1DLC-M ArticleID:BJOP258
ISSN:	0007-1269 2044-8295
DOI:	10.1348/000712608X379070