Access the full text.
Sign up today, get DeepDyve free for 14 days.
D. Coombs, Christopher Brown (1990)
Intelligent gaze control in binocular visionProceedings. 5th IEEE International Symposium on Intelligent Control 1990
The Design of GoalBased Scenarios
R. Schank (1982)
Dynamic Memory
R. Schank, A. Fano, B. Bell, M. Jona (1994)
The design of Goal-Based ScenariosThe Journal of the Learning Sciences, 3
T. Poggio, V. Torre, C. Koch (1985)
Computational vision and regularization theoryNature, 317
M. Swain, D. Ballard (1991)
Color indexingInternational Journal of Computer Vision, 7
S. Papert (1981)
Mindstorms: Children, Computers, and Powerful Ideas
Mitchell Marcus (1979)
A theory of syntactic recognition for natural language
W. Ferguson, R. Bareiss, L. Birnbaum, Richard Osgood (1992)
ASK Systems: An Approach to the Realization of Story-Based TeachersThe Journal of the Learning Sciences, 2
N. Chomsky (1965)
Aspects of the Theory of Syntax
Peter Prokopowicz, P. Cooper (1993)
The dynamic retina: Contrast and motion detection for active visionInternational Journal of Computer Vision, 16
Eugene Charniak, D. McDermott (1986)
Introduction to artificial intelligence
A. Whitehead
The aims of education
T. Poggio, V. Torre, C. Koch (1987)
Readings In Computer Vision
L. Birnbaum, M. Brand, P. Cooper (1993)
Looking for trouble: Using causal semantics to direct focus of attention1993 (4th) International Conference on Computer Vision
R. Schank (1977)
Rules and Topics in ConversationCogn. Sci., 1
Y. Aloimonos, I. Weiss, A. Bandyopadhyay (2004)
Active visionInternational Journal of Computer Vision, 1
Steven Abney, R. Berwick, C. Tenny (1991)
Principle-Based Parsing: Computation and Psycholinguistics
D. Ballard (1991)
Animate VisionArtif. Intell., 48
R. Bajcsy (1988)
Active perceptionProc. IEEE, 76
(1994)
Maintaining Visual Models of a Scene Using Change Primitives
M. Tomita (1985)
Efficient Parsing for Natural Language: A Fast Algorithm for Practical Systems
Charles Martin (1992)
Direct memory access parsing
Research in vision and language has traditionally remained separate in part because the classic task of generating a representation of a given image or sentence has resulted in an emphasis on low level structural aspects of these media. In this paper we argue that image and language understanding should be approached with the intent of facilitating the performance of a task. Under this view research in image and language understanding must confront common issues that arise as a task is pursued. Language and images are both input that can be used to maintain a model of a task. We argue that a model may be maintained by incorporating changes in the scene that can be characterized at a high level of abstraction yet manifest themselves at relatively low levels of analysis. Existing task-relevant models and the associated domain knowledge are used to expect specific changes and disambiguate the interpretation of these changes, thereby allowing them to modify the existing model. From this perspective, understanding input is largely independent of the modality of the input.
Artificial Intelligence Review – Springer Journals
Published: Dec 4, 2004
Read and print from thousands of top scholarly journals.
Already have an account? Log in
Bookmark this article. You can see your Bookmarks on your DeepDyve Library.
To save an article, log in first, or sign up for a DeepDyve account if you don’t already have one.
Copy and paste the desired citation format or use the link below to download a file formatted for EndNote
Access the full text.
Sign up today, get DeepDyve free for 14 days.
All DeepDyve websites use cookies to improve your online experience. They were placed on your computer when you launched this website. You can change your cookie settings through your browser.