Document Type

Article

Rights

Available under a Creative Commons Attribution Non-Commercial Share Alike 4.0 International Licence

Disciplines

1.2 COMPUTER AND INFORMATION SCIENCE

Publication Details

In Artificial Intelligence Review 2006, Vol. 25, issue 1-2, pp. 21-35. http://www.springerlink.com/content/v6160l661j29/?p=b4a8d4514d264cce8d7c00d9f3eab46a&pi=9

Abstract

In recent years a number of psycholinguistic experiments have pointed to the interaction between language and vision, in particular the interaction between visual attention and linguistic reference. In parallel with this, several theories of discourse have attempted to provide an account of the relationship between types of referential expressions on the one hand and the degree of mental activation on the other. Building on both of these traditions, this paper describes an attention-based approach to visually situated reference resolution. The framework uses the relationship between referential form and preferred mode of interpretation as a basis for a weighted integration of linguistic and visual attention scores for each entity in the multimodal context. The resulting integrated attention scores are then used to rank the candidate referents during the resolution process, with the highest-scoring candidate selected as the referent. One advantage of this approach is that the resolution process occurs within the full multimodal context, insofar as the referent is selected from a full list of the objects in the multimodal context. As a result, situations where the intended target of the reference is erroneously excluded, due to an individual assumption within the resolution process, are avoided. Moreover, the system can recognise situations where attention cues from different modalities make a reference potentially ambiguous.
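
The weighted integration and ranking described in the abstract can be illustrated with a minimal Python sketch. It is not the paper's implementation: the weight table `FORM_WEIGHTS`, the score ranges, the `margin` threshold for flagging ambiguity, and all names below are assumptions made purely for illustration.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    name: str
    linguistic: float  # assumed linguistic attention score in [0, 1]
    visual: float      # assumed visual attention score in [0, 1]


# Hypothetical weights per referential form: (linguistic weight, visual weight).
# The values are illustrative only and are not taken from the paper.
FORM_WEIGHTS = {
    "pronoun": (0.8, 0.2),
    "definite": (0.5, 0.5),
    "demonstrative": (0.2, 0.8),
}


def resolve(form, candidates, margin=0.05):
    """Rank every candidate in the context and return (best_name, ambiguous).

    `ambiguous` is True when the two top integrated scores differ by less
    than `margin`, an assumed stand-in for conflicting attention cues.
    """
    if not candidates:
        raise ValueError("the multimodal context contains no candidates")
    w_ling, w_vis = FORM_WEIGHTS[form]

    def integrated(c):
        # Weighted integration of the two attention scores for one entity.
        return w_ling * c.linguistic + w_vis * c.visual

    # No candidate is filtered out beforehand: the full list is ranked.
    ranked = sorted(candidates, key=integrated, reverse=True)
    ambiguous = (
        len(ranked) > 1
        and integrated(ranked[0]) - integrated(ranked[1]) < margin
    )
    return ranked[0].name, ambiguous


context = [
    Candidate("box", linguistic=0.9, visual=0.1),
    Candidate("ball", linguistic=0.1, visual=0.9),
]
pronoun_result = resolve("pronoun", context)
demonstrative_result = resolve("demonstrative", context)
definite_result = resolve("definite", context)
```

With these made-up scores, a pronoun favours the linguistically salient box and a demonstrative favours the visually salient ball, while the equally weighted definite form leaves the two candidates tied and is flagged as potentially ambiguous.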

DOI

https://doi.org/10.1007/s10462-007-9022-9

Recommended Citation

Kelleher, J.: Attention Driven Reference Resolution in Multimodal Contexts. Artificial Intelligence Review. Vol. 25, issue 1-2, pp. 21-35. 2006. doi:10.1007/s10462-007-9022-9

Included in

Computer Engineering Commons
