Operative ekphrasis: the collapse of the text/image distinction in multimodal AI

Bibliographic Details
Published in Word & image (London. 1985) Vol. 40; no. 2; pp. 77-90
Main Author Bajohr, Hannes
Format Journal Article
Language English
Published Abingdon Routledge 02.04.2024
Taylor & Francis Ltd
Summary: This article discusses the implications of multimodal artificial intelligence (AI), including image generators such as DALL·E, for the traditional concept of ekphrasis. Using ekphrasis as an example of 'thinking with AI', it takes up the suggestion that in the digital realm ekphrastic relationships should be understood as performative rather than representational. Since with the introduction of modern AI the digital realm needs to be divided into a sequential part (classic algorithms) and a connectionist part (artificial neural networks), the article shows how the latter part ultimately tends toward a collapse of the text/image distinction in the technical system. Artificial neural networks both encode images and text as the same type of information, and they do so differently from the sequential model. Only in the context of multimodal AI, unlike in analogue or sequential paradigms, ekphrasis goes beyond the separation of or transition between text and image, but rather transcends this difference.
ISSN: 0266-6286, 1943-2178
DOI: 10.1080/02666286.2024.2330335