Operative ekphrasis: the collapse of the text/image distinction in multimodal AI
This article discusses the implications of multimodal artificial intelligence (AI), including image generators such as DALL·E, for the traditional concept of ekphrasis. Using ekphrasis as an example of 'thinking with AI', it takes up the suggestion that in the digital realm ekphrastic rela...
Saved in:
Published in | Word & image (London. 1985) Vol. 40; no. 2; pp. 77 - 90 |
---|---|
Main Author | |
Format | Journal Article |
Language | English |
Published |
Abingdon
Routledge
02.04.2024
Taylor & Francis Ltd |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | This article discusses the implications of multimodal artificial intelligence (AI), including image generators such as DALL·E, for the traditional concept of ekphrasis. Using ekphrasis as an example of 'thinking with AI', it takes up the suggestion that in the digital realm ekphrastic relationships should be understood as performative rather than representational. Since with the introduction of modern AI the digital realm needs to be divided into a sequential part (classic algorithms) and a connectionist part (artificial neural networks), the article shows how the latter part ultimately tends toward a collapse of the text/image distinction in the technical system. Artificial neural networks both encode images and text as the same type of information, and they do so differently from the sequential model. Only in the context of multimodal AI, unlike in analogue or sequential paradigms, ekphrasis goes beyond the separation of or transition between text and image, but rather transcends this difference. |
---|---|
ISSN: | 0266-6286 1943-2178 |
DOI: | 10.1080/02666286.2024.2330335 |