Rethinking VLMs and LLMs for Image Classification

Visual Language Models (VLMs) are now increasingly being merged with Large Language Models (LLMs) to enable new capabilities, particularly in terms of improved interactivity and open-ended responsiveness. While these are remarkable capabilities, the contribution of LLMs to enhancing the longstanding...

Full description

Saved in:
Bibliographic Details
Published inarXiv.org
Main Authors Cooper, Avi, Kato, Keizo, Shih, Chia-Hsien, Yamane, Hiroaki, Vinken, Kasper, Takemoto, Kentaro, Sunagawa, Taro, Hao-Wei Yeh, Yamanaka, Jin, Mason, Ian, Boix, Xavier
Format Paper
LanguageEnglish
Published Ithaca Cornell University Library, arXiv.org 03.10.2024
Subjects
Online AccessGet full text

Cover

Loading…