Video Captioning Using Global-Local Representation

Video captioning is a challenging task as it needs to accurately transform visual understanding into natural language description. To date, state-of-the-art methods inadequately model global-local vision representation for sentence generation, leaving plenty of room for improvement. In this work, we...

Full description

Saved in:
Bibliographic Details
Published inIEEE transactions on circuits and systems for video technology Vol. 32; no. 10; pp. 6642 - 6656
Main Authors Yan, Liqi, Ma, Siqi, Wang, Qifan, Chen, Yingjie, Zhang, Xiangyu, Savakis, Andreas, Liu, Dongfang
Format Journal Article
LanguageEnglish
Published United States IEEE 01.10.2022
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects
Online AccessGet full text

Cover

Loading…