Learning Dual Semantic Relations With Graph Attention for Image-Text Matching
Image-Text Matching is one major task in cross-modal information processing. The main challenge is to learn the unified visual and textual representations. Previous methods that perform well on this task primarily focus on not only the alignment between region features in images and the correspondin...
Saved in:
Published in | IEEE transactions on circuits and systems for video technology Vol. 31; no. 7; pp. 2866 - 2879 |
---|---|
Main Authors | , , |
Format | Journal Article |
Language | English |
Published |
New York
IEEE
01.07.2021
The Institute of Electrical and Electronics Engineers, Inc. (IEEE) |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Be the first to leave a comment!