TGIF-QA: Toward Spatio-Temporal Reasoning in Visual Question Answering
Yunseok Jang, Yale Song, Youngjae Yu, Youngjin Kim, Gunhee Kim
Published in 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (01.07.2017)
Conference Proceeding
Scene Graph Refinement Network for Visual Question Answering
Qian, Tianwen, Chen, Jingjing, Chen, Shaoxiang, Wu, Bo, Jiang, Yu-Gang
Published in IEEE transactions on multimedia (2023)
Journal Article
An analysis of graph convolutional networks and recent datasets for visual question answering
Yusuf, Abdulganiyu Abdu, Chong, Feng, Xianling, Mao
Published in The Artificial intelligence review (01.12.2022)
Journal Article
PGCL: Prompt guidance and self-supervised contrastive learning-based method for Visual Question Answering
Gao, Ling, Zhang, Hongda, Liu, Yiming, Sheng, Nan, Feng, Haotian, Xu, Hao
Published in Expert systems with applications (01.10.2024)
Journal Article
Learning Visual Knowledge Memory Networks for Visual Question Answering
Su, Zhou, Zhu, Chen, Dong, Yinpeng, Cai, Dongqi, Chen, Yurong, Li, Jianguo
Published in 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (01.06.2018)
Conference Proceeding
DOMAS: Data Oriented Medical Visual Question Answering Using Swin Transformer
Teodora-Alexandra TOADER
Published in Studia Universitatis Babes-Bolyai: Series Informatica (20.07.2023)
Journal Article
Exploring Sparse Spatial Relation in Graph Inference for Text-Based VQA
Zhou, Sheng, Guo, Dan, Li, Jia, Yang, Xun, Wang, Meng
Published in IEEE transactions on image processing (2023)
Journal Article
Human-Adversarial Visual Question Answering
Sheng, Sasha, Singh, Amanpreet, Goswami, Vedanuj, Magana, Jose Alberto Lopez, Galuba, Wojciech, Parikh, Devi, Kiela, Douwe
Year of Publication 04.06.2021
Journal Article
Towards Perceiving Small Visual Details in Zero-shot Visual Question Answering with Multimodal LLMs
Zhang, Jiarui, Khayatkhoei, Mahyar, Chhikara, Prateek, Ilievski, Filip
Year of Publication 24.10.2023
Journal Article
Human-Adversarial Visual Question Answering
Sheng, Sasha, Singh, Amanpreet, Goswami, Vedanuj, Lopez Magana, Jose Alberto, Galuba, Wojciech, Parikh, Devi, Kiela, Douwe
Published in arXiv.org (04.06.2021)
Paper