Learning from Children: Improving Image-Caption Pretraining via Curriculum
Ayyubi, Hammad A, Lokesh, Rahul, Zareian, Alireza, Wu, Bo, Chang, Shih-Fu
Year of Publication 27.05.2023
Year of Publication 27.05.2023
Get full text
Journal Article
IdealGPT: Iteratively Decomposing Vision and Language Reasoning via Large Language Models
You, Haoxuan, Sun, Rui, Wang, Zhecan, Chen, Long, Wang, Gengyu, Ayyubi, Hammad A, Chang, Kai-Wei, Chang, Shih-Fu
Year of Publication 24.05.2023
Year of Publication 24.05.2023
Get full text
Journal Article
Video Summarization: Towards Entity-Aware Captions
Ayyubi, Hammad A, Liu, Tianqi, Nagrani, Arsha, Lin, Xudong, Zhang, Mingda, Arnab, Anurag, Han, Feng, Zhu, Yukun, Liu, Jialu, Chang, Shih-Fu
Year of Publication 01.12.2023
Year of Publication 01.12.2023
Get full text
Journal Article
Beyond Grounding: Extracting Fine-Grained Event Hierarchies Across Modalities
Ayyubi, Hammad A, Thomas, Christopher, Chum, Lovish, Lokesh, Rahul, Chen, Long, Niu, Yulei, Lin, Xudong, Feng, Xuande, Koo, Jaywon, Ray, Sounak, Chang, Shih-Fu
Year of Publication 14.06.2022
Year of Publication 14.06.2022
Get full text
Journal Article
Generating Rationales in Visual Question Answering
Ayyubi, Hammad A, Tanjim, Md. Mehrab, McAuley, Julian J, Cottrell, Garrison W
Year of Publication 04.04.2020
Year of Publication 04.04.2020
Get full text
Journal Article
Learning from Children: Improving Image-Caption Pretraining via Curriculum
Ayyubi, Hammad A, Lokesh, Rahul, Zareian, Alireza, Wu, Bo, Shih-Fu, Chang
Published in arXiv.org (30.05.2023)
Get full text
Published in arXiv.org (30.05.2023)
Paper
Video Summarization: Towards Entity-Aware Captions
Ayyubi, Hammad A, Liu, Tianqi, Nagrani, Arsha, Lin, Xudong, Zhang, Mingda, Arnab, Anurag, Han, Feng, Zhu, Yukun, Liu, Jialu, Shih-Fu, Chang
Published in arXiv.org (01.12.2023)
Get full text
Published in arXiv.org (01.12.2023)
Paper
Generating Rationales in Visual Question Answering
Ayyubi, Hammad A, Tanjim, Md Mehrab, McAuley, Julian J, Cottrell, Garrison W
Published in arXiv.org (04.04.2020)
Get full text
Published in arXiv.org (04.04.2020)
Paper