A Simple LLM Framework for Long-Range Video Question-Answering
Zhang, Ce, Lu, Taixi, Islam, Md Mohaiminul, Wang, Ziyang, Yu, Shoubin, Bansal, Mohit, Bertasius, Gedas
Year of Publication 28.12.2023
Year of Publication 28.12.2023
Get full text
Journal Article
Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences
Wang, Xiyao, Zhou, Yuhang, Liu, Xiaoyu, Lu, Hongjin, Xu, Yuancheng, He, Feihong, Yoon, Jaehong, Lu, Taixi, Bertasius, Gedas, Bansal, Mohit, Yao, Huaxiu, Huang, Furong
Year of Publication 19.01.2024
Year of Publication 19.01.2024
Get full text
Journal Article
A Simple LLM Framework for Long-Range Video Question-Answering
Zhang, Ce, Lu, Taixi, Islam, Md Mohaiminul, Wang, Ziyang, Yu, Shoubin, Bansal, Mohit, Bertasius, Gedas
Published in arXiv.org (10.10.2024)
Get full text
Published in arXiv.org (10.10.2024)
Paper
Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences
Wang, Xiyao, Zhou, Yuhang, Liu, Xiaoyu, Lu, Hongjin, Xu, Yuancheng, He, Feihong, Yoon, Jaehong, Lu, Taixi, Bertasius, Gedas, Bansal, Mohit, Yao, Huaxiu, Huang, Furong
Published in arXiv.org (25.01.2024)
Get full text
Published in arXiv.org (25.01.2024)
Paper