Just Ask: Learning to Answer Questions from Millions of Narrated Videos
Yang, Antoine, Miech, Antoine, Sivic, Josef, Laptev, Ivan, Schmid, Cordelia
Published in 2021 IEEE/CVF International Conference on Computer Vision (ICCV) (01.01.2021)
Conference Proceeding
TubeDETR: Spatio-Temporal Video Grounding with Transformers
Yang, Antoine, Miech, Antoine, Sivic, Josef, Laptev, Ivan, Schmid, Cordelia
Published in 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (01.06.2022)
Conference Proceeding
Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning
Yang, Antoine, Nagrani, Arsha, Seo, Paul Hongsuck, Miech, Antoine, Pont-Tuset, Jordi, Laptev, Ivan, Sivic, Josef, Schmid, Cordelia
Published in 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (01.06.2023)
Conference Proceeding
MANAS: Multi-Agent Neural Architecture Search
Lopes, Vasco, Carlucci, Fabio Maria, Esperança, Pedro M., Singh, Marco, Yang, Antoine, Gabillon, Victor, Xu, Hang, Chen, Zewei, Wang, Jun
Published in Machine Learning (2024)
Journal Article
VidChapters-7M: Video Chapters at Scale
Yang, Antoine, Nagrani, Arsha, Laptev, Ivan, Sivic, Josef, Schmid, Cordelia
Year of Publication 25.09.2023
Journal Article
Zero-Shot Video Question Answering via Frozen Bidirectional Language Models
Yang, Antoine, Miech, Antoine, Sivic, Josef, Laptev, Ivan, Schmid, Cordelia
Published in arXiv.org (10.10.2022)
Paper
Journal Article
TubeDETR: Spatio-Temporal Video Grounding with Transformers
Yang, Antoine, Miech, Antoine, Sivic, Josef, Laptev, Ivan, Schmid, Cordelia
Published in arXiv.org (09.06.2022)
Paper
Journal Article
Learning to Answer Visual Questions from Web Videos
Yang, Antoine, Miech, Antoine, Sivic, Josef, Laptev, Ivan, Schmid, Cordelia
Published in arXiv.org (11.05.2022)
Paper
Journal Article
Just Ask: Learning to Answer Questions from Millions of Narrated Videos
Yang, Antoine, Miech, Antoine, Sivic, Josef, Laptev, Ivan, Schmid, Cordelia
Year of Publication 01.12.2020
Journal Article
Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning
Yang, Antoine, Nagrani, Arsha, Seo, Paul Hongsuck, Miech, Antoine, Pont-Tuset, Jordi, Laptev, Ivan, Sivic, Josef, Schmid, Cordelia
Published in arXiv.org (21.03.2023)
Paper
Journal Article
MANAS: Multi-Agent Neural Architecture Search
Lopes, Vasco, Carlucci, Fabio Maria, Esperança, Pedro M., Singh, Marco, Gabillon, Victor, Yang, Antoine, Xu, Hang, Chen, Zewei, Wang, Jun
Published in arXiv.org (12.01.2023)
Paper
Journal Article
Just Ask: Learning to Answer Questions from Millions of Narrated Videos
Yang, Antoine, Miech, Antoine, Sivic, Josef, Laptev, Ivan, Schmid, Cordelia
Published in arXiv.org (12.08.2021)
Paper