Search Results - "Yang, Antoine" :: K.UTB search portal

Just Ask: Learning to Answer Questions from Millions of Narrated Videos

by Yang, Antoine, Miech, Antoine, Sivic, Josef, Laptev, Ivan, Schmid, Cordelia
Published in 2021 IEEE/CVF International Conference on Computer Vision (ICCV) (01.01.2021)

Conference Proceeding

TubeDETR: Spatio-Temporal Video Grounding with Transformers

by Yang, Antoine, Miech, Antoine, Sivic, Josef, Laptev, Ivan, Schmid, Cordelia
Published in 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (01.06.2022)

Conference Proceeding

Learning to Answer Visual Questions from Web Videos

by Yang, Antoine, Miech, Antoine, Sivic, Josef, Laptev, Ivan, Schmid, Cordelia
Published in IEEE transactions on pattern analysis and machine intelligence (2022)

Journal Article

Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning

by Yang, Antoine, Nagrani, Arsha, Seo, Paul Hongsuck, Miech, Antoine, Pont-Tuset, Jordi, Laptev, Ivan, Sivic, Josef, Schmid, Cordelia
Published in 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (01.06.2023)

Conference Proceeding

Manas: multi-agent neural architecture search

by Lopes, Vasco, Carlucci, Fabio Maria, Esperança, Pedro M., Singh, Marco, Yang, Antoine, Gabillon, Victor, Xu, Hang, Chen, Zewei, Wang, Jun
Published in Machine learning (2024)

Journal Article

CoVR: Learning Composed Video Retrieval from Web Video Captions

by Ventura, Lucas, Yang, Antoine, Schmid, Cordelia, Varol, Gül
Published in arXiv.org (21.05.2024)

Paper Journal Article

VidChapters-7M: Video Chapters at Scale

by Yang, Antoine, Nagrani, Arsha, Laptev, Ivan, Sivic, Josef, Schmid, Cordelia
Year of Publication 25.09.2023

Journal Article

Zero-Shot Video Question Answering via Frozen Bidirectional Language Models

by Yang, Antoine, Miech, Antoine, Sivic, Josef, Laptev, Ivan, Schmid, Cordelia
Published in arXiv.org (10.10.2022)

Paper Journal Article

TubeDETR: Spatio-Temporal Video Grounding with Transformers

by Yang, Antoine, Miech, Antoine, Sivic, Josef, Laptev, Ivan, Schmid, Cordelia
Published in arXiv.org (09.06.2022)

Paper Journal Article

Learning to Answer Visual Questions from Web Videos

by Yang, Antoine, Miech, Antoine, Sivic, Josef, Laptev, Ivan, Schmid, Cordelia
Published in arXiv.org (11.05.2022)

Paper Journal Article

Just Ask: Learning to Answer Questions from Millions of Narrated Videos

by Yang, Antoine, Miech, Antoine, Sivic, Josef, Laptev, Ivan, Schmid, Cordelia
Year of Publication 01.12.2020

Journal Article

Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning

by Yang, Antoine, Nagrani, Arsha, Seo, Paul Hongsuck, Miech, Antoine, Pont-Tuset, Jordi, Laptev, Ivan, Sivic, Josef, Schmid, Cordelia
Published in arXiv.org (21.03.2023)

Paper Journal Article

MANAS: Multi-Agent Neural Architecture Search

by Lopes, Vasco, Carlucci, Fabio Maria, Esperança, Pedro M, Singh, Marco, Gabillon, Victor, Yang, Antoine, Xu, Hang, Chen, Zewei, Wang, Jun
Published in arXiv.org (12.01.2023)

Paper Journal Article