MarioQA: Answering Questions by Watching Gameplay Videos
Mun, Jonghwan, Seo, Paul Hongsuck, Jung, Ilchae, Han, Bohyung
Published in 2017 IEEE International Conference on Computer Vision (ICCV) (01.10.2017)
Conference Proceeding
Look Before you Speak: Visually Contextualized Utterances
Seo, Paul Hongsuck, Nagrani, Arsha, Schmid, Cordelia
Published in 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (01.06.2021)
Conference Proceeding
End-to-end Generative Pretraining for Multimodal Video Captioning
Seo, Paul Hongsuck, Nagrani, Arsha, Arnab, Anurag, Schmid, Cordelia
Published in 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (01.06.2022)
Conference Proceeding
Zero-shot Referring Image Segmentation with Global-Local Context Features
Yu, Seonghoon, Seo, Paul Hongsuck, Son, Jeany
Published in 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (01.06.2023)
Conference Proceeding
Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning
Yang, Antoine, Nagrani, Arsha, Seo, Paul Hongsuck, Miech, Antoine, Pont-Tuset, Jordi, Laptev, Ivan, Sivic, Josef, Schmid, Cordelia
Published in 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (01.06.2023)
Conference Proceeding
AVFormer: Injecting Vision into Frozen Speech Models for Zero-Shot AV-ASR
Seo, Paul Hongsuck, Nagrani, Arsha, Schmid, Cordelia
Published in 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (01.06.2023)
Conference Proceeding
IFSeg: Image-free Semantic Segmentation via Vision-Language Model
Yun, Sukmin, Park, Seong Hyeon, Seo, Paul Hongsuck, Shin, Jinwoo
Published in 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (01.06.2023)
Conference Proceeding
Learning Audio-Video Modalities from Image Captions
Nagrani, Arsha, Seo, Paul Hongsuck, Seybold, Bryan, Hauth, Anja, Manen, Santiago, Sun, Chen, Schmid, Cordelia
Published in Computer Vision - ECCV 2022 (2022)
Book Chapter
Learning Correlation Structures for Vision Transformers
Kim, Manjin, Seo, Paul Hongsuck, Schmid, Cordelia, Cho, Minsu
Published in 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (16.06.2024)
Conference Proceeding
Towards Open-Vocabulary Semantic Segmentation Without Semantic Labels
Shin, Heeseong, Kim, Chaehyun, Hong, Sunghwan, Cho, Seokju, Arnab, Anurag, Seo, Paul Hongsuck, Kim, Seungryong
Published 29.09.2024
Journal Article
CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation
Cho, Seokju, Shin, Heeseong, Hong, Sunghwan, Arnab, Anurag, Seo, Paul Hongsuck, Kim, Seungryong
Published in 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (16.06.2024)
Conference Proceeding