Search Results - "Hongsuck Seo, Paul" :: K.UTB vyhledávací portál

MarioQA: Answering Questions by Watching Gameplay Videos

by Jonghwan Mun, Hongsuck Seo, Paul, Ilchae Jung, Bohyung Han
Published in 2017 IEEE International Conference on Computer Vision (ICCV) (01.10.2017)

Get full text

Conference Proceeding

Loading…

Look Before you Speak: Visually Contextualized Utterances

by Hongsuck Seo, Paul, Nagrani, Arsha, Schmid, Cordelia
Published in 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (01.06.2021)

Get full text

Conference Proceeding

Loading…

End-to-end Generative Pretraining for Multimodal Video Captioning

by Seo, Paul Hongsuck, Nagrani, Arsha, Arnab, Anurag, Schmid, Cordelia
Published in 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (01.06.2022)

Get full text

Conference Proceeding

Loading…

Learning for Single-Shot Confidence Calibration in Deep Neural Networks Through Stochastic Inferences

by Seo, Seonguk, Seo, Paul Hongsuck, Han, Bohyung
Published in 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (01.06.2019)

Get full text

Conference Proceeding

Loading…

Zero-shot Referring Image Segmentation with Global-Local Context Features

by Yu, Seonghoon, Seo, Paul Hongsuck, Son, Jeany
Published in 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (01.06.2023)

Get full text

Conference Proceeding

Loading…

Image Question Answering Using Convolutional Neural Network with Dynamic Parameter Prediction

by Hyeonwoo Noh, Seo, Paul Hongsuck, Bohyung Han
Published in 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (01.06.2016)

Get full text

Conference Proceeding

Loading…

Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning

by Yang, Antoine, Nagrani, Arsha, Seo, Paul Hongsuck, Miech, Antoine, Pont-Tuset, Jordi, Laptev, Ivan, Sivic, Josef, Schmid, Cordelia
Published in 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (01.06.2023)

Get full text

Conference Proceeding

Loading…

AVFormer: Injecting Vision into Frozen Speech Models for Zero-Shot AV-ASR

by Seo, Paul Hongsuck, Nagrani, Arsha, Schmid, Cordelia
Published in 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (01.06.2023)

Get full text

Conference Proceeding

Loading…

IFSeg: Image-free Semantic Segmentation via Vision-Language Model

by Yun, Sukmin, Park, Seong Hyeon, Seo, Paul Hongsuck, Shin, Jinwoo
Published in 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (01.06.2023)

Get full text

Conference Proceeding

Loading…

Learning Audio-Video Modalities from Image Captions

by Nagrani, Arsha, Seo, Paul Hongsuck, Seybold, Bryan, Hauth, Anja, Manen, Santiago, Sun, Chen, Schmid, Cordelia
Published in Computer Vision - ECCV 2022 (2022)

Get full text

Book Chapter

Loading…

Pseudo-RIS: Distinctive Pseudo-supervision Generation for Referring Image Segmentation

by Yu, Seonghoon, Seo, Paul Hongsuck, Son, Jeany
Year of Publication 10.07.2024

Get full text

Journal Article

Loading…

Zero-shot Referring Image Segmentation with Global-Local Context Features

by Yu, Seonghoon, Seo, Paul Hongsuck, Son, Jeany
Year of Publication 31.03.2023

Get full text

Journal Article

Loading…

AVFormer: Injecting Vision into Frozen Speech Models for Zero-Shot AV-ASR

by Seo, Paul Hongsuck, Nagrani, Arsha, Schmid, Cordelia
Year of Publication 29.03.2023

Get full text

Journal Article

Loading…

AVATAR submission to the Ego4D AV Transcription Challenge

by Seo, Paul Hongsuck, Nagrani, Arsha, Schmid, Cordelia
Year of Publication 17.11.2022

Get full text

Journal Article

Loading…

Learning Correlation Structures for Vision Transformers

by Kim, Manjin, Seo, Paul Hongsuck, Schmid, Cordelia, Cho, Minsu
Published in 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (16.06.2024)

Get full text

Conference Proceeding

Loading…

Learning Correlation Structures for Vision Transformers

by Kim, Manjin, Seo, Paul Hongsuck, Schmid, Cordelia, Cho, Minsu
Year of Publication 05.04.2024

Get full text

Journal Article

Loading…

IFSeg: Image-free Semantic Segmentation via Vision-Language Model

by Yun, Sukmin, Park, Seong Hyeon, Seo, Paul Hongsuck, Shin, Jinwoo
Year of Publication 25.03.2023

Get full text

Journal Article

Loading…

Towards Open-Vocabulary Semantic Segmentation Without Semantic Labels

by Shin, Heeseong, Kim, Chaehyun, Hong, Sunghwan, Cho, Seokju, Arnab, Anurag, Seo, Paul Hongsuck, Kim, Seungryong
Year of Publication 29.09.2024

Get full text

Journal Article

Loading…

CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation

by Cho, Seokju, Shin, Heeseong, Hong, Sunghwan, Arnab, Anurag, Seo, Paul Hongsuck, Kim, Seungryong
Published in 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (16.06.2024)

Get full text

Conference Proceeding

Loading…

Look Before you Speak: Visually Contextualized Utterances

by Seo, Paul Hongsuck, Nagrani, Arsha, Schmid, Cordelia
Year of Publication 10.12.2020

Get full text

Journal Article

Refine Results

Format

Subject Area

Topic

Language

Year of Publication

Database