Less Can Be More: Sound Source Localization With a Classification Model
Senocak, Arda, Ryu, Hyeonggon, Kim, Junsik, Kweon, In So
Published in 2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (01.01.2022)
Published in 2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (01.01.2022)
Get full text
Conference Proceeding
Generative Bias for Robust Visual Question Answering
Cho, Jae Won, Kim, Dong-Jin, Ryu, Hyeonggon, Kweon, In So
Published in 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (01.06.2023)
Published in 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (01.06.2023)
Get full text
Conference Proceeding
Hindi as a Second Language: Improving Visually Grounded Speech with Semantically Similar Samples
Ryu, Hyeonggon, Senocak, Arda, So Kweon, In, Son Chung, Joon
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04.06.2023)
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04.06.2023)
Get full text
Conference Proceeding
Let Me Finish My Sentence: Video Temporal Grounding with Holistic Text Understanding
Woo, Jongbhin, Ryu, Hyeonggon, Jang, Youngjoon, Cho, Jae Won, Chung, Joon Son
Year of Publication 17.10.2024
Year of Publication 17.10.2024
Get full text
Journal Article
Aligning Sight and Sound: Advanced Sound Source Localization Through Audio-Visual Alignment
Senocak, Arda, Ryu, Hyeonggon, Kim, Junsik, Oh, Tae-Hyun, Pfister, Hanspeter, Chung, Joon Son
Year of Publication 18.07.2024
Year of Publication 18.07.2024
Get full text
Journal Article
Sound Source Localization is All about Cross-Modal Alignment
Senocak, Arda, Ryu, Hyeonggon, Kim, Junsik, Oh, Tae-Hyun, Pfister, Hanspeter, Chung, Joon Son
Year of Publication 19.09.2023
Year of Publication 19.09.2023
Get full text
Journal Article
Audio-Visual Fusion Layers for Event Type Aware Video Recognition
Senocak, Arda, Kim, Junsik, Oh, Tae-Hyun, Ryu, Hyeonggon, Li, Dingzeyu, Kweon, In So
Year of Publication 11.02.2022
Year of Publication 11.02.2022
Get full text
Journal Article
Sound Source Localization is All about Cross-Modal Alignment
Senocak, Arda, Ryu, Hyeonggon, Kim, Junsik, Tae-Hyun Oh, Pfister, Hanspeter, Joon Son Chung
Published in arXiv.org (19.09.2023)
Get full text
Published in arXiv.org (19.09.2023)
Paper
Audio-Visual Fusion Layers for Event Type Aware Video Recognition
Senocak, Arda, Kim, Junsik, Tae-Hyun Oh, Ryu, Hyeonggon, Li, Dingzeyu, In So Kweon
Published in arXiv.org (12.02.2022)
Get full text
Published in arXiv.org (12.02.2022)
Paper