JourneyBench: A Challenging One-Stop Vision-Language Understanding Benchmark of Generated Images
Wang, Zhecan, Liu, Junzhang, Tang, Chia-Wei, Alomari, Hani, Sivakumar, Anushka, Sun, Rui, Li, Wenhao, Atabuzzaman, Md, Ayyubi, Hammad, You, Haoxuan, Ishmam, Alvi, Chang, Kai-Wei, Chang, Shih-Fu, Thomas, Chris
Year of Publication 19.09.2024
Year of Publication 19.09.2024
Get full text
Journal Article
Learning Visual Representation from Modality-Shared Contrastive Language-Image Pre-training
You, Haoxuan, Zhou, Luowei, Xiao, Bin, Codella, Noel, Cheng, Yu, Xu, Ruochen, Chang, Shih-Fu, Yuan, Lu
Year of Publication 26.07.2022
Year of Publication 26.07.2022
Get full text
Journal Article
Graph-MLP: Node Classification without Message Passing in Graph
Hu, Yang, You, Haoxuan, Wang, Zhecan, Wang, Zhicheng, Zhou, Erjin, Gao, Yue
Year of Publication 07.06.2021
Year of Publication 07.06.2021
Get full text
Journal Article
Multi-modality Latent Interaction Network for Visual Question Answering
Gao, Peng, You, Haoxuan, Zhang, Zhanpeng, Wang, Xiaogang, Li, Hongsheng
Year of Publication 10.08.2019
Year of Publication 10.08.2019
Get full text
Journal Article
MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning
Zhang, Haotian, Gao, Mingfei, Gan, Zhe, Dufter, Philipp, Wenzel, Nina, Huang, Forrest, Shah, Dhruti, Du, Xianzhi, Zhang, Bowen, Li, Yanghao, Dodge, Sam, You, Keen, Yang, Zhen, Timofeev, Aleksei, Xu, Mingze, Chen, Hong-You, Fauconnier, Jean-Philippe, Lai, Zhengfeng, You, Haoxuan, Wang, Zirui, Dehghan, Afshin, Grasch, Peter, Yang, Yinfei
Year of Publication 30.09.2024
Year of Publication 30.09.2024
Get full text
Journal Article
SGEITL: Scene Graph Enhanced Image-Text Learning for Visual Commonsense Reasoning
Wang, Zhecan, You, Haoxuan, Li, Liunian Harold, Zareian, Alireza, Park, Suji, Liang, Yiqing, Chang, Kai-Wei, Chang, Shih-Fu
Year of Publication 15.12.2021
Year of Publication 15.12.2021
Get full text
Journal Article
Unsupervised Vision-and-Language Pre-training Without Parallel Images and Captions
Li, Liunian Harold, You, Haoxuan, Wang, Zhecan, Zareian, Alireza, Chang, Shih-Fu, Chang, Kai-Wei
Year of Publication 24.10.2020
Year of Publication 24.10.2020
Get full text
Journal Article
Multimodal Adaptive Distillation for Leveraging Unimodal Encoders for Vision-Language Tasks
Wang, Zhecan, Codella, Noel, Chen, Yen-Chun, Zhou, Luowei, Dai, Xiyang, Xiao, Bin, Yang, Jianwei, You, Haoxuan, Chang, Kai-Wei, Chang, Shih-fu, Yuan, Lu
Year of Publication 22.04.2022
Year of Publication 22.04.2022
Get full text
Journal Article
CLIP-TD: CLIP Targeted Distillation for Vision-Language Tasks
Wang, Zhecan, Codella, Noel, Chen, Yen-Chun, Zhou, Luowei, Yang, Jianwei, Dai, Xiyang, Xiao, Bin, You, Haoxuan, Chang, Shih-Fu, Yuan, Lu
Year of Publication 14.01.2022
Year of Publication 14.01.2022
Get full text
Journal Article
PointHop: An Explainable Machine Learning Method for Point Cloud Classification
Zhang, Min, You, Haoxuan, Kadam, Pranav, Liu, Shan, C -C Jay Kuo
Published in arXiv.org (16.12.2019)
Published in arXiv.org (16.12.2019)
Get full text
Paper
Journal Article
Understanding ME? Multimodal Evaluation for Fine-grained Visual Commonsense
Wang, Zhecan, You, Haoxuan, He, Yicheng, Li, Wenhao, Kai-Wei, Chang, Shih-Fu, Chang
Published in arXiv.org (23.10.2023)
Get full text
Published in arXiv.org (23.10.2023)
Paper