IDEAL: Leveraging Infinite and Dynamic Characterizations of Large Language Models for Query-focused Summarization
Cao, Jie, Jiao, Dian, Yan, Qiang, Zhang, Wenqiao, Tang, Siliang, Zhuang, Yueting
Year of Publication 15.07.2024
Year of Publication 15.07.2024
Get full text
Journal Article
Ask Questions with Double Hints: Visual Question Generation with Answer-awareness and Region-reference
Shen, Kai, Wu, Lingfei, Tang, Siliang, Xu, Fangli, Long, Bo, Zhuang, Yueting, Pei, Jian
Published in arXiv.org (06.07.2024)
Published in arXiv.org (06.07.2024)
Get full text
Paper
Journal Article
Bridging Local Details and Global Context in Text-Attributed Graphs
Wang, Yaoke, Zhu, Yun, Zhang, Wenqiao, Zhuang, Yueting, Li, Yunfei, Tang, Siliang
Year of Publication 18.06.2024
Year of Publication 18.06.2024
Get full text
Journal Article
DIEM: Decomposition-Integration Enhancing Multimodal Insights
Jiang, Xinyi, Wang, Guoming, Guo, Junhao, Li, Juncheng, Zhang, Wenqiao, Lu, Rongxing, Tang, Siliang
Published in 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (16.06.2024)
Published in 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (16.06.2024)
Get full text
Conference Proceeding
Improving Large Models with Small models: Lower Costs and Better Performance
Chen, Dong, Zhang, Shuo, Zhuang, Yueting, Tang, Siliang, Liu, Qidong, Wang, Hua, Xu, Mingliang
Year of Publication 15.06.2024
Year of Publication 15.06.2024
Get full text
Journal Article
DuetRAG: Collaborative Retrieval-Augmented Generation
Jiao, Dian, Cai, Li, Huang, Jingsheng, Zhang, Wenqiao, Tang, Siliang, Zhuang, Yueting
Year of Publication 12.05.2024
Year of Publication 12.05.2024
Get full text
Journal Article
WorldGPT: Empowering LLM as Multimodal World Model
Ge, Zhiqi, Huang, Hongzhe, Zhou, Mingze, Li, Juncheng, Wang, Guoming, Tang, Siliang, Zhuang, Yueting
Year of Publication 28.04.2024
Year of Publication 28.04.2024
Get full text
Journal Article
GraphControl: Adding Conditional Control to Universal Graph Pre-trained Models for Graph Domain Transfer Learning
Zhu, Yun, Wang, Yaoke, Shi, Haizhou, Zhang, Zhenshuo, Jiao, Dian, Tang, Siliang
Published in arXiv.org (11.03.2024)
Published in arXiv.org (11.03.2024)
Get full text
Paper
Journal Article
HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction Data
Yu, Qifan, Li, Juncheng, Wei, Longhui, Pang, Liang, Ye, Wentao, Qin, Bosheng, Tang, Siliang, Tian, Qi, Zhuang, Yueting
Published in 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (16.06.2024)
Published in 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (16.06.2024)
Get full text
Conference Proceeding
Auto-Encoding Morph-Tokens for Multimodal LLM
Pan, Kaihang, Tang, Siliang, Li, Juncheng, Fan, Zhaoyu, Chow, Wei, Yan, Shuicheng, Chua, Tat-Seng, Zhuang, Yueting, Zhang, Hanwang
Year of Publication 03.05.2024
Year of Publication 03.05.2024
Get full text
Journal Article
LASER: Tuning-Free LLM-Driven Attention Control for Efficient Text-conditioned Image-to-Animation
Zheng, Haoyu, Zhang, Wenqiao, Wang, Yaoke, Zhou, Hao, Liu, Jiang, Li, Juncheng, Lv, Zheqi, Tang, Siliang, Zhuang, Yueting
Year of Publication 21.04.2024
Year of Publication 21.04.2024
Get full text
Journal Article
Global Structure Knowledge-Guided Relation Extraction Method for Visually-Rich Document
Chen, Xiangnan, Xiao, Qian, Li, Juncheng, Dong, Duo, Lin, Jun, Liu, Xiaozhong, Tang, Siliang
Year of Publication 23.05.2023
Year of Publication 23.05.2023
Get full text
Journal Article
InstructVid2Vid: Controllable Video Editing with Natural Language Instructions
Qin, Bosheng, Li, Juncheng, Tang, Siliang, Chua, Tat-Seng, Zhuang, Yueting
Year of Publication 20.05.2023
Year of Publication 20.05.2023
Get full text
Journal Article
Learning in Imperfect Environment: Multi-Label Classification with Long-Tailed Distribution and Partial Labels
Zhang, Wenqiao, Liu, Changshuo, Zeng, Lingze, Ooi, Beng Chin, Tang, Siliang, Zhuang, Yueting
Year of Publication 20.04.2023
Year of Publication 20.04.2023
Get full text
Journal Article
Momentor: Advancing Video Large Language Model with Fine-Grained Temporal Reasoning
Qian, Long, Li, Juncheng, Wu, Yu, Ye, Yaobo, Fei, Hao, Chua, Tat-Seng, Zhuang, Yueting, Tang, Siliang
Year of Publication 17.02.2024
Year of Publication 17.02.2024
Get full text
Journal Article