QUAR-VLA: Vision-Language-Action Model for Quadruped Robots
Ding, Pengxiang, Zhao, Han, Song, Wenxuan, Zhang, Wenjie, Zhang, Min, Huang, Siteng, Yang, Ningxi, Wang, Donglin
Published in arXiv.org (06.07.2024)
Paper
Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference
Zhao, Han, Zhang, Min, Zhao, Wei, Ding, Pengxiang, Huang, Siteng, Wang, Donglin
Published in arXiv.org (05.06.2024)
Paper
Learning Disentangled Identifiers for Action-Customized Text-to-Image Generation
Huang, Siteng, Gong, Biao, Feng, Yutong, Chen, Xi, Fu, Yuqian, Liu, Yu, Wang, Donglin
Published in arXiv.org (10.05.2024)
Paper
Troika: Multi-Path Cross-Modal Traction for Compositional Zero-Shot Learning
Huang, Siteng, Gong, Biao, Feng, Yutong, Zhang, Min, Lv, Yiliang, Wang, Donglin
Published in arXiv.org (26.03.2024)
Paper
Prompt-based Distribution Alignment for Unsupervised Domain Adaptation
Bai, Shuanghao, Zhang, Min, Zhou, Wanqi, Huang, Siteng, Luan, Zhirong, Wang, Donglin, Chen, Badong
Published in arXiv.org (26.01.2024)
Paper
VoP: Text-Video Co-operative Prompt Tuning for Cross-Modal Retrieval
Huang, Siteng, Gong, Biao, Pan, Yulin, Jiang, Jianwen, Lv, Yiliang, Li, Yuyuan, Wang, Donglin
Published in arXiv.org (22.03.2023)
Paper
Pareto Self-Supervised Training for Few-Shot Learning
Chen, Zhengyu, Ge, Jixie, Zhan, Heshen, Huang, Siteng, Wang, Donglin
Published in arXiv.org (19.04.2021)
Paper
Model training method and image-text comparison method
Jiang, Jianwen, Pan, Yulin, Zhao, Deli, Lyu, Yiliang, Huang, Siteng, Gong, Biao
Year of Publication 30.06.2023
Patent