Merging Vision Transformers from Different Tasks and Domains

This work targets to merge various Vision Transformers (ViTs) trained on different tasks (i.e., datasets with different object categories) or domains (i.e., datasets with the same categories but different environments) into one unified model, yielding still good performance on each task or domain. P...

Full description

Saved in:
Bibliographic Details
Main Authors Ye, Peng, Huang, Chenyu, Shen, Mingzhu, Chen, Tao, Huang, Yongqi, Zhang, Yuning, Ouyang, Wanli
Format Journal Article
LanguageEnglish
Published 25.12.2023
Subjects
Online AccessGet full text

Cover

Loading…