Target detection method based on Transform global and local attention interaction

The invention belongs to the field of computer vision, particularly relates to a target detection method based on Transform global and local attention interaction, and aims to solve the problems of low accuracy and precision of a target detection result caused by high calculation cost, high complexi...

Full description

Saved in:
Bibliographic Details
Main Authors CHEN YANG, CHEN SIHAN, WANG KUNFENG, ZHANG SHUQIN
Format Patent
LanguageChinese
English
Published 12.07.2022
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The invention belongs to the field of computer vision, particularly relates to a target detection method based on Transform global and local attention interaction, and aims to solve the problems of low accuracy and precision of a target detection result caused by high calculation cost, high complexity and insufficient global and local interaction of a Transform model. The method comprises the following steps: preprocessing a to-be-processed two-dimensional image; performing window division by taking the image token as a unit; performing local multi-head attention calculation based on a window; performing down-sampling on the local windows, splicing the local windows into a new global window, and performing global multi-head attention calculation; global and local interaction is carried out, so that global information is supplemented to local information; image tokens are merged to obtain multi-scale features, and the multi-scale features are sent to a target detection module to obtain the category and positio
Bibliography:Application Number: CN202210399175