Video description generation method, device and equipment and computer readable storage medium

The invention belongs to the technical field of intelligent decision making, and provides a video description generation method, device and equipment and a computer readable storage medium, and the method comprises the steps: obtaining a to-be-described video, and extracting the visual features, aud...

Full description

Saved in:
Bibliographic Details
Main Authors LUO JIAN, CHENG NING, WANG JIANZONG
Format Patent
LanguageChinese
English
Published 09.07.2021
Subjects
Online AccessGet full text

Cover

More Information
Summary:The invention belongs to the technical field of intelligent decision making, and provides a video description generation method, device and equipment and a computer readable storage medium, and the method comprises the steps: obtaining a to-be-described video, and extracting the visual features, auditory features and word features of the to-be-described video; respectively coding the visual features and the auditory features through a multi-modal attention mechanism subject model of the video description generation system to obtain visual coding features and auditory coding features; processing the visual coding features and the auditory coding features through an auxiliary model of a video description generation system to generate target auxiliary features; decoding the visual coding feature, the auditory coding feature, the target auxiliary feature and the word feature through a multi-modal attention mechanism subject model to obtain a posterior probability of each keyword, and selecting a decoded word from
Bibliography:Application Number: CN202110470037