Video description generation method, device, equipment, and computer-readable storage medium
Format | Patent |
---|---|
Language | Chinese, English |
Published | 09.07.2021 |
Summary: | The invention belongs to the technical field of intelligent decision making and provides a video description generation method, device, equipment, and computer-readable storage medium. The method comprises the following steps: obtaining a to-be-described video and extracting its visual features, auditory features, and word features; encoding the visual features and the auditory features separately through a multi-modal attention mechanism subject model of a video description generation system to obtain visual coding features and auditory coding features; processing the visual coding features and the auditory coding features through an auxiliary model of the video description generation system to generate target auxiliary features; decoding the visual coding features, the auditory coding features, the target auxiliary features, and the word features through the multi-modal attention mechanism subject model to obtain a posterior probability of each keyword, and selecting a decoded word from |
---|---|
Bibliography: | Application Number: CN202110470037 |
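
The decoding step named in the summary can be illustrated with a minimal sketch. This is not the patented implementation: the record does not specify the attention form, the fusion rule, or how the decoded word is chosen (the summary is truncated at that point), so dot-product attention, element-wise mean fusion, and greedy selection of the most probable word are all assumptions made here for illustration. Every function name (`attend`, `fuse`, `decode_step`) is hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def attend(query, features):
    """Assumed dot-product attention: weighted sum of feature vectors."""
    weights = softmax([dot(query, f) for f in features])
    dim = len(features[0])
    return [sum(w * f[i] for w, f in zip(weights, features)) for i in range(dim)]


def fuse(visual, auditory, auxiliary, word):
    """Assumed fusion rule: element-wise mean of the four vectors."""
    return [(v + a + x + w) / 4.0
            for v, a, x, w in zip(visual, auditory, auxiliary, word)]


def decode_step(visual_enc, auditory_enc, auxiliary, word_feat, vocab_embeddings):
    """Return (posterior over the vocabulary, index of the most probable word)."""
    # Attend over each modality, using the current word feature as the query.
    visual_ctx = attend(word_feat, visual_enc)
    auditory_ctx = attend(word_feat, auditory_enc)
    state = fuse(visual_ctx, auditory_ctx, auxiliary, word_feat)
    # Posterior probability of each candidate word.
    posterior = softmax([dot(state, e) for e in vocab_embeddings])
    # Greedy selection is an assumption; the source does not state the rule.
    best = max(range(len(posterior)), key=posterior.__getitem__)
    return posterior, best
```

Calling `decode_step` once per output position, and feeding the feature of the selected word back in as `word_feat`, would yield a description word by word under these assumptions.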