Video description generation method, device and equipment and computer readable storage medium


Bibliographic Details
Main Authors	LUO JIAN, CHENG NING, WANG JIANZONG
Format	Patent
Language	Chinese, English
Published	09.07.2021
Subjects	CALCULATING; COMPUTING; COUNTING; ELECTRIC DIGITAL DATA PROCESSING; HANDLING RECORD CARRIERS; PHYSICS; PRESENTATION OF DATA; RECOGNITION OF DATA; RECORD CARRIERS
Summary:	The invention belongs to the technical field of intelligent decision making and provides a video description generation method, device, equipment, and computer-readable storage medium. The method comprises the steps of: obtaining a to-be-described video and extracting its visual features, auditory features, and word features; encoding the visual features and the auditory features separately through the multi-modal attention mechanism subject model of a video description generation system to obtain visual coding features and auditory coding features; processing the visual coding features and the auditory coding features through an auxiliary model of the video description generation system to generate target auxiliary features; decoding the visual coding features, the auditory coding features, the target auxiliary features, and the word features through the multi-modal attention mechanism subject model to obtain a posterior probability for each keyword; and selecting a decoded word from
Bibliography:	Application Number: CN202110470037
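
Illustrative sketch (not taken from the patent): the steps listed in the Summary can be read as a single decoding step of an encoder-decoder pipeline. The toy Python below walks through that reading under loud assumptions. Dot-product attention stands in for the multi-modal attention mechanism, an element-wise mean stands in for the auxiliary model, and a random linear projection stands in for the decoder. The Summary is cut off before it says how the decoded word is selected, so the argmax used here is only a placeholder. All names (`attend`, `describe_step`, `VOCAB`, `OUT_WEIGHTS`) and all numbers are hypothetical.

```python
import math
import random


def softmax(scores):
    """Turn raw scores into a probability distribution."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(query, frames):
    """Dot-product attention: average the frame vectors, weighted by
    the softmax of each frame's similarity to the query vector."""
    weights = softmax([sum(q * x for q, x in zip(query, f)) for f in frames])
    dim = len(frames[0])
    return [sum(w * f[i] for w, f in zip(weights, frames)) for i in range(dim)]


def describe_step(visual, audio, word_vec, vocab, out_weights):
    """One hypothetical decoding step over the four feature groups."""
    # Encode each modality separately (stand-in for the subject model's encoder).
    vis_code = attend(word_vec, visual)
    aud_code = attend(word_vec, audio)
    # Stand-in for the auxiliary model: element-wise mean of the two codes.
    aux = [(v + a) / 2 for v, a in zip(vis_code, aud_code)]
    # Decode: concatenate everything and project onto the vocabulary.
    fused = vis_code + aud_code + aux + word_vec
    logits = [sum(w * x for w, x in zip(row, fused)) for row in out_weights]
    posterior = softmax(logits)
    # Assumption: pick the most probable keyword (the source does not say this).
    best = max(range(len(vocab)), key=posterior.__getitem__)
    return vocab[best], posterior


rng = random.Random(0)  # fixed seed so the toy data is reproducible
DIM = 4
VOCAB = ["a", "dog", "barks", "<eos>"]  # hypothetical keyword list
visual = [[rng.uniform(-1, 1) for _ in range(DIM)] for _ in range(3)]  # 3 frames
audio = [[rng.uniform(-1, 1) for _ in range(DIM)] for _ in range(2)]   # 2 clips
word_vec = [rng.uniform(-1, 1) for _ in range(DIM)]  # previous-word feature
OUT_WEIGHTS = [[rng.uniform(-1, 1) for _ in range(4 * DIM)] for _ in VOCAB]

word, posterior = describe_step(visual, audio, word_vec, VOCAB, OUT_WEIGHTS)
print(word, [round(p, 3) for p in posterior])
```

In a real system each stand-in would be a trained network, and the step would be repeated, feeding each selected word back in as the next word feature, until an end-of-sentence token is produced.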