TEMPORAL STRUCTURE-BASED CONDITIONAL CONVOLUTIONAL NEURAL NETWORKS FOR VIDEO COMPRESSION

Video encoding and decoding is implemented with auto encoders using luminance information to derive motion information for chrominance prediction. In one embodiment conditional convolutions are used to encode motion flow information. A current condition, for example, GOP structure, is used as input...

Full description

Saved in:
Bibliographic Details
Main Authors BEGAINT, Jean, RACAPE, Fabien, PUSHPARAJA, Akshay, FELTMAN, Simon
Format Patent
LanguageEnglish
Published 06.06.2024
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Video encoding and decoding is implemented with auto encoders using luminance information to derive motion information for chrominance prediction. In one embodiment conditional convolutions are used to encode motion flow information. A current condition, for example, GOP structure, is used as input to a succession of fully connected layers to implement the conditional convolution. In a related embodiment, more than one reference frame is used to encode motion flow information.
Bibliography:Application Number: US202218281844