Neural Network based Inter bi-prediction Blending

This paper presents a learning-based method to improve bi-prediction in video coding. In conventional video coding solutions, the motion compensation of blocks from already decoded reference pictures stands out as the principal tool used to predict the current frame. Especially, the bi-prediction, i...

Full description

Saved in:

Bibliographic Details
Published in	2021 International Conference on Visual Communications and Image Processing (VCIP) pp. 1 - 5
Main Authors	Galpin, Franck, Bordes, Philippe, Dumas, Thierry, Nikitin, Pavel, Le Leannec, Fabrice
Format	Conference Proceeding
Language	English
Published	IEEE 05.12.2021
Subjects	Image processing Learning systems Motion compensation Neural networks Quantization (signal) Video coding Visual communication
Online Access	Get full text

Cover

Loading…

More Information
Summary:	This paper presents a learning-based method to improve bi-prediction in video coding. In conventional video coding solutions, the motion compensation of blocks from already decoded reference pictures stands out as the principal tool used to predict the current frame. Especially, the bi-prediction, in which a block is obtained by averaging two different motion-compensated prediction blocks, significantly improves the final temporal prediction accuracy. In this context, we introduce a simple neural network that further improves the blending operation. A complexity balance, both in terms of network size and encoder mode selection, is carried out. Extensive tests on top of the recently standardized VVC codec are performed and show a BD-rate improvement of −1.4% in random access configuration for a network size of fewer than 10k parameters. We also propose a simple CPU-based implementation and direct network quantization to assess the complexity/gains tradeoff in a conventional codec framework.
ISSN:	2642-9357
DOI:	10.1109/VCIP53242.2021.9675422