RECOGNITION DEVICE, RECOGNITION SYSTEM, AND COMPUTER PROGRAM

The present invention provides a recognition device that suppresses any decrease in recognition accuracy.　A recognition device that applies recognition processing to videos obtained through imaging comprises: a neural network 172 that extracts, from a video that includes a plurality of pixels compos...

Full description

Saved in:

Bibliographic Details
Main Authors	HACHIUMA, Ryo, SEKII, Taiki
Format	Patent
Language	English French Japanese
Published	21.12.2023
Subjects	CALCULATING COMPUTING COUNTING IMAGE DATA PROCESSING OR GENERATION, IN GENERAL PHYSICS
Online Access	Get full text

Cover

Loading…

More Information
Summary:	The present invention provides a recognition device that suppresses any decrease in recognition accuracy.　A recognition device that applies recognition processing to videos obtained through imaging comprises: a neural network 172 that extracts, from a video that includes a plurality of pixels composed in size of a first unit and furthermore includes a plurality of objects composed in size of a second unit which is larger than the first unit and smaller than the entire video, a discrete feature quantity that indicates features of the pixels composed in size of the first unit; a MaxPooling unit 173 that, when a plurality of discrete feature quantities are extracted, aggregates the extracted plurality of discrete feature quantities for each object composed in size of the second unit; and a DNN unit 178 that, on the basis of the result of aggregation, recognizes an event represented in the video. La présente invention concerne un dispositif de reconnaissance qui supprime toute diminution de la précision de reconnaissance.　Un dispositif de reconnaissance qui applique un traitement de reconnaissance à des vidéos obtenues par imagerie comprend : un réseau neuronal (172) qui extrait, d'une vidéo comprenant une pluralité de pixels dont la taille correspond à une première unité et comprenant également une pluralité d'objets dont la taille taille correspond à une seconde unité plus grande que la première unité et plus petite que la vidéo entière, une quantité de caractéristiques discrètes qui indique les caractéristiques des pixels dont la taille correspond à la première unité ; une unité de MaxPooling (173) qui, lorsqu'une pluralité de quantités de caractéristiques discrètes sont extraites, agrège la pluralité extraite de quantités de caractéristiques discrètes pour chaque objet dont la taille correspond à la seconde unité ; et une unité DNN (178) qui, d'après le résultat de l'agrégation, reconnaît un événement représenté dans la vidéo. 認識の精度の低下を抑える認識装置を提供する。　撮影により得られた映像に対して認識処理を施す認識装置は、第一単位の大きさからなる画素を複数個含むと共に、第一単位の大きさより大きく、映像全体より小さい第二単位の大きさからなるオブジェクトを複数個含む映像を対象として、第一単位の大きさからなる画素の特徴を示す個別特徴量を抽出するニューラルネットワーク１７２と、複数の個別特徴量が抽出された場合、第二単位の大きさからなるオブジェクト毎に、抽出された複数の個別特徴量を集約するＭａｘＰｏｏｌｉｎｇ部１７３と、集約結果に基づいて、映像に表された事象を認識するＤＮＮ部１７８とを備える。
Bibliography:	Application Number: WO2023JP20052