METHOD AND SYSTEM OF PROVIDING INTERFACE FOR VISUAL QUESTION ANSWERING

Provided is a method of providing an interface for visual question answering. The method including: receiving 360-degree video data; acquiring a plurality of images corresponding to a plurality of fields of view (FOV) based on the 360-degree video data; identifying a plurality of objects included in...

Full description

Saved in:
Bibliographic Details
Main Authors Yun, Heeseung, Yang, Wonsuk, Lee, Kang Il, Yu, Youngjae, Kim, Gunhee
Format Patent
LanguageEnglish
Published 06.07.2023
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Provided is a method of providing an interface for visual question answering. The method including: receiving 360-degree video data; acquiring a plurality of images corresponding to a plurality of fields of view (FOV) based on the 360-degree video data; identifying a plurality of objects included in the plurality of images and storing metadata regarding the plurality of objects; converting plane coordinates of each of the plurality of objects into spherical coordinates; classifying the plurality of objects into a plurality of lists; selecting a first object having a highest correct answer probability; comparing the spherical coordinates of the first object having the highest correct answer probability with the spherical coordinates of each of the remaining first objects, and removing a first object satisfying a preset condition among the remaining first objects from the plurality of images; and generating an interface for visual question answering.
Bibliography:Application Number: US202218081421