METHOD AND SYSTEM OF PROVIDING INTERFACE FOR VISUAL QUESTION ANSWERING
Main Authors | , , , , |
---|---|
Format | Patent |
Language | English |
Published | 06.07.2023 |
Subjects | |
Summary: | Provided is a method of providing an interface for visual question answering. The method includes: receiving 360-degree video data; acquiring a plurality of images corresponding to a plurality of fields of view (FOV) based on the 360-degree video data; identifying a plurality of objects included in the plurality of images and storing metadata regarding the plurality of objects; converting plane coordinates of each of the plurality of objects into spherical coordinates; classifying the plurality of objects into a plurality of lists; selecting a first object having a highest correct answer probability; comparing the spherical coordinates of the first object having the highest correct answer probability with the spherical coordinates of each of the remaining first objects, and removing a first object satisfying a preset condition among the remaining first objects from the plurality of images; and generating an interface for visual question answering. |
---|---|
Bibliography: | Application Number: US202218081421 |
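
The coordinate conversion and duplicate-removal steps in the summary can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes that object centers are given as pixel positions in an equirectangular frame, that the "plurality of lists" are grouped by object label, and that the unspecified "preset condition" is an angular-distance threshold. The names `Detection`, `plane_to_spherical`, `angular_distance`, `deduplicate`, and `max_angle` are hypothetical, and the repeated keep-best loop is a non-maximum-suppression-style stand-in for the selection step.

```python
import math
from dataclasses import dataclass


@dataclass
class Detection:
    label: str    # object class; assumed here to be the key for each list
    score: float  # stand-in for the "correct answer probability"
    lon: float    # spherical longitude in radians, [-pi, pi)
    lat: float    # spherical latitude in radians, [-pi/2, pi/2]


def plane_to_spherical(x, y, width, height):
    """Map an equirectangular pixel (x, y) to (lon, lat) in radians (assumed layout)."""
    lon = (x / width) * 2.0 * math.pi - math.pi
    lat = math.pi / 2.0 - (y / height) * math.pi
    return lon, lat


def angular_distance(a, b):
    """Great-circle angle between two detections, in radians."""
    cos_angle = (math.sin(a.lat) * math.sin(b.lat)
                 + math.cos(a.lat) * math.cos(b.lat) * math.cos(a.lon - b.lon))
    # Clamp to guard against floating-point drift outside [-1, 1].
    return math.acos(max(-1.0, min(1.0, cos_angle)))


def deduplicate(detections, max_angle=math.radians(10.0)):
    """Per label, keep the highest-scoring detection and drop others within max_angle.

    The threshold is an assumed "preset condition", not taken from the patent.
    """
    by_label = {}
    for det in detections:
        by_label.setdefault(det.label, []).append(det)

    kept = []
    for group in by_label.values():
        remaining = sorted(group, key=lambda d: d.score, reverse=True)
        while remaining:
            best = remaining.pop(0)
            kept.append(best)
            remaining = [d for d in remaining
                         if angular_distance(best, d) > max_angle]
    return kept
```

Comparing objects in spherical rather than plane coordinates means two detections of the same object in overlapping fields of view, or on either side of the frame's left and right edges, end up close together and can be merged.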