Information processing using a hierarchy structure of randomized samples

Bibliographic Details
Main Authors: Aono, Masaki; Houle, Michael Edward; Kobayashi, Mei
Format: Patent
Language: English
Published: 08.05.2007
Summary: A method is provided for retrieving information from massive databases (i.e., databases with millions of documents) in real time while allowing users to control the trade-off between the accuracy of retrieved results and response time. The method may be applied to databases whose contents, i.e., documents, have been modeled with a clearly defined metric that enables computation of the distance between any two documents, so that pairs of documents that are "closer" with respect to the metric are more similar than pairs that are "further apart". The method can be applied to similarity ranking and/or combined with other methods to increase the scalability of information retrieval, detection, ranking, and tracking.
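
Illustration: The summary does not spell out the patented procedure, so the following Python sketch is only a simplified illustration of the general idea suggested by the title and summary: documents are compared with a metric, and searching a smaller random sample trades accuracy for speed. The function names (`euclidean`, `build_sample_levels`, `rank_similar`), the halving rule for the samples, and the choice of Euclidean distance are all assumptions made for this example and are not taken from the patent.

```python
import math
import random


def euclidean(a, b):
    """Assumed metric: Euclidean distance between two document vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def build_sample_levels(n_docs, seed=0):
    """Build nested random samples of document ids (hypothetical scheme).

    Level 0 holds every id; each further level is a random half of the
    level before it, so higher levels are smaller and cheaper to scan.
    """
    rng = random.Random(seed)
    levels = [list(range(n_docs))]
    while len(levels[-1]) > 1:
        prev = levels[-1]
        levels.append(rng.sample(prev, len(prev) // 2))
    return levels


def rank_similar(docs, levels, query, k, level):
    """Return up to k ids from the chosen level, closest to the query first."""
    candidates = levels[level]
    ranked = sorted(candidates, key=lambda i: euclidean(docs[i], query))
    return ranked[:k]


if __name__ == "__main__":
    docs = [(float(i), 0.0) for i in range(8)]
    levels = build_sample_levels(len(docs))
    # Level 0 scans every document (exact ranking); higher levels scan fewer.
    print(rank_similar(docs, levels, (2.2, 0.0), k=3, level=0))
    print(rank_similar(docs, levels, (2.2, 0.0), k=3, level=2))
```

In this sketch the `level` argument plays the role of the user-controlled trade-off: level 0 ranks against every document and is exact, while each higher level scans roughly half as many documents and may miss some of the true nearest neighbors.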