Information processing using a hierarchy structure of randomized samples
Main Authors | , , |
---|---|
Format | Patent |
Language | English |
Published | 08.05.2007 |
Summary: | A method is provided for retrieving information from massive databases (i.e., databases with millions of documents) in real time that allows users to control the trade-off between the accuracy of retrieved results and response time. The method may be applied to databases whose contents (i.e., documents) have been modeled with a clearly defined metric that enables computation of the distance between any two documents, so that pairs of documents that are "closer" with respect to the metric are more similar than pairs that are "further apart". The method can be applied to similarity ranking and/or combined with other methods to increase the scalability of information retrieval, detection, ranking, and tracking. |
---|
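The summary does not spell out the algorithm, so the following is only an illustrative sketch of the general idea suggested by the title and summary: ranking documents by a distance metric over nested random samples, where the caller picks a sample level to trade accuracy for speed. The function names (`build_sample_hierarchy`, `rank_similar`), the halving scheme, and the Euclidean metric are all assumptions made for this example and are not taken from the patent.

```python
import math
import random


def euclidean(a, b):
    """Assumed metric for this sketch: Euclidean distance between two vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def build_sample_hierarchy(docs, seed=0):
    """Build nested random samples of document indices (hypothetical scheme).

    Level 0 holds every index; each following level is a random half of the
    level before it, so higher levels are smaller and cheaper to search.
    """
    rng = random.Random(seed)
    levels = [list(range(len(docs)))]
    while len(levels[-1]) > 1:
        prev = levels[-1]
        levels.append(sorted(rng.sample(prev, len(prev) // 2)))
    return levels


def rank_similar(docs, levels, query, k, level, dist=euclidean):
    """Return the k indices closest to `query` among the sample at `level`.

    A lower level scans more documents (more accurate, slower); a higher
    level scans fewer (less accurate, faster).
    """
    candidates = levels[level]
    ranked = sorted(candidates, key=lambda i: dist(docs[i], query))
    return ranked[:k]


if __name__ == "__main__":
    # Toy data: 16 two-dimensional "documents" on a line.
    docs = [(float(i), 0.0) for i in range(16)]
    levels = build_sample_hierarchy(docs)
    query = (3.2, 0.0)
    print("exact (level 0):  ", rank_similar(docs, levels, query, k=3, level=0))
    print("approx (level 2): ", rank_similar(docs, levels, query, k=3, level=2))
```

In this sketch the `level` argument is the user-facing knob: level 0 gives an exact ranking over the whole collection, while higher levels return an approximate ranking computed from a fraction of the documents.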