A signature file scheme based on multiple organizations for indexing very large text databases
Published in | Journal of the American Society for Information Science Vol. 41; no. 7; pp. 508 - 534 |
---|---|
Main Authors | , , |
Format | Journal Article |
Language | English |
Published | Washington, D.C.; Wiley Subscription Services, Inc., A Wiley Company; 01.10.1990; John Wiley & Sons; American Documentation Institute; Wiley Periodicals Inc |
Subjects | |
ISSN | 0002-8231 1097-4571 |
DOI | 10.1002/(SICI)1097-4571(199010)41:7<508::AID-ASI5>3.0.CO;2-J |
Summary: | A new signature file method for accessing information from large databases containing both formatted and free text data is presented. The new method, called the multiorganizational scheme, is proposed for indexing very large databases containing hundreds of thousands or possibly millions of records. With this method, records are grouped into blocks and signatures are formed for each block of records. These signatures are stored in a block descriptor file using a storage device called the bit slice organization. By forming multiple block descriptor files, each based on a possibly different grouping of records into blocks, it is possible to efficiently determine which records match a query. Both computational results based on a mathematical model and experimental results using a library database are presented. These results show that the method provides effective access to large text databases. © 1990 John Wiley & Sons, Inc. |
---|---|
Bibliography: | ArticleID:ASI5; istex:A342B2CD7826D983F75BBE7BCC000F245F4AEFEC; ark:/67375/WNG-6N54W435-5 |
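
The summary describes the scheme only in outline, so the following Python sketch is an illustration of the general idea rather than the authors' implementation. It assumes superimposed coding with a SHA-256-based hash, a 64-bit signature, three bits per term, and two hypothetical groupings of records into blocks (`org_a`, `org_b`); none of these choices come from the article. Each block descriptor file is stored bit-sliced: slice `j` is an integer whose bit `b` is set when block `b`'s signature has bit `j` set. A query intersects the candidate blocks from every organization and then checks the surviving records directly to discard false matches.

```python
import hashlib

WIDTH = 64  # signature width in bits (illustrative assumption)
K = 3       # hash positions derived per term (illustrative assumption)

def term_bits(term):
    """Bit positions a term sets, derived from a SHA-256 digest."""
    digest = hashlib.sha256(term.lower().encode("utf-8")).digest()
    return {int.from_bytes(digest[2 * i:2 * i + 2], "big") % WIDTH
            for i in range(K)}

def build_descriptor_file(records, block_of):
    """Bit-sliced block descriptor file for one grouping of records."""
    slices = [0] * WIDTH
    for rec_id, text in enumerate(records):
        block = block_of(rec_id)
        for term in text.split():
            for j in term_bits(term):
                slices[j] |= 1 << block  # OR the term into the block signature
    return slices

def candidate_blocks(slices, query_terms, n_blocks):
    """AND together only the slices that the query signature selects."""
    mask = (1 << n_blocks) - 1
    for term in query_terms:
        for j in term_bits(term):
            mask &= slices[j]
    return {b for b in range(n_blocks) if (mask >> b) & 1}

def search(records, organizations, query_terms):
    """Intersect candidates across organizations, then verify each record."""
    candidates = set(range(len(records)))
    for slices, block_of, n_blocks in organizations:
        blocks = candidate_blocks(slices, query_terms, n_blocks)
        candidates = {r for r in candidates if block_of(r) in blocks}
    wanted = {t.lower() for t in query_terms}
    return sorted(r for r in candidates
                  if wanted <= set(records[r].lower().split()))

records = [
    "signature file text retrieval",
    "bit slice storage",
    "library catalogue records",
    "text database indexing",
    "inverted file index",
    "signature bit slice",
]
N_BLOCKS = 3
org_a = lambda r: r // 2  # hypothetical grouping: consecutive records
org_b = lambda r: r % 3   # hypothetical grouping: interleaved records
organizations = [(build_descriptor_file(records, f), f, N_BLOCKS)
                 for f in (org_a, org_b)]

print(search(records, organizations, ["bit", "slice"]))
```

Because each organization groups the records differently, a record survives only if its block qualifies in every descriptor file, which narrows the set that has to be checked against the stored text.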