A signature file scheme based on multiple organizations for indexing very large text databases

Bibliographic Details
Published in Journal of the American Society for Information Science Vol. 41; no. 7; pp. 508-534
Main Authors Kent, A., Sacks-Davis, R., Ramamohanarao, K.
Format Journal Article
Language English
Published Washington, D.C. Wiley Subscription Services, Inc., A Wiley Company 01.10.1990
John Wiley & Sons
American Documentation Institute
Wiley Periodicals Inc
ISSN 0002-8231
1097-4571
DOI 10.1002/(SICI)1097-4571(199010)41:7<508::AID-ASI5>3.0.CO;2-J

Summary: A new signature file method for accessing information from large databases containing both formatted and free text data is presented. The new method, called the multiorganizational scheme, is proposed for indexing very large databases containing hundreds of thousands or possibly millions of records. With this method, records are grouped into blocks and signatures are formed for each block of records. These signatures are stored in a block descriptor file using a storage device called the bit slice organization. By forming multiple block descriptor files, each based on a possibly different grouping of records into blocks, it is possible to efficiently determine which records match a query. Both computational results based on a mathematical model and experimental results using a library database are presented. These results show that the method provides effective access to large text databases. © 1990 John Wiley & Sons, Inc.
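
The idea described in the summary can be illustrated with a minimal Python sketch. It is not the authors' implementation: the signature width, the number of bits per term, the SHA-256 based hashing, the two toy groupings and all function names are assumptions chosen for illustration only. Each block signature is the OR of the term signatures of its records (superimposed coding), each descriptor file is stored as bit slices (one integer per signature bit, one bit per block), and a record remains a candidate only if its block matches in every descriptor file.

```python
import hashlib

SIG_BITS = 64       # signature width in bits (illustrative value)
BITS_PER_TERM = 3   # bits set per term (illustrative value)


def term_bits(term):
    """Return the signature bit positions that one term sets."""
    digest = hashlib.sha256(term.lower().encode("utf-8")).digest()
    return {int.from_bytes(digest[2 * i:2 * i + 2], "big") % SIG_BITS
            for i in range(BITS_PER_TERM)}


def build_descriptor_file(records, block_of):
    """Build one bit-sliced block descriptor file.

    slices[b] is an integer whose bit j is 1 when the signature of
    block j has bit b set, so one slice covers every block at once.
    """
    slices = [0] * SIG_BITS
    for rec_id, terms in enumerate(records):
        for term in terms:
            for b in term_bits(term):
                slices[b] |= 1 << block_of(rec_id)
    return slices


def candidate_blocks(slices, query_terms, n_blocks):
    """AND together only the slices that the query signature needs."""
    mask = (1 << n_blocks) - 1
    for term in query_terms:
        for b in term_bits(term):
            mask &= slices[b]
    return {j for j in range(n_blocks) if (mask >> j) & 1}


def candidate_records(n_records, query_terms, index):
    """Keep records whose block matches in every descriptor file."""
    candidates = set(range(n_records))
    for block_of, n_blocks, slices in index:
        blocks = candidate_blocks(slices, query_terms, n_blocks)
        candidates = {r for r in candidates if block_of(r) in blocks}
    return candidates


def search(records, query_terms, index):
    """Verify candidates against the records to discard false matches."""
    query = {t.lower() for t in query_terms}
    return sorted(r for r in candidate_records(len(records), query_terms, index)
                  if query <= records[r])


# Toy data: each record is a set of lowercase terms.
records = [
    {"signature", "file", "index"}, {"text", "database"},
    {"bit", "slice"}, {"signature", "text"},
    {"library", "catalogue"}, {"block", "descriptor"},
    {"signature", "file", "text"}, {"query", "match"},
]

# Two different groupings of the same 8 records into blocks.
groupings = [(lambda r: r // 4, 2), (lambda r: r % 4, 4)]
index = [(f, n, build_descriptor_file(records, f)) for f, n in groupings]

print(search(records, ["signature", "text"], index))
```

In this sketch a query reads only the slices for the bits set in its own signature, and intersecting the candidate blocks from several groupings narrows the set of records that must be fetched and checked. The final check in search is needed because superimposed signatures can report blocks that do not actually contain a matching record.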
Bibliography:ArticleID:ASI5
istex:A342B2CD7826D983F75BBE7BCC000F245F4AEFEC
ark:/67375/WNG-6N54W435-5