A signature file scheme based on multiple organizations for indexing very large text databases
A new signature file method for accessing information from large databases containing both formatted and free text data is presented. The new method, called the multiorganizational scheme is proposed for indexing very large databases containing hundreds of thousands or possibly millions of records....
Saved in:
Published in | Journal of the American Society for Information Science Vol. 41; no. 7; pp. 508 - 534 |
---|---|
Main Authors | , , |
Format | Journal Article |
Language | English |
Published |
Washington, D.C
Wiley Subscription Services, Inc., A Wiley Company
01.10.1990
John Wiley & Sons American Documentation Institute Wiley Periodicals Inc |
Subjects | |
Online Access | Get full text |
ISSN | 0002-8231 1097-4571 |
DOI | 10.1002/(SICI)1097-4571(199010)41:7<508::AID-ASI5>3.0.CO;2-J |
Cover
Abstract | A new signature file method for accessing information from large databases containing both formatted and free text data is presented. The new method, called the multiorganizational scheme is proposed for indexing very large databases containing hundreds of thousands or possibly millions of records. With this method, records are grouped into blocks and signatures are formed for each block of records. These signatures are stored in a block descriptor file using a storage device called the bit slice organization. By forming multiple block descriptor files, each based on a possibly different grouping of records into blocks, it is possible to efficiently determine record matches on query. Both computational results based on a mathematical model as well as experimental results using a library database are presented. These results show that the method provides effective access to large text databases. © 1990 John Wiley & Sons, Inc. |
---|---|
AbstractList | A new signature file method is presented for accessing information from large databases containing both formatted and free text data. The new method, called the multiorganizational scheme, is proposed for indexing very large databases containing hundreds of thousands or even millions of records. Using this method, it is possible to group records into blocks and to form signatures for each block of records. A storage device called the bit slice organization is used to store these signatures in a block descriptor file. By forming multiple block descriptor files, each based on a possibly different grouping of records into blocks, it is possible to efficiently determine record matches on query. Computational results based on a mathematical model as well as experimental results using a library database are presented. These results indicate that the method provides effective access to large text databases. A new signature file method for accessing information from large databases containing both formatted and free text data is presented. The new method, called the multiorganizational scheme is proposed for indexing very large databases containing hundreds of thousands or possibly millions of records. With this method, records are grouped into blocks and signatures are formed for each block of records. These signatures are stored in a block descriptor file using a storage device called the bit slice organization. By forming multiple block descriptor files, each based on a possibly different grouping of records into blocks, it is possible to efficiently determine record matches on query. Both computational results based on a mathematical model as well as experimental results using a library database are presented. These results show that the method provides effective access to large text databases. © 1990 John Wiley & Sons, Inc. Presents a new signature file method for accessing information from large data bases containing both formatted and free text data. The new method, called the multiorganisational scheme is proposed for indexing very large data bases containing possibly millions of records. Records are grouped into blocks and signatures are formed for each block of records. These signatures are stored in block descriptor file using a storage device called the bit slice organisation. By forming multiple block descriptor files, each based on a possibly different grouping of records into blocks, it is possible to determine efficiently record matches on query. 00 Original abstract--amended Describes a new signature file method, called the multiorganizational scheme, for accessing information from large databases containing both formatted and free-text data. Implementation issues are discussed, and computational results based on a mathematical model are presented, as well as results using a library database. (43 references) (MES) A new signature file method for accessing information from large databases containing both formatted and free text data is presented. The new method, called the multiorganizational scheme is proposed for indexing very large databases containing hundreds of thousands or possibly millions of records. With this method, records are grouped into blocks and signatures are formed for each block of records. These signatures are stored in a block descriptor file using a storage device called the bit slice organization. By forming multiple block descriptor files, each based on a possibly different grouping of records into blocks, it is possible to efficiently determine record matches on query. Both computational results based on a mathematical model as well as experimental results using a library database are presented. These results show that the method provides effective access to large text databases. |
Author | Kent, A. Sacks-Davis, R. Ramamohanarao, K. |
Author_xml | – sequence: 1 givenname: A. surname: Kent fullname: Kent, A. organization: Department of Computer Science, Royal Melbourne Institute of Technology, Melbourne, Victoria 3000, Australia – sequence: 2 givenname: R. surname: Sacks-Davis fullname: Sacks-Davis, R. organization: Department of Computer Science, Royal Melbourne Institute of Technology, Melbourne, Victoria 3000, Australia – sequence: 3 givenname: K. surname: Ramamohanarao fullname: Ramamohanarao, K. organization: Department of Computer Science, University of Melbourne, Parkville, Victoria 3052, Australia |
BackLink | http://eric.ed.gov/ERICWebPortal/detail?accno=EJ418645$$DView record in ERIC http://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=19423733$$DView record in Pascal Francis |
BookMark | eNqFkl1v0zAUhiM0JLrBP-DCAsG2ixR_xnGZJlVljFbTerGh3XHkuE7xSJNip7Dy63HaqZP4vLKO3sePZZ13P9mrm9omyQnBfYIxfXN0NR6NjwlWMuVCkiOiFCb4mJOBPBE4HwyG43fp8GosTlkf90fTtzSdPEp6uwt7SQ9HT5pTRp4k-yHcxlGpTPSST0MU3LzW7cpbVLrKomA-24VFhQ52hpoaLVZV65YxaPxc1-6Hbl1TB1Q2Hrl6Zu9cPUffrF-jSvu5Ra29a9FMt7oThKfJ41JXwT67Pw-Sj-_Prkcf0ovp-Xg0vEiNIJlIqbLSSEuFtYQqUnCaFdyWuTJGiZyWOZ9l1MyKvBQKF4ZKJSnBOefY8oKwjB0kh1vv0jdfVza0sHDB2KrStW1WASRnnEeORPL1P0khKWeYsQi--AW8bVa-jr8ASrJcMYJ5hF7-DSIMEyoysnnz1T2lg9FV6XVtXICldwvt10AUp0xunny-5ax3ZhefTTjJMy5ifL2NjW9C8LZ8MGDoigLQFQW6vUO3d9gWBTgBCbEoALEo0BUFGGAYTYHC5EH7Pa5__ZvzP8o_GDdz1KZbrQuxEzut9l8gk0wKuLk8h-xS8BvOBAj2E5Ko3LQ |
CODEN | AISJB6 |
ContentType | Journal Article |
Copyright | Copyright © 1990 John Wiley & Sons, Inc. 1991 INIST-CNRS Copyright Wiley Periodicals Inc. Oct 1990 |
Copyright_xml | – notice: Copyright © 1990 John Wiley & Sons, Inc. – notice: 1991 INIST-CNRS – notice: Copyright Wiley Periodicals Inc. Oct 1990 |
DBID | BSCLL AAYXX CITATION 7SW BJH BNH BNI BNJ BNO ERI PET REK WWN IQODW AGQHT AIATT APEJR FYSDU GHEHK HYQOX HZAIM K30 PAAUG PAWHS PAWZZ PAXOH PBHAV PBQSW PBYQZ PCIWU PCMID PCZJX PDGRG PDWWI PETMR PFVGT PGXDX PIHIL PISVA PJCTQ PJTMS PLCHJ PMFND PMHAD PMKZF PNQDJ POUND PPLAD PQAPC PQCAN PQCMW PQEME PQHKH PQMID PQNCT PQNET PQSCT PQSET PSVJG PVKVW PVMQY PZGFC ~P4 ~P5 3V. 7WY 7WZ 7X7 7XB 87Z 88E 8FE 8FG 8FI 8FJ 8FK 8FL ABUWG AFKRA AIMQZ ALSLI ARAPS AZQEC BENPR BEZIV BGLVJ CCPQU CNYFK DWQXO FRNLG FYUFA F~G GHDGH GNUQQ HCIFZ JQ2 K60 K6~ K7- K9. L.- LIQON M0A M0C M0S M1O M1P P5Z P62 PHGZM PHGZT PJZUB PKEHL PPXIY PQBIZ PQBZA PQEST PQGLB PQQKQ PQUKI PRINS PRQQA PYYUZ Q9U E3H F2A 7SC 8FD L7M L~C L~D |
DOI | 10.1002/(SICI)1097-4571(199010)41:7<508::AID-ASI5>3.0.CO;2-J |
DatabaseName | Istex CrossRef ERIC ERIC (Ovid) ERIC ERIC ERIC (Legacy Platform) ERIC( SilverPlatter ) ERIC ERIC PlusText (Legacy Platform) Education Resources Information Center (ERIC) ERIC Pascal-Francis Periodicals Archive Online Foundation Collection 2 Periodicals Archive Online Collection 5 (2022) Periodicals Archive Online Foundation Collection 2 (2022) Periodicals Index Online Segment 07 Periodicals Index Online Segment 08 ProQuest Historical Periodicals Periodicals Index Online Segment 26 Periodicals Index Online Primary Sources Access—Foundation Edition (Plan E) - West Primary Sources Access (Plan D) - International Primary Sources Access & Build (Plan A) - MEA Primary Sources Access—Foundation Edition (Plan E) - Midwest Primary Sources Access—Foundation Edition (Plan E) - Northeast Primary Sources Access (Plan D) - Southeast Primary Sources Access (Plan D) - North Central Primary Sources Access—Foundation Edition (Plan E) - Southeast Primary Sources Access (Plan D) - South Central Primary Sources Access & Build (Plan A) - UK / I Primary Sources Access (Plan D) - Canada Primary Sources Access (Plan D) - EMEALA Primary Sources Access—Foundation Edition (Plan E) - North Central Primary Sources Access—Foundation Edition (Plan E) - South Central Primary Sources Access & Build (Plan A) - International Primary Sources Access—Foundation Edition (Plan E) - International Primary Sources Access (Plan D) - West Periodicals Index Online Segments 1-50 Primary Sources Access (Plan D) - APAC Primary Sources Access (Plan D) - Midwest ProQuest One History Primary Sources Access (Plan D) - MEA ProQuest Digital Collections Primary Sources Access—Foundation Edition (Plan E) - Canada Primary Sources Access—Foundation Edition (Plan E) - UK / I Primary Sources Access—Foundation Edition (Plan E) - EMEALA Primary Sources Access & Build (Plan A) - APAC Primary Sources Access & Build (Plan A) - Canada Primary Sources Access & Build (Plan A) - West Primary Sources Access & Build (Plan A) - EMEALA Primary Sources Access (Plan D) - Northeast Primary Sources Access & Build (Plan A) - Midwest Primary Sources Access & Build (Plan A) - North Central Primary Sources Access & Build (Plan A) - Northeast Primary Sources Access & Build (Plan A) - South Central Primary Sources Access & Build (Plan A) - Southeast Primary Sources Access (Plan D) - UK / I Historical Periodicals Collection (1740-1940) Primary Sources Access—Foundation Edition (Plan E) - APAC Primary Sources Access—Foundation Edition (Plan E) - MEA PAO Collection 5 UZH-/ZB-Zugriff (inkl. PURA/SLSKey): ProQuest ProQuest Central (Corporate) ABI/INFORM Collection ABI/INFORM Global (PDF only) Health & Medical Collection ProQuest Central (purchase pre-March 2016) ABI/INFORM Collection Medical Database (Alumni Edition) ProQuest SciTech Collection ProQuest Technology Collection Hospital Premium Collection Hospital Premium Collection (Alumni Edition) ProQuest Central (Alumni) (purchase pre-March 2016) ABI/INFORM Collection (Alumni) ProQuest Central (Alumni) ProQuest Central UK/Ireland ProQuest One Literature Social Science Premium Collection ProQuest Advanced Technologies & Aerospace Database ProQuest Central Essentials ProQuest Central Business Premium Collection Technology Collection ProQuest One Community College Library & Information Science Collection ProQuest Central Business Premium Collection (Alumni) Health Research Premium Collection ABI/INFORM Global (Corporate) Health Research Premium Collection (Alumni) ProQuest Central Student SciTech Premium Collection ProQuest Computer Science Collection ProQuest Business Collection (Alumni Edition) ProQuest Business Collection Computer Science Database ProQuest Health & Medical Complete (Alumni) ABI/INFORM Professional Advanced ProQuest One Literature - U.S. Customers Only ABI/INFORM Global ABI/INFORM Global ProQuest Health & Medical Collection Library Science Medical Database Advanced Technologies & Aerospace Database ProQuest Advanced Technologies & Aerospace Collection ProQuest Central Premium ProQuest One Academic (New) ProQuest Health & Medical Research Collection ProQuest One Academic Middle East (New) ProQuest One Health & Nursing ProQuest One Business ProQuest One Business (Alumni) ProQuest One Academic Eastern Edition (DO NOT USE) ProQuest One Applied & Life Sciences ProQuest One Academic ProQuest One Academic UKI Edition ProQuest Central China ProQuest One Social Sciences ABI/INFORM Collection China ProQuest Central Basic Library & Information Sciences Abstracts (LISA) Library & Information Science Abstracts (LISA) Computer and Information Systems Abstracts Technology Research Database Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Academic Computer and Information Systems Abstracts Professional |
DatabaseTitle | CrossRef ERIC Periodicals Index Online Segment 26 Periodicals Index Online Segments 1-50 Periodicals Index Online Periodicals Archive Online Foundation Collection 2 Periodicals Archive Online Collection 5 (2022) Historical Periodicals Collection ProQuest One History Periodicals Archive Online Foundation Collection 2 (2022) ProQuest Digital Collections PAO Collection 5 (with 1996-2000 update) PAO Collection 5 Periodicals Index Online Segment 08 Periodicals Index Online Segment 07 ProQuest Historical Periodicals ProQuest Business Collection (Alumni Edition) Computer Science Database ProQuest Central Student ProQuest Advanced Technologies & Aerospace Collection ProQuest Central Essentials ProQuest Computer Science Collection SciTech Premium Collection ProQuest Central China ABI/INFORM Complete ProQuest One Applied & Life Sciences Health Research Premium Collection Health & Medical Research Collection Library & Information Science Collection ProQuest Central (New) ProQuest Medical Library (Alumni) Advanced Technologies & Aerospace Collection Business Premium Collection Social Science Premium Collection ABI/INFORM Global ProQuest One Literature ProQuest One Academic Eastern Edition ProQuest Hospital Collection ProQuest Technology Collection Health Research Premium Collection (Alumni) ProQuest Business Collection ProQuest Hospital Collection (Alumni) ProQuest Health & Medical Complete ProQuest One Academic UKI Edition ProQuest One Academic ProQuest One Academic (New) ABI/INFORM Global (Corporate) ProQuest One Business Technology Collection ProQuest One Academic Middle East (New) ProQuest Health & Medical Complete (Alumni) ProQuest Central (Alumni Edition) ProQuest One Community College ProQuest One Health & Nursing ProQuest Central ABI/INFORM Professional Advanced ProQuest Library Science Health and Medicine Complete (Alumni Edition) ProQuest Central Korea ABI/INFORM Complete (Alumni Edition) ProQuest One Literature - U.S. Customers Only ProQuest One Social Sciences ABI/INFORM Global (Alumni Edition) ProQuest Central Basic ABI/INFORM China ProQuest SciTech Collection Advanced Technologies & Aerospace Database ProQuest Medical Library ProQuest One Business (Alumni) ABI/INFORM Archive ProQuest Central (Alumni) Business Premium Collection (Alumni) Library and Information Science Abstracts (LISA) Computer and Information Systems Abstracts Technology Research Database Computer and Information Systems Abstracts – Academic Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Professional |
DatabaseTitleList | Computer and Information Systems Abstracts Library and Information Science Abstracts (LISA) ERIC ProQuest Business Collection (Alumni Edition) |
Database_xml | – sequence: 1 dbid: ERI name: ERIC url: https://eric.ed.gov/ sourceTypes: Index Database – sequence: 2 dbid: 8FG name: ProQuest Technology Collection url: https://search.proquest.com/technologycollection1 sourceTypes: Aggregation Database |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Engineering Library & Information Science |
EISSN | 1097-4571 |
ERIC | EJ418645 |
EndPage | 534 |
ExternalDocumentID | 639271601 19423733 EJ418645 10_1002__SICI_1097_4571_199010_41_7_508__AID_ASI5_3_0_CO_2_J ASI5 ark_67375_WNG_6N54W435_5 |
Genre | article Statistics/Data Report |
GeographicLocations | New York United States--US |
GeographicLocations_xml | – name: New York – name: United States--US |
GroupedDBID | -~X .4I .DC .GA .GJ .Y3 0-V 07C 10A 186 1L6 1OB 1OC 1OL 29L 31~ 3WU 4.4 4ZD 51W 51X 52N 52O 52P 52S 52T 52W 52X 5GY 66C 77I 7PT 7WY 7X7 8-1 8-4 8-5 883 88E 8FE 8FG 8FI 8FJ 8FL 8FW 8G5 8R4 8R5 8VB 930 A03 AAEVG AAHQN AAMMB AAMNL AANHP AANLZ AAWJA AAWTL AAXRX AAYCA AAZKR ABCUV ABIJN ABJNI ABPPZ ABUWG ACAHQ ACBEA ACBWZ ACCZN ACFBH ACGFS ACIOK ACPOU ACREJ ACRPL ACXBN ACXQS ACYXJ ADBBV ADEOM ADIZJ ADMGS ADMHC ADMHG ADNMO ADOZA AEFGJ AEIGN AEIMD AETEA AEUYR AFBPY AFFNX AFFPM AFGKR AFKRA AFWVQ AFZJQ AGQHT AGQPQ AGXDD AGYGG AHBTC AHKVK AHQJS AI. AIATT AIDQK AIDYY AIMQZ AITYG AIURR AKVCP ALMA_UNASSIGNED_HOLDINGS ALSLI ALUQN AMBMR AMYDB APEJR ARALO ARAPS ATUGU AZFZN AZQEC BDRZF BENPR BEZIV BGLVJ BPHCQ BRXPI BSCLL BVXVI BY8 CCPQU CJNVE CMOOK CNYFK CO8 CS3 D-F DCZOG DRFUL DRSTM DWQXO EBS EBU ECVKH EJD ELW F00 F01 F04 FEDTE FRNLG FYUFA G-S GNP GNUQQ GODZA GROUPED_ABI_INFORM_ARCHIVE GROUPED_ABI_INFORM_RESEARCH GUQSH HCIFZ HF~ HGLYW HHY HMCUK HVGLF HYQOX H~9 I-F JPC K1G K60 K6V K6~ K7- KQQ LATKE LAW LEEKS LH4 LIQON LITHE LOXES LP6 LP7 LPU LUTES LYRES M0C M0F M0P M1O M1P M2O M59 MEWTI MRFUL MRSTM MSFUL MSSTM MVM MXFUL MXSTM O-F OHT P-O P4D P62 PALCI PHGZM PHGZT PJZUB PMFND PMKZF PPXIY PQBIZ PQBZA PQGLB PQQKQ PROAC PRQQA PSQYO PUEGO PVKVW Q2X Q5E QB0 QRW QWB RIWAO ROL SAMSI SUPJJ TN5 U5U UB1 UKHRP VH1 W99 WH7 WIH WIK WJL WQJ WXSBR XG1 XV2 XZL ZCA ZCG ZL0 ZZTAW ~02 ~P4 ~P5 0B8 3V. AAHHS AAYOK ACCFJ AEEZP AEQDE AEUQT AFPWT AIWBW AJBDE ALIPV GROUPED_ABI_INFORM_COMPLETE RWI VQA WRC WWI AAYXX CITATION 7SW BJH BNH BNI BNJ BNO ERI PET REK WWN IQODW FYSDU GHEHK HZAIM K30 PAAUG PAWHS PAWZZ PAXOH PBHAV PBQSW PBYQZ PCIWU PCMID PCZJX PDGRG PDWWI PETMR PFVGT PGXDX PIHIL PISVA PJCTQ PJTMS PLCHJ PMHAD PNQDJ POUND PPLAD PQAPC PQCAN PQCMW PQEME PQHKH PQMID PQNCT PQNET PQSCT PQSET PSVJG PVMQY PZGFC 7XB 8FK JQ2 K9. L.- PKEHL PQEST PQUKI PRINS Q9U E3H F2A 7SC 8FD L7M L~C L~D |
ID | FETCH-LOGICAL-c5165-29e7c7e25ee1291b426b4ef89cc9582f84d62cdb8f590bc27972108440e4b1363 |
IEDL.DBID | 8FG |
ISSN | 0002-8231 |
IngestDate | Fri Sep 05 06:46:13 EDT 2025 Thu Sep 04 23:43:45 EDT 2025 Sun Sep 07 03:46:32 EDT 2025 Fri Jul 25 23:46:58 EDT 2025 Wed Apr 02 07:25:56 EDT 2025 Tue Sep 02 19:29:23 EDT 2025 Tue Jul 01 00:49:27 EDT 2025 Wed Jan 22 16:21:28 EST 2025 Tue Sep 09 05:30:23 EDT 2025 |
IsDoiOpenAccess | false |
IsOpenAccess | false |
IsPeerReviewed | false |
IsScholarly | false |
Issue | 7 |
Keywords | Large dimension Inverted file Query Information conversion Data storage Mathematical method Document access Addressing Implementation File layout Research and development Automated processing Documentation data processing Signing Database Hashing Question answering Full text Indexing |
Language | English |
License | http://doi.wiley.com/10.1002/tdm_license_1.1 CC BY 4.0 |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-c5165-29e7c7e25ee1291b426b4ef89cc9582f84d62cdb8f590bc27972108440e4b1363 |
Notes | ArticleID:ASI5 istex:A342B2CD7826D983F75BBE7BCC000F245F4AEFEC ark:/67375/WNG-6N54W435-5 ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 14 ObjectType-Statistics/Data Report-1 ObjectType-Article-1 ObjectType-Feature-2 content type line 23 |
PQID | 1301256131 |
PQPubID | 1818555 |
PageCount | 27 |
ParticipantIDs | proquest_miscellaneous_743443631 proquest_miscellaneous_57243033 proquest_journals_216893104 proquest_journals_1301256131 pascalfrancis_primary_19423733 eric_primary_EJ418645 crossref_primary_10_1002__SICI_1097_4571_199010_41_7_508__AID_ASI5_3_0_CO_2_J wiley_primary_10_1002_SICI_1097_4571_199010_41_7_508_AID_ASI5_3_0_CO_2_J_ASI5 istex_primary_ark_67375_WNG_6N54W435_5 |
ProviderPackageCode | CITATION AAYXX |
PublicationCentury | 1900 |
PublicationDate | October 1990 |
PublicationDateYYYYMMDD | 1990-10-01 |
PublicationDate_xml | – month: 10 year: 1990 text: October 1990 |
PublicationDecade | 1990 |
PublicationPlace | Washington, D.C |
PublicationPlace_xml | – name: Washington, D.C – name: New York, NY – name: New York, N.Y – name: New York |
PublicationTitle | Journal of the American Society for Information Science |
PublicationTitleAlternate | J. Am. Soc. Inf. Sci |
PublicationYear | 1990 |
Publisher | Wiley Subscription Services, Inc., A Wiley Company John Wiley & Sons American Documentation Institute Wiley Periodicals Inc |
Publisher_xml | – name: Wiley Subscription Services, Inc., A Wiley Company – name: John Wiley & Sons – name: American Documentation Institute – name: Wiley Periodicals Inc |
References | Berra, P. B., Chung, S. M., & Hachem, N. (1987). Computer architecture for a surrogate file to a very large data/knowledge base. IEEE Computer, 20(3), 25-32. Harrison, M. C. (1971). Implementation of substring test by hashing. Comm. ACM, 14, 777-779 Whang, K., Wiederhold, G., & Sagalowicz, D. (1983). Estimating block accesses in database organizations: A closed noniterative formula. Comm. ACM, 23, 940-944. Yao, S. B. (1977). Approximating block accesses in database organizations. Comm. ACM, 20, 260-261. Johnson, N. L., & Kotz, S. (1969). Discrete distributions. Wiley: New York. Cárdenas, A. F. (1975). Analysis and performance of inverted data base structures. Comm. ACM, 18, 253-263. Colomb, R. M., & Jayasooriah (1986). A clause indexing system for PROLOG based on superimposed coding. Australian Computer Journal, 18, 18-25. Pfaltz, J. L., Berman, W. J., & Cagley, E. M. (1980). Partial-match retrieval using indexed descriptor files. Comm. ACM, 23, 522-528. Files, J. R., & Huskey, H. D. (1969). An information retrieval system based on superimposed coding. Proceedings AFIPS, Fall Joint Computer Conference, 35, 423-432. Roberts, C. S. (1979). Partial-match retrieval via the method of superimposed codes. Proceedings of the IEEE, 67, 1624-1642. Faloutsos, C., & Christodoulakis, S. (1987). Description and performance analysis of signature file methods for office filing. ACM Transactions on Office Information Systems, 5, 237-257. Salton, G., & McGill, M. J. (1983). in Introduction to modern information retrieval. New York: McGraw-Hill. Knuth, D. E., Morris, J. H., & Pratt, V. R. (1977). Fast pattern matching in strings. SIAM J. Comput. 6, 323-350. Sacks-Davis, R., & Ramamohanarao, K. (1983). A two-level superimposed coding scheme for partial match retrieval. Information Systems, 8, 273-280. Comer, D. (1979). The ubiquitous B-tree. ACM Computing Surveys, 11, 121-137. Aho, A. V., & Corasick, M. J. (1975). Fast pattern matching: An aid to bibliographic searching. Comm. ACM, 18, 333-340. Faloutsos, C. (1985). Access methods for text. ACM Computing Surveys, 17, 49-74. Croft, W., & Savino, P. (1988). Implementing ranking strategies using text signatures. ACM Transactions on Office Information Systems, 6, 42-62. Colomb, R. M. (1985). Use of superimposed code words for partial match data retrieval. Australian Computer Journal, 17, 181-188. Stanfill, C., & Kahle, B. (1986). Parallel free-text search on the connection machine system. Comm. ACM, 29, 1229-1239. Federowicz, J. (1987). Database performance evaluation in an indexed file environment. ACM Trans. Database Systems, 12, 85-110. Sacks-Davis, R., Ramamohanarao, K., & Kent, A. J. (1987). Multi-key access methods based on superimposed coding techniques. ACM Trans. Database Systems, 12, 655-696. Lovins, J. B. (1968). Development of a stemming algorithm. Mechanical Translation and Computational Linguistics, 11, 22-31. Christodoulakis, S. (1984). Implications of certain assumptions in database performance evaluation. ACM Transations on Database Systems, 9, 163-186. Boyer, R. S., & Moore, J. S. (1977). A fast string searching algorithm. Comm. ACM, 20, 762-772. Knuth, D. E. (1973). The art of computer programming, vol. 3: Sorting and searching. Reading, MA: Addison-Wesley. McLlroy, M. D. (1982). Development of a spelling list. IEEE Transactions on Communications COM-30, 1, 91-99. Sacks-Davis, R. (1985). Performance of a multi-key access method based on descriptors and superimposed coding techniques. Information Systems, 10, 391-403. Stiassny, S. (1960). Mathematical analysis of various superimposed coding schemes. American Documentation, 11, 155-169. Bloom, B. H. (1970). Space/time trade-offs in hash coding with allowable errors. Comm. ACM, 13, 422-426. 1987; 12 1960; 11 1987; 5 1980; 23 1975; 18 1977; 20 1986; 18 1983; 8 1969; 35 1973 1979; 11 1979 1970; 13 1985; 17 1987; 20 1979; 67 1982; 1 1988; 6 1971; 14 1987 1986 1985 1986; 29 1984; 9 1984 1983 1985; 10 1969 1968; 11 1988 1977; 6 1983; 23 |
References_xml | – reference: Aho, A. V., & Corasick, M. J. (1975). Fast pattern matching: An aid to bibliographic searching. Comm. ACM, 18, 333-340. – reference: Stiassny, S. (1960). Mathematical analysis of various superimposed coding schemes. American Documentation, 11, 155-169. – reference: Faloutsos, C. (1985). Access methods for text. ACM Computing Surveys, 17, 49-74. – reference: McLlroy, M. D. (1982). Development of a spelling list. IEEE Transactions on Communications COM-30, 1, 91-99. – reference: Lovins, J. B. (1968). Development of a stemming algorithm. Mechanical Translation and Computational Linguistics, 11, 22-31. – reference: Sacks-Davis, R. (1985). Performance of a multi-key access method based on descriptors and superimposed coding techniques. Information Systems, 10, 391-403. – reference: Faloutsos, C., & Christodoulakis, S. (1987). Description and performance analysis of signature file methods for office filing. ACM Transactions on Office Information Systems, 5, 237-257. – reference: Files, J. R., & Huskey, H. D. (1969). An information retrieval system based on superimposed coding. Proceedings AFIPS, Fall Joint Computer Conference, 35, 423-432. – reference: Roberts, C. S. (1979). Partial-match retrieval via the method of superimposed codes. Proceedings of the IEEE, 67, 1624-1642. – reference: Federowicz, J. (1987). Database performance evaluation in an indexed file environment. ACM Trans. Database Systems, 12, 85-110. – reference: Sacks-Davis, R., Ramamohanarao, K., & Kent, A. J. (1987). Multi-key access methods based on superimposed coding techniques. ACM Trans. Database Systems, 12, 655-696. – reference: Colomb, R. M. (1985). Use of superimposed code words for partial match data retrieval. Australian Computer Journal, 17, 181-188. – reference: Johnson, N. L., & Kotz, S. (1969). Discrete distributions. Wiley: New York. – reference: Bloom, B. H. (1970). Space/time trade-offs in hash coding with allowable errors. Comm. ACM, 13, 422-426. – reference: Cárdenas, A. F. (1975). Analysis and performance of inverted data base structures. Comm. ACM, 18, 253-263. – reference: Whang, K., Wiederhold, G., & Sagalowicz, D. (1983). Estimating block accesses in database organizations: A closed noniterative formula. Comm. ACM, 23, 940-944. – reference: Christodoulakis, S. (1984). Implications of certain assumptions in database performance evaluation. ACM Transations on Database Systems, 9, 163-186. – reference: Knuth, D. E. (1973). The art of computer programming, vol. 3: Sorting and searching. Reading, MA: Addison-Wesley. – reference: Stanfill, C., & Kahle, B. (1986). Parallel free-text search on the connection machine system. Comm. ACM, 29, 1229-1239. – reference: Croft, W., & Savino, P. (1988). Implementing ranking strategies using text signatures. ACM Transactions on Office Information Systems, 6, 42-62. – reference: Pfaltz, J. L., Berman, W. J., & Cagley, E. M. (1980). Partial-match retrieval using indexed descriptor files. Comm. ACM, 23, 522-528. – reference: Berra, P. B., Chung, S. M., & Hachem, N. (1987). Computer architecture for a surrogate file to a very large data/knowledge base. IEEE Computer, 20(3), 25-32. – reference: Yao, S. B. (1977). Approximating block accesses in database organizations. Comm. ACM, 20, 260-261. – reference: Comer, D. (1979). The ubiquitous B-tree. ACM Computing Surveys, 11, 121-137. – reference: Colomb, R. M., & Jayasooriah (1986). A clause indexing system for PROLOG based on superimposed coding. Australian Computer Journal, 18, 18-25. – reference: Knuth, D. E., Morris, J. H., & Pratt, V. R. (1977). Fast pattern matching in strings. SIAM J. Comput. 6, 323-350. – reference: Sacks-Davis, R., & Ramamohanarao, K. (1983). A two-level superimposed coding scheme for partial match retrieval. Information Systems, 8, 273-280. – reference: Harrison, M. C. (1971). Implementation of substring test by hashing. Comm. ACM, 14, 777-779, – reference: Salton, G., & McGill, M. J. (1983). in Introduction to modern information retrieval. New York: McGraw-Hill. – reference: Boyer, R. S., & Moore, J. S. (1977). A fast string searching algorithm. Comm. ACM, 20, 762-772. – volume: 17 start-page: 49 year: 1985 end-page: 74 article-title: Access methods for text publication-title: ACM Computing Surveys – volume: 6 start-page: 42 year: 1988 end-page: 62 article-title: Implementing ranking strategies using text signatures publication-title: ACM Transactions on Office Information Systems – year: 1983 – volume: 12 start-page: 655 year: 1987 end-page: 696 article-title: Multi‐key access methods based on superimposed coding techniques publication-title: ACM Trans. Database Systems – start-page: 347 year: 1986 end-page: 355 – volume: 10 start-page: 391 year: 1985 end-page: 403 article-title: Performance of a multi‐key access method based on descriptors and superimposed coding techniques publication-title: Information Systems – start-page: 21 year: 1984 end-page: 40 – volume: 8 start-page: 273 year: 1983 end-page: 280 article-title: A two‐level superimposed coding scheme for partial match retrieval publication-title: Information Systems – volume: 23 start-page: 522 year: 1980 end-page: 528 article-title: Partial‐match retrieval using indexed descriptor files publication-title: Comm. ACM – year: 1987 – start-page: 203 year: 1984 end-page: 210 – volume: 13 start-page: 422 year: 1970 end-page: 426 article-title: Space/time trade‐offs in hash coding with allowable errors publication-title: Comm. ACM – year: 1973 – start-page: 1 year: 1984 end-page: 20 – start-page: 280 year: 1988 end-page: 293 – volume: 67 start-page: 1624 year: 1979 end-page: 1642 article-title: Partial‐match retrieval via the method of superimposed codes publication-title: Proceedings of the IEEE – volume: 5 start-page: 237 year: 1987 end-page: 257 article-title: Description and performance analysis of signature file methods for office filing publication-title: ACM Transactions on Office Information Systems – volume: 9 start-page: 163 year: 1984 end-page: 186 article-title: Implications of certain assumptions in database performance evaluation publication-title: ACM Transations on Database Systems – volume: 11 start-page: 121 year: 1979 end-page: 137 article-title: The ubiquitous B‐tree publication-title: ACM Computing Surveys – volume: 23 start-page: 940 year: 1983 end-page: 944 article-title: Estimating block accesses in database organizations: A closed noniterative formula publication-title: Comm. ACM – year: 1979 – volume: 18 start-page: 253 year: 1975 end-page: 263 article-title: Analysis and performance of inverted data base structures publication-title: Comm. ACM – volume: 17 start-page: 181 year: 1985 end-page: 188 article-title: Use of superimposed code words for partial match data retrieval publication-title: Australian Computer Journal – volume: 20 start-page: 25 issue: 3 year: 1987 end-page: 32 article-title: Computer architecture for a surrogate file to a very large data/knowledge base publication-title: IEEE Computer – volume: 20 start-page: 762 year: 1977 end-page: 772 article-title: A fast string searching algorithm publication-title: Comm. ACM – volume: 35 start-page: 423 year: 1969 end-page: 432 article-title: An information retrieval system based on superimposed coding publication-title: Proceedings AFIPS, Fall Joint Computer Conference – start-page: 165 year: 1985 end-page: 170 – volume: 14 start-page: 777 year: 1971 end-page: 779 article-title: Implementation of substring test by hashing publication-title: Comm. ACM – year: 1969 – volume: 1 start-page: 91 year: 1982 end-page: 99 article-title: Development of a spelling list publication-title: IEEE Transactions on Communications COM‐30 – volume: 20 start-page: 260 year: 1977 end-page: 261 article-title: Approximating block accesses in database organizations publication-title: Comm. ACM – start-page: 351 year: 1988 end-page: 359 – start-page: 569 year: 1986 end-page: 576 – volume: 11 start-page: 22 year: 1968 end-page: 31 article-title: Development of a stemming algorithm publication-title: Mechanical Translation and Computational Linguistics – volume: 6 start-page: 323 year: 1977 end-page: 350 article-title: Fast pattern matching in strings publication-title: SIAM J. Comput. – volume: 11 start-page: 155 year: 1960 end-page: 169 article-title: Mathematical analysis of various superimposed coding schemes publication-title: American Documentation – volume: 18 start-page: 333 year: 1975 end-page: 340 article-title: Fast pattern matching: An aid to bibliographic searching publication-title: Comm. ACM – start-page: 448 year: 1985 end-page: 457 – volume: 29 start-page: 1229 year: 1986 end-page: 1239 article-title: Parallel free‐text search on the connection machine system publication-title: Comm. ACM – volume: 18 start-page: 18 year: 1986 end-page: 25 article-title: A clause indexing system for PROLOG based on superimposed coding publication-title: Australian Computer Journal – volume: 12 start-page: 85 year: 1987 end-page: 110 article-title: Database performance evaluation in an indexed file environment publication-title: ACM Trans. Database Systems |
SSID | ssj0009965 |
Score | 1.2366363 |
Snippet | A new signature file method for accessing information from large databases containing both formatted and free text data is presented. The new method, called... Describes a new signature file method, called the multiorganizational scheme, for accessing information from large databases containing both formatted and... Presents a new signature file method for accessing information from large data bases containing both formatted and free text data. The new method, called the... A new signature file method is presented for accessing information from large databases containing both formatted and free text data. The new method, called... |
SourceID | proquest pascalfrancis eric crossref wiley istex |
SourceType | Aggregation Database Index Database Publisher |
StartPage | 508 |
SubjectTerms | Access to information Bibliographic Databases Computerized information storage and retrieval Computerized subject indexing Data Files Exact sciences and technology File organization Formats. Markup languages. Codification. Conversion Full Text Databases Indexing Information and communication sciences Information and document structure and analysis Information processing and retrieval Information Retrieval Information science. Documentation Information storage and retrieval Information work International conferences Libraries Library Research Logic programming Mathematical Models Methods Office automation Relational databases Sciences and techniques of general use Signature Files Signatures Subject Index Terms Subject indexing |
Title | A signature file scheme based on multiple organizations for indexing very large text databases |
URI | https://api.istex.fr/ark:/67375/WNG-6N54W435-5/fulltext.pdf https://onlinelibrary.wiley.com/doi/abs/10.1002%2F%28SICI%291097-4571%28199010%2941%3A7%3C508%3A%3AAID-ASI5%3E3.0.CO%3B2-J http://eric.ed.gov/ERICWebPortal/detail?accno=EJ418645 https://www.proquest.com/docview/1301256131 https://www.proquest.com/docview/216893104 https://www.proquest.com/docview/57243033 https://www.proquest.com/docview/743443631 |
Volume | 41 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwhV3fb9MwELbYKiF4QDCGCNuKH9C0PWSLEzuOywTqunZrRVvENtYnrCR2ACHasXbS_nzuErdbNX68VIpsnaP7bN_X3OczIW9kEZpIZsa3qTI-z3nhJ1zkfp6GUWaNVUbi2eH-ID45572RGDltztTJKud7YrlRm0mO38j3QxZDaIU_D-8vf_l4aRQmV90NGiukxiDQ4DRPOse3NXdVLBbsF3jMQ3Lgao7u75x2W91dzL76XEi2wzA9FOxy1pAHQFkajWb3yG-edsW7aC_Yaw3fhn5vKXI5cXQNsbhBQWU6BZ8W1WUYS2z1Luctg1bnKXni2CZtVtPjGXlgx2vk8Z0ahGtky51coNvUHU1CqKhb88_JlyY9_f61qv9JOzAINH2zPy09hPhnKHTtO00iXTrXScEWWDT2Boahny2M8AFV5_QMwgE9SmcpBtDpOjnvtM9aJ767lMHPBUNFnLIylzYU1gJVYBlE-IzbIlF5rkQSFgk3cZibLCmECrI8lFgeKEg4DyzPWBRHL8jqeDK2LwktYmsDWwC9jwyXSqVC2CADQimjNMq48kh_7nB9WdXe0FWV5VBrBLBKnyOAugJQc6alBgC1BgA1AqgjHejWUIe655F1RG1hq93jLIm58Mh2CeOiIb36gXo3KfTF4FjHA8EvgFVq6Fhfwvn2rRSqi6LII5tz4LXbC6aYMAQWCbSJeWTjfvNiYnvk9aIV1jgmbtKxnVxPtZAhB6oB5ulfegAP5BycC0P0y_l2z2H_8dcf3FU-v_rnK2-QR2ioEjduktXZ1bXdApI2y-pkRY5kvVyQdVI7bA8-fqqXHyfgt8-GvwFqZzIs |
linkProvider | ProQuest |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMw1V3JchMxEFWFpIrlwBJCMWTTAVLJYZwZjTSaMQHK2DG2E9sHO8sJMYsGqBR2iO0i8E_8Ct9E9yxOTFhOOXB0SdWaaj11P1lPLUKeyoTFjgxjUwd-bPKIJ6bHRWRGAXNCHWs_lnh3uN1xGwe8dSyO58j34i4MyiqLmJgG6ngY4X_k28x2IbXC5uHV6WcTH43Cw9XiBY0MFXv66xfYsY1eNGswvc8Yq-_2qw0zf1TAjISNii5fy0hqJrSGVGeHkKFCrhPPjyJfeCzxeOyyKA69RPhWGDGJ5W0sj3NL89B2XAfs3iAL-HIXLqi23b2o8eu7Ysq2gTfdJDt5jdPtzV6z2tzC016TC2lv2ngcZW1xuyx3gCKVy5Vmzaz0muKlU7JK1e5zZrZmMmUuxl7AuT9HAWcwgjlMssc3ZtjxZY6dJsn6PfKjcG-mjTkpTcZhKfr2S-XJ_8b_98ndnK7TSra-HpA5PVgkdy4VcVwkq_nVD7pB87tdiHWaB82H5G2F9j6-zwqo0jp4DZo-6E-avgYCEVPo2s5FnXTmYiwFW2Ax1ucwDD3UMMI-yvZpH_IprQXjABnIaIkcXIsDHpH5wXCgHxOauFpbOoH9kRNz6fuBENoKgZFLJ3BC7hukXSBInWbFS1RWppophYjM9AeISJUhUnFbSQWIVAoQqRCRylGWqnYVUy2DLCEMp7Z2W9z2XC4MspHictoQnJ2gYFAKddR5o9yO4EdAyxV0XJsB7sVX-SjPchyDrBToU3kwHeGJK9Bw4J22QZavNk-RaZD1aSsESTz5CgZ6OBkpIRkHrgbm6R96AJHmHJwLQ7TTBXTFYf_w12_clf5-8tdPXie3Gv32vtpvdvaWyW00milFV8j8-GyiV4HxjsO1NM5Q8u66F9ZPf36q6w |
linkToPdf | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1bTxNBFJ4gJEQejCLGlds8KIGHpbtz2dmtqKktlS20mBSkTx72MqvG2CItEf-av84zu9uWBi9PPDYzOdOeb-acb3ouQ8hzlbGUqzi1dRSktkhEZvtCJnYSMR7rVAepMrXD7Y53cCpaPdmbI7_GtTAmrXJsE3NDnQ4S8x95BW0t-mJ0Pm4lK9Mi3jeaby6-2-YFKRNpHT-nUWyRQ_3zB17fhq_CBmL9grHm_kn9wC5fGLAT6Zr0rkCrRGkmtUa_58bormKhMz9IkkD6LPNF6rEkjf1MBk6cMGV63Ti-EI4Wscs9jnLvkQXFkVXhWVI9NW34G3hyQr2RRC2SvbLhaWW7G9bDHRP6tYVU7rZrYlPOjnCrag_5UrVaCxt2rRvK13zX2a0fv2R2a8ZtlpnZC2YjXJtszmiIgGbFSxwzVPkm4c49ZvMheVBSXVor9uYjMqf7y2TpRgPEZbJelk3QLVrWRZl9QkuD85h8rNHul09F81HaxEVw6LP-pulbdL4pxantMiGSzhSVUpSFElN9jcvQDxpXODIp7_QEAaWNaBQZ7z1cIad3gtcTMt8f9PVTQjNPa0dneLfgqVBBEEmpnRjZrOIRj0VgkfZY4XBRNP6AosUzAzAAFrF7AyAUAIJwQQECCIAAggEQODhQPwYGLYusGNQmsvZbwvU9IS2ylcM4GYguv5pkOyXhrPMOvI4UZ0hpASduzOA8_VaBSW3i3CJrY-ChNERDmB4bi6zeHmauh4QVr-QW2ZyMooExUaOorwdXQ5CKCeQ5KJ7-ZQaSUCFQubhEO99vtxT2H339QV3552f__kWbZBFNAByFncNVct8ILbIs18j86PJKryNbHMUb-bGk5Pyu7cBvgP1tdg |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=A+Signature+File+Scheme+Based+on+Multiple+Organizations+for+Indexing+Very+Large+Text+Databases&rft.jtitle=Journal+of+the+American+Society+for+Information+Science&rft.au=Kent%2C+A&rft.au=Sacks-Davis%2C+R&rft.au=Ramamohanarao%2C+K&rft.date=1990-10-01&rft.pub=Wiley+Periodicals+Inc&rft.issn=0002-8231&rft.eissn=1097-4571&rft.volume=41&rft.issue=7&rft.spage=508&rft_id=info:doi/10.1002%2F%28SICI%291097-4571%28199010%2941%3A7%3C508%3A%3AAID-ASI5%3E3.0.CO%3B2-J&rft.externalDocID=639271601 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0002-8231&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0002-8231&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0002-8231&client=summon |