Applying informetric characteristics of databases to ir system file design, part I: Informetric models

This study examines how informetric characteristics of information retrieval (IR) system databases can be used to help the systems designer decide what types of file structures would provide the best performance for a given type of information system environment. In this first of two papers, the dev...

Full description

Saved in:
Bibliographic Details
Published inInformation processing & management Vol. 28; no. 1; pp. 121 - 133
Main Author Wolfram, Dietmar
Format Journal Article
LanguageEnglish
Published Oxford Elsevier Ltd 1992
Elsevier Science
Pergamon Press
Elsevier Science Ltd
Subjects
Online AccessGet full text
ISSN0306-4573
1873-5371
DOI10.1016/0306-4573(92)90098-K

Cover

Abstract This study examines how informetric characteristics of information retrieval (IR) system databases can be used to help the systems designer decide what types of file structures would provide the best performance for a given type of information system environment. In this first of two papers, the development of appropriate models describing database contents, to be used later in a simulation study, are dealt with. Database characteristics for which data were collected include: the index term frequency distribution, the distribution of terms used per query, and the distribution of term frequency selections. A shifted generalized Waring distribution was found to provide the best fit for the index term distributions with the large data sets used. For the terms used per query, a shifted negative binomial was found to provide a reasonable fit. A complex relationship was observed for the term selection distribution data, for which the empirical distribution is used. As well, four other hypothetical term selection relationships are presented. With this information, a simulation study examining system performance under different informetric environments can be undertaken.
AbstractList This study examines how informetric characteristics of information retrieval (IR) system databases can be used to help the systems designer decide what types of file structures would provide the best performance for a given type of information system environment. In this first of two papers, the development of appropriate models describing database contents, to be used later in a simulation study, are dealt with. Database characteristics for which data were collected include: the index term frequency distribution, the distribution of terms used per query, and the distribution of term frequency selections. A shifted generalized Waring distribution was found to provide the best fit for the index term distributions with the large data sets used. For the terms used per query, a shifted negative binomial was found to provide a reasonable fit. A complex relationship was observed for the term selection distribution data, for which the empirical distribution is used. As well, four other hypothetical term selection relationships are presented. With this information, a simulation study examining system performance under different informetric environments can be undertaken.
Informetric characteristics of information retrieval (IR) system databases can be used to help the systems designer decide what types of file structures would provide the best performance for a given type of information system environment. Database characteristics for which data were collected include: 1. the index term frequency distribution, 2. the distribution of terms used per query, and 3. the distribution of term frequency selections. A shifted generalized Waring distribution was found to provide the best fit for the index term distributions with the large data sets used. For the terms used per query, a shifted negative binomial was found to provide a reasonable fit. A complex relationship was observed for the term selection distribution data, for which the empirical distribution is used.
Examines how informetric characteristics of information retrieval (IR) system databases can be used to help system designers decide what type of file structures would provide the best performance. The development of appropriate models describing database contents is highlighted in this first part of a two-part study. (30 references) (LRW)
Contribution to a special issue devoted to informetrics. Examines ways in which informetrics characteristics of information retrieval (IR) system data bases can be used to help the systems designer decide what types of file structure would provide the best performance for a given type of information environment. Covers the development of appropriate models describing data base contents to be used in a simulation study. Data base characteristics for which data were collected include: the index term frequency distribution, the distribution of terms used per query and the distribution of term frequency selections. Presents 4 other hypothetical term selection relationships. This information creates the possibility of a simulation study examining system performance under different informetric environments. 00 G.L.C.
Author Wolfram, Dietmar
Author_xml – sequence: 1
  givenname: Dietmar
  surname: Wolfram
  fullname: Wolfram, Dietmar
  organization: School of Library and Information Science, P.O. Box 413, University of Wisconsin-Milwaukee, Milwaukee, WI 53201, U.S.A
BackLink http://eric.ed.gov/ERICWebPortal/detail?accno=EJ441801$$DView record in ERIC
http://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=5018740$$DView record in Pascal Francis
BookMark eNqFkdFqFDEUhoNUcFt9g14EFWnB0WQmySS9EEqpurbgjV6HTOakpswk2yQr7Nub7a5FCupVLv7v_Dmc7xAdhBgAoWNK3lFCxXvSEdEw3ncnqj1VhCjZXD1BCyr7ruFdTw_Q4gF5hg5zviWEME7bBXLnq9W08eEG--BimqEkb7H9YZKxBZLPxduMo8OjKWYwGTIuEfuE8yYXmLHzE-ARsr8Jb_HKpIKXZ3j5R9UcR5jyc_TUmSnDi_17hL5_vPx28bm5_vppeXF-3diO89IoQSxhYFQ_DJSyFuQwSqqUtW4UsqV9Jwc5gDPDoDgFx8U4GCGMsTUVfdcdoTe73lWKd2vIRc8-W5gmEyCus-Z9y0jLaAVfPgJv4zqFupumiknJ62Er9OqvEKOCdrQXslKv95TJ1kwumWB91qvkZ5M2mpPqgZGKHe-welX7kF5-YYzK-7_OdrFNMecETltfTPExlGT8pCnRW9d6K1JvRWrV6nvX-qoOs0fDv_v_M_Zhv1JV8tND0tl6CBZGn8AWPUb_74JfaX_A_A
CODEN IPMADK
CitedBy_id crossref_primary_10_1016_0306_4573_94_90028_0
crossref_primary_10_1590_S0100_19652002000200016
crossref_primary_10_1145_333135_333136
crossref_primary_10_1002_asi_10121
crossref_primary_10_1016_0306_4573_92_90099_L
crossref_primary_10_3390_math8112025
crossref_primary_10_5424_sjar_2016141_7687
crossref_primary_10_1002_1097_4571_2000_9999_9999___AID_ASI1591_3_0_CO_2_R
crossref_primary_10_1002_asi_20688
crossref_primary_10_1016_j_ipm_2008_06_005
crossref_primary_10_1016_j_ipm_2005_05_003
Cites_doi 10.1002/asi.5090150208
10.1093/comjnl/28.3.309
10.1016/0306-4573(87)90001-X
10.1002/asi.4630330406
10.1002/asi.4630360502
10.1002/asi.4630330507
10.1108/eb026600
10.1016/0306-4573(92)90099-L
10.1214/aos/1176345003
10.1108/eb026845
10.1016/0306-4573(88)90023-4
10.1016/0020-0271(73)90004-1
10.2307/2984648
10.1108/eb046796
10.1002/asi.5090180209
10.2307/2345247
ContentType Journal Article
Copyright 1992
1992 INIST-CNRS
Copyright Pergamon Press Inc. 1992
Copyright_xml – notice: 1992
– notice: 1992 INIST-CNRS
– notice: Copyright Pergamon Press Inc. 1992
DBID AAYXX
CITATION
7SW
BJH
BNH
BNI
BNJ
BNO
ERI
PET
REK
WWN
IQODW
K30
PAAUG
PAWHS
PAWZZ
PAXOH
PBHAV
PBQSW
PBYQZ
PCIWU
PCMID
PCZJX
PDGRG
PDWWI
PETMR
PFVGT
PGXDX
PIHIL
PISVA
PJCTQ
PJTMS
PLCHJ
PMHAD
PNQDJ
POUND
PPLAD
PQAPC
PQCAN
PQCMW
PQEME
PQHKH
PQMID
PQNCT
PQNET
PQSCT
PQSET
PSVJG
PVMQY
PZGFC
SFNNT
E3H
F2A
DOI 10.1016/0306-4573(92)90098-K
DatabaseName CrossRef
ERIC
ERIC (Ovid)
ERIC
ERIC
ERIC (Legacy Platform)
ERIC( SilverPlatter )
ERIC
ERIC PlusText (Legacy Platform)
Education Resources Information Center (ERIC)
ERIC
Pascal-Francis
Periodicals Index Online
Primary Sources Access—Foundation Edition (Plan E) - West
Primary Sources Access (Plan D) - International
Primary Sources Access & Build (Plan A) - MEA
Primary Sources Access—Foundation Edition (Plan E) - Midwest
Primary Sources Access—Foundation Edition (Plan E) - Northeast
Primary Sources Access (Plan D) - Southeast
Primary Sources Access (Plan D) - North Central
Primary Sources Access—Foundation Edition (Plan E) - Southeast
Primary Sources Access (Plan D) - South Central
Primary Sources Access & Build (Plan A) - UK / I
Primary Sources Access (Plan D) - Canada
Primary Sources Access (Plan D) - EMEALA
Primary Sources Access—Foundation Edition (Plan E) - North Central
Primary Sources Access—Foundation Edition (Plan E) - South Central
Primary Sources Access & Build (Plan A) - International
Primary Sources Access—Foundation Edition (Plan E) - International
Primary Sources Access (Plan D) - West
Periodicals Index Online Segments 1-50
Primary Sources Access (Plan D) - APAC
Primary Sources Access (Plan D) - Midwest
Primary Sources Access (Plan D) - MEA
Primary Sources Access—Foundation Edition (Plan E) - Canada
Primary Sources Access—Foundation Edition (Plan E) - UK / I
Primary Sources Access—Foundation Edition (Plan E) - EMEALA
Primary Sources Access & Build (Plan A) - APAC
Primary Sources Access & Build (Plan A) - Canada
Primary Sources Access & Build (Plan A) - West
Primary Sources Access & Build (Plan A) - EMEALA
Primary Sources Access (Plan D) - Northeast
Primary Sources Access & Build (Plan A) - Midwest
Primary Sources Access & Build (Plan A) - North Central
Primary Sources Access & Build (Plan A) - Northeast
Primary Sources Access & Build (Plan A) - South Central
Primary Sources Access & Build (Plan A) - Southeast
Primary Sources Access (Plan D) - UK / I
Primary Sources Access—Foundation Edition (Plan E) - APAC
Primary Sources Access—Foundation Edition (Plan E) - MEA
Periodicals Index Online Segment 44
Library & Information Sciences Abstracts (LISA)
Library & Information Science Abstracts (LISA)
DatabaseTitle CrossRef
ERIC
Periodicals Index Online Segment 44
Periodicals Index Online Segments 1-50
Periodicals Index Online
Library and Information Science Abstracts (LISA)
DatabaseTitleList
Library and Information Science Abstracts (LISA)
ERIC
Library and Information Science Abstracts (LISA)
Database_xml – sequence: 1
  dbid: ERI
  name: ERIC
  url: https://eric.ed.gov/
  sourceTypes: Index Database
DeliveryMethod fulltext_linktorsrc
Discipline Library & Information Science
EISSN 1873-5371
ERIC EJ441801
EndPage 133
ExternalDocumentID 1138299
5018740
EJ441801
10_1016_0306_4573_92_90098_K
030645739290098K
GroupedDBID --K
--M
-~X
.DC
.~1
0B8
0R~
1B1
1RT
1~.
1~5
29I
4.4
41~
457
4G.
5GY
5VS
7-5
71M
77K
8P~
9JN
9JO
AABNK
AACTN
AAEDT
AAEDW
AAFJI
AAIAV
AAIKJ
AAKOC
AALRI
AAOAW
AAQFI
AAQXK
AAXUO
AAYFN
AAYOK
ABBOA
ABFNM
ABFRF
ABJNI
ABMAC
ABMMH
ABPPZ
ABXDB
ABYKQ
ACDAQ
ACGFS
ACHQT
ACNNM
ACRLP
ACZNC
ADBBV
ADEZE
ADJOM
ADMUD
AEBSH
AEFWE
AEKER
AENEX
AFKWA
AFTJW
AGHFR
AGUBO
AGYEJ
AHHHB
AHZHX
AIALX
AIEXJ
AIKHN
AITUG
AJBFU
AJOXV
AKYCK
ALMA_UNASSIGNED_HOLDINGS
AMFUW
AMRAJ
AOMHK
AOUOD
ASPBG
AVARZ
AVWKF
AXJTR
AZFZN
BKOJK
BLXMC
CS3
DU5
EBS
EFJIC
EFLBG
EJD
EO8
EO9
EP2
EP3
FDB
FEDTE
FGOYB
FIRID
FNPLU
FYGXN
G-2
G-Q
GBLVA
GBOLZ
HLZ
HMY
HVGLF
HZ~
H~9
IHE
J1W
KOM
LG9
LPU
LY1
M3Y
M41
MO0
MS~
MVM
N9A
O-L
O9-
OAUVE
OHT
OZT
P-8
P-9
P2P
PC.
PQQKQ
PRBVW
Q38
R2-
RIG
ROL
RPZ
SBC
SDF
SDG
SDP
SDS
SES
SEW
SPC
SPCBC
SSB
SSO
SSS
SSV
SSZ
T5K
TN5
U5U
UHB
UHS
UNMZH
WUQ
XFK
ZMT
~G-
AATTM
AAXKI
AAYWO
AAYXX
ABWVN
ACRPL
ACVFH
ADCNI
ADMHG
ADNMO
AEIPS
AEUPX
AFJKZ
AFPUW
AFXIZ
AGCQF
AGQPQ
AGRNS
AIGII
AIIUN
AKBMS
AKRWK
AKYEP
ANKPU
APXCP
BNPGV
CITATION
SSH
77I
7SW
BJH
BNH
BNI
BNJ
BNO
EFKBS
ERI
PET
REK
WWN
08R
ABPIF
IQODW
K30
PAAUG
PAWHS
PAWZZ
PAXOH
PBHAV
PBQSW
PBYQZ
PCIWU
PCMID
PCZJX
PDGRG
PDWWI
PETMR
PFVGT
PGXDX
PIHIL
PISVA
PJCTQ
PJTMS
PLCHJ
PMHAD
PNQDJ
POUND
PPLAD
PQAPC
PQCAN
PQCMW
PQEME
PQHKH
PQMID
PQNCT
PQNET
PQSCT
PQSET
PSVJG
PVMQY
PZGFC
SFNNT
E3H
F2A
ID FETCH-LOGICAL-c355t-960c04ea97bb1142e8bd8199ccfd6821738b8befabb951ef56dba66aac6826733
ISSN 0306-4573
IngestDate Fri Sep 05 11:36:04 EDT 2025
Mon Jun 30 06:00:29 EDT 2025
Sun Jun 29 12:53:23 EDT 2025
Sun Oct 22 16:06:59 EDT 2023
Tue Sep 02 19:15:37 EDT 2025
Tue Jul 01 00:44:27 EDT 2025
Thu Apr 24 23:06:59 EDT 2025
Fri Feb 23 02:20:09 EST 2024
IsDoiOpenAccess false
IsOpenAccess false
IsPeerReviewed true
IsScholarly true
Issue 1
Keywords Document retrieval
Zipf law
Statistical distribution
Database query
Bibliometrics
Informetrics
Indexing term
Search term
Modeling
Language English
License https://www.elsevier.com/tdm/userlicense/1.0
CC BY 4.0
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-c355t-960c04ea97bb1142e8bd8199ccfd6821738b8befabb951ef56dba66aac6826733
Notes ObjectType-Article-2
SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 14
ObjectType-Article-1
ObjectType-Feature-2
content type line 23
PQID 1416131768
PQPubID 2029859
PageCount 13
ParticipantIDs proquest_miscellaneous_57240241
proquest_journals_194885101
proquest_journals_1416131768
pascalfrancis_primary_5018740
eric_primary_EJ441801
crossref_citationtrail_10_1016_0306_4573_92_90098_K
crossref_primary_10_1016_0306_4573_92_90098_K
elsevier_sciencedirect_doi_10_1016_0306_4573_92_90098_K
ProviderPackageCode CITATION
AAYXX
PublicationCentury 1900
PublicationDate 1992
1992-1-00
1992-00-00
19920101
PublicationDateYYYYMMDD 1992-01-01
PublicationDate_xml – year: 1992
  text: 1992
PublicationDecade 1990
PublicationPlace Oxford
PublicationPlace_xml – name: Oxford
PublicationTitle Information processing & management
PublicationYear 1992
Publisher Elsevier Ltd
Elsevier Science
Pergamon Press
Elsevier Science Ltd
Publisher_xml – name: Elsevier Ltd
– name: Elsevier Science
– name: Pergamon Press
– name: Elsevier Science Ltd
References Fedorowicz (BIB10) 1981
Houston, Wall (BIB14) 1964; 15
Nelson (BIB19) 1988; 24
Griffiths (BIB13) 1975; 31
Tague, Nicholls (BIB25) 1987; 23
Fedorowicz (BIB11) 1982; 33
Ayres, Yannakoudakis (BIB2) 1979; 13
Wolfram, Chu, Lu (BIB29) 1990
Ajiferuke (BIB1) 1989; 50
Wolfram (BIB28) 1992; 28
Salton (BIB22) 1975
Berkson (BIB4) 1980; 8
Zunde, Slamecka (BIB30) 1967; 18
Tague, Nelson, Wu (BIB26) 1981
Chandler (BIB6) 1965
Cooper (BIB8) 1973; 9
Nelson (BIB18) 1983; 8
Nelson (BIB20) 1989; 45
Fedorowicz (BIB12) 1982; 33
Irwin (BIB15) 1975; 138
Nelson (BIB17) 1982
Fedorowicz (BIB9) 1981; 42
Nelson, Tague (BIB21) 1985; 36
Williams, Shefner (BIB27) 1976; 9
Bennet (BIB3) 1975
Conover (BIB7) 1980
Tague (BIB24) 1988
Irwin (BIB16) 1975; 138
Sampson, Bendell (BIB23) 1985; 28
Brooks (BIB5) 1987; 24
Irwin (10.1016/0306-4573(92)90098-K_BIB16) 1975; 138
Ajiferuke (10.1016/0306-4573(92)90098-K_BIB1) 1989; 50
Williams (10.1016/0306-4573(92)90098-K_BIB27) 1976; 9
Fedorowicz (10.1016/0306-4573(92)90098-K_BIB12) 1982; 33
Sampson (10.1016/0306-4573(92)90098-K_BIB23) 1985; 28
Brooks (10.1016/0306-4573(92)90098-K_BIB5) 1987; 24
Griffiths (10.1016/0306-4573(92)90098-K_BIB13) 1975; 31
Nelson (10.1016/0306-4573(92)90098-K_BIB19) 1988; 24
Chandler (10.1016/0306-4573(92)90098-K_BIB6) 1965
Salton (10.1016/0306-4573(92)90098-K_BIB22) 1975
Tague (10.1016/0306-4573(92)90098-K_BIB24) 1988
Tague (10.1016/0306-4573(92)90098-K_BIB25) 1987; 23
Tague (10.1016/0306-4573(92)90098-K_BIB26) 1981
Berkson (10.1016/0306-4573(92)90098-K_BIB4) 1980; 8
Wolfram (10.1016/0306-4573(92)90098-K_BIB28) 1992; 28
Ayres (10.1016/0306-4573(92)90098-K_BIB2) 1979; 13
Fedorowicz (10.1016/0306-4573(92)90098-K_BIB9) 1981; 42
Nelson (10.1016/0306-4573(92)90098-K_BIB20) 1989; 45
Fedorowicz (10.1016/0306-4573(92)90098-K_BIB10) 1981
Bennet (10.1016/0306-4573(92)90098-K_BIB3) 1975
Nelson (10.1016/0306-4573(92)90098-K_BIB18) 1983; 8
Irwin (10.1016/0306-4573(92)90098-K_BIB15) 1975; 138
Fedorowicz (10.1016/0306-4573(92)90098-K_BIB11) 1982; 33
Houston (10.1016/0306-4573(92)90098-K_BIB14) 1964; 15
Zunde (10.1016/0306-4573(92)90098-K_BIB30) 1967; 18
Nelson (10.1016/0306-4573(92)90098-K_BIB17) 1982
Wolfram (10.1016/0306-4573(92)90098-K_BIB29) 1990
Conover (10.1016/0306-4573(92)90098-K_BIB7) 1980
Nelson (10.1016/0306-4573(92)90098-K_BIB21) 1985; 36
Cooper (10.1016/0306-4573(92)90098-K_BIB8) 1973; 9
References_xml – volume: 45
  start-page: 227
  year: 1989
  end-page: 237
  ident: BIB20
  article-title: Stochastic models for the distribution of index terms
  publication-title: Journal of Documentation
– volume: 9
  start-page: 89
  year: 1976
  end-page: 100
  ident: BIB27
  article-title: Data element statistics for the MARC II data base
  publication-title: Journal of Library Automation
– year: 1975
  ident: BIB22
  article-title: A theory of indexing
– volume: 31
  start-page: 185
  year: 1975
  end-page: 190
  ident: BIB13
  article-title: Index term input to IR systems
  publication-title: Journal of Documentation
– volume: 36
  start-page: 283
  year: 1985
  end-page: 296
  ident: BIB21
  article-title: Split size-rank models for the distribution of index terms
  publication-title: Journal of the American Society for Information Science
– volume: 24
  start-page: 541
  year: 1988
  end-page: 547
  ident: BIB19
  article-title: Correlation of term usage and term indexing frequencies
  publication-title: Information Processing and Management
– volume: 33
  start-page: 285
  year: 1982
  end-page: 293
  ident: BIB12
  article-title: The theoretical foundation of Zipf's law and its application to the bibliographic database environment
  publication-title: Journal of the American Society for Information Science
– start-page: 236
  year: 1981
  end-page: 255
  ident: BIB26
  article-title: Problems in the simulation of bibliographic retrieval systems
  publication-title: Information Retrieval Research
– volume: 18
  start-page: 104
  year: 1967
  end-page: 108
  ident: BIB30
  article-title: Distribution of indexing terms for maximum efficiency of information transmission
  publication-title: American Documentation
– start-page: 355
  year: 1990
  end-page: 372
  ident: BIB29
  article-title: Growth of knowledge: Bibliometric analysis using online database data
  publication-title: Informetrics 89/90
– volume: 15
  start-page: 105
  year: 1964
  end-page: 114
  ident: BIB14
  article-title: The distribution of term usage in manipulative indexes
  publication-title: American Documentation
– start-page: 233
  year: 1975
  end-page: 237
  ident: BIB3
  article-title: Storage design for information retrieval: Scarrott's conjecture and Zipf's Law
  publication-title: International Computing Symposium
– volume: 28
  start-page: 135
  year: 1992
  end-page: 151
  ident: BIB28
  article-title: Applying informetric characteristics of databases to IR system file design, Part II: Simulation comparisons
  publication-title: Information Processing and Management
– volume: 9
  start-page: 13
  year: 1973
  end-page: 32
  ident: BIB8
  article-title: A simulation model of an information retrieval system
  publication-title: Information Storage and Retrieval
– volume: 23
  start-page: 155
  year: 1987
  end-page: 170
  ident: BIB25
  article-title: The maximal value of a Zipf size variable: Sampling properties and relationship to other parameters
  publication-title: Information Processing and Management
– volume: 50
  start-page: 03-A
  year: 1989
  ident: BIB1
  article-title: A probabilistic model for the distribution of authorships and a measure of the degree of research collaboration
  publication-title: Doctoral dissertation
– start-page: 1393
  year: 1981
  end-page: 1399
  ident: BIB10
  article-title: A Zipfian model of inverted file storage requirements
  publication-title: Proceedings of the Twelfth Annual Pittsburgh Conference on Modelling and Simulation
– year: 1982
  ident: BIB17
  article-title: Probabilistic models for the simulation of bibliographic retrieval systems
  publication-title: Dissertation Abstracts International
– volume: 138
  start-page: 18
  year: 1975
  end-page: 31
  ident: BIB15
  article-title: The generalized Waring distribution, part I
  publication-title: Journal of the Royal Statistical Society, Series A
– volume: 8
  start-page: 457
  year: 1980
  end-page: 487
  ident: BIB4
  article-title: Minimum chi-square, not maximum likelihood
  publication-title: The Annals of Statistics
– year: 1980
  ident: BIB7
  article-title: Practical nonparametric statistics
– volume: 28
  start-page: 309
  year: 1985
  end-page: 312
  ident: BIB23
  article-title: Rank order distributions and secondary key indexing
  publication-title: Computer Journal
– volume: 33
  start-page: 223
  year: 1982
  end-page: 232
  ident: BIB11
  article-title: A Zipfian model of an automatic bibliographic system: An application to Medline
  publication-title: Journal of the American Society for Information Science
– start-page: 271
  year: 1988
  end-page: 278
  ident: BIB24
  article-title: What's the use of bibliometrics
  publication-title: Informetrics 87/88
– volume: 138
  start-page: 204
  year: 1975
  end-page: 227
  ident: BIB16
  article-title: The generalized Waring distribution, part II
  publication-title: Journal of the Royal Statistical Society, Series A
– volume: 24
  start-page: 20
  year: 1987
  end-page: 24
  ident: BIB5
  article-title: Bradford analysis of authorship dispersion for database design
  publication-title: Proceedings of the ASIS Annual Meeting
– volume: 8
  start-page: 67
  year: 1983
  end-page: 73
  ident: BIB18
  article-title: The use of term co-occurrence information in information retrieval
  publication-title: Canadian Journal of Information Science
– volume: 13
  start-page: 127
  year: 1979
  end-page: 142
  ident: BIB2
  article-title: The bibliographic record: An analysis of the size of its constituent parts
  publication-title: Program
– volume: 42
  start-page: 03-A
  year: 1981
  ident: BIB9
  article-title: Modelling an automatic bibliographic system: A Zipfian approach
  publication-title: Doctoral dissertation
– year: 1965
  ident: BIB6
  article-title: Subroutine STEPIT: An algorithm that finds the values of the parameters which minimize a given continuous function
– year: 1965
  ident: 10.1016/0306-4573(92)90098-K_BIB6
– volume: 15
  start-page: 105
  issue: 2
  year: 1964
  ident: 10.1016/0306-4573(92)90098-K_BIB14
  article-title: The distribution of term usage in manipulative indexes
  publication-title: American Documentation
  doi: 10.1002/asi.5090150208
– start-page: 271
  year: 1988
  ident: 10.1016/0306-4573(92)90098-K_BIB24
  article-title: What's the use of bibliometrics
– year: 1982
  ident: 10.1016/0306-4573(92)90098-K_BIB17
  article-title: Probabilistic models for the simulation of bibliographic retrieval systems
– volume: 28
  start-page: 309
  issue: 3
  year: 1985
  ident: 10.1016/0306-4573(92)90098-K_BIB23
  article-title: Rank order distributions and secondary key indexing
  publication-title: Computer Journal
  doi: 10.1093/comjnl/28.3.309
– volume: 42
  start-page: 03-A
  year: 1981
  ident: 10.1016/0306-4573(92)90098-K_BIB9
  article-title: Modelling an automatic bibliographic system: A Zipfian approach
– volume: 24
  start-page: 20
  year: 1987
  ident: 10.1016/0306-4573(92)90098-K_BIB5
  article-title: Bradford analysis of authorship dispersion for database design
  publication-title: Proceedings of the ASIS Annual Meeting
– volume: 23
  start-page: 155
  issue: 3
  year: 1987
  ident: 10.1016/0306-4573(92)90098-K_BIB25
  article-title: The maximal value of a Zipf size variable: Sampling properties and relationship to other parameters
  publication-title: Information Processing and Management
  doi: 10.1016/0306-4573(87)90001-X
– volume: 9
  start-page: 89
  issue: 2
  year: 1976
  ident: 10.1016/0306-4573(92)90098-K_BIB27
  article-title: Data element statistics for the MARC II data base
  publication-title: Journal of Library Automation
– volume: 33
  start-page: 223
  year: 1982
  ident: 10.1016/0306-4573(92)90098-K_BIB11
  article-title: A Zipfian model of an automatic bibliographic system: An application to Medline
  publication-title: Journal of the American Society for Information Science
  doi: 10.1002/asi.4630330406
– volume: 36
  start-page: 283
  year: 1985
  ident: 10.1016/0306-4573(92)90098-K_BIB21
  article-title: Split size-rank models for the distribution of index terms
  publication-title: Journal of the American Society for Information Science
  doi: 10.1002/asi.4630360502
– start-page: 1393
  year: 1981
  ident: 10.1016/0306-4573(92)90098-K_BIB10
  article-title: A Zipfian model of inverted file storage requirements
– volume: 33
  start-page: 285
  year: 1982
  ident: 10.1016/0306-4573(92)90098-K_BIB12
  article-title: The theoretical foundation of Zipf's law and its application to the bibliographic database environment
  publication-title: Journal of the American Society for Information Science
  doi: 10.1002/asi.4630330507
– year: 1975
  ident: 10.1016/0306-4573(92)90098-K_BIB22
– volume: 31
  start-page: 185
  issue: 3
  year: 1975
  ident: 10.1016/0306-4573(92)90098-K_BIB13
  article-title: Index term input to IR systems
  publication-title: Journal of Documentation
  doi: 10.1108/eb026600
– volume: 28
  start-page: 135
  issue: 1
  year: 1992
  ident: 10.1016/0306-4573(92)90098-K_BIB28
  article-title: Applying informetric characteristics of databases to IR system file design, Part II: Simulation comparisons
  publication-title: Information Processing and Management
  doi: 10.1016/0306-4573(92)90099-L
– volume: 50
  start-page: 03-A
  year: 1989
  ident: 10.1016/0306-4573(92)90098-K_BIB1
  article-title: A probabilistic model for the distribution of authorships and a measure of the degree of research collaboration
– volume: 8
  start-page: 457
  issue: 3
  year: 1980
  ident: 10.1016/0306-4573(92)90098-K_BIB4
  article-title: Minimum chi-square, not maximum likelihood
  publication-title: The Annals of Statistics
  doi: 10.1214/aos/1176345003
– volume: 45
  start-page: 227
  issue: 3
  year: 1989
  ident: 10.1016/0306-4573(92)90098-K_BIB20
  article-title: Stochastic models for the distribution of index terms
  publication-title: Journal of Documentation
  doi: 10.1108/eb026845
– volume: 24
  start-page: 541
  issue: 5
  year: 1988
  ident: 10.1016/0306-4573(92)90098-K_BIB19
  article-title: Correlation of term usage and term indexing frequencies
  publication-title: Information Processing and Management
  doi: 10.1016/0306-4573(88)90023-4
– start-page: 236
  year: 1981
  ident: 10.1016/0306-4573(92)90098-K_BIB26
  article-title: Problems in the simulation of bibliographic retrieval systems
– start-page: 233
  year: 1975
  ident: 10.1016/0306-4573(92)90098-K_BIB3
  article-title: Storage design for information retrieval: Scarrott's conjecture and Zipf's Law
– volume: 9
  start-page: 13
  year: 1973
  ident: 10.1016/0306-4573(92)90098-K_BIB8
  article-title: A simulation model of an information retrieval system
  publication-title: Information Storage and Retrieval
  doi: 10.1016/0020-0271(73)90004-1
– year: 1980
  ident: 10.1016/0306-4573(92)90098-K_BIB7
– volume: 138
  start-page: 204
  issue: 2
  year: 1975
  ident: 10.1016/0306-4573(92)90098-K_BIB16
  article-title: The generalized Waring distribution, part II
  publication-title: Journal of the Royal Statistical Society, Series A
  doi: 10.2307/2984648
– start-page: 355
  year: 1990
  ident: 10.1016/0306-4573(92)90098-K_BIB29
  article-title: Growth of knowledge: Bibliometric analysis using online database data
– volume: 13
  start-page: 127
  issue: 3
  year: 1979
  ident: 10.1016/0306-4573(92)90098-K_BIB2
  article-title: The bibliographic record: An analysis of the size of its constituent parts
  publication-title: Program
  doi: 10.1108/eb046796
– volume: 18
  start-page: 104
  year: 1967
  ident: 10.1016/0306-4573(92)90098-K_BIB30
  article-title: Distribution of indexing terms for maximum efficiency of information transmission
  publication-title: American Documentation
  doi: 10.1002/asi.5090180209
– volume: 8
  start-page: 67
  year: 1983
  ident: 10.1016/0306-4573(92)90098-K_BIB18
  article-title: The use of term co-occurrence information in information retrieval
  publication-title: Canadian Journal of Information Science
– volume: 138
  start-page: 18
  issue: 1
  year: 1975
  ident: 10.1016/0306-4573(92)90098-K_BIB15
  article-title: The generalized Waring distribution, part I
  publication-title: Journal of the Royal Statistical Society, Series A
  doi: 10.2307/2345247
SSID ssj0004512
Score 1.4441959
Snippet This study examines how informetric characteristics of information retrieval (IR) system databases can be used to help the systems designer decide what types...
Examines how informetric characteristics of information retrieval (IR) system databases can be used to help system designers decide what type of file...
Informetric characteristics of information retrieval (IR) system databases can be used to help the systems designer decide what types of file structures would...
Contribution to a special issue devoted to informetrics. Examines ways in which informetrics characteristics of information retrieval (IR) system data bases...
SourceID proquest
pascalfrancis
eric
crossref
elsevier
SourceType Aggregation Database
Index Database
Enrichment Source
Publisher
StartPage 121
SubjectTerms Bibliometrics. Scientometrics. Evaluation
Characteristics
Computer System Design
Computerized information storage and retrieval
Computerized subject indexing
Database Design
Design
Exact sciences and technology
File design
Foreign Countries
Higher Education
Indexing
Information and communication sciences
Information Retrieval
Information Science
Information science. Documentation
Information storage and retrieval
Information work
Informetrics
Library and information science. General aspects
Library Catalogs
Library Schools
Literature Reviews
Mathematical analysis
Mathematical Models
Online Catalogs
Poisson Distribution
Sciences and techniques of general use
Statistical Distributions
Studies
Subject Index Terms
Subject indexing
Systems design
University of Western Ontario (Canada)
Title Applying informetric characteristics of databases to ir system file design, part I: Informetric models
URI https://dx.doi.org/10.1016/0306-4573(92)90098-K
http://eric.ed.gov/ERICWebPortal/detail?accno=EJ441801
https://www.proquest.com/docview/1416131768
https://www.proquest.com/docview/194885101
https://www.proquest.com/docview/57240241
Volume 28
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3Nb9MwFLeguyAhxMcQYevwARBoSmniJE64VWjTsvIhVZu2mxU7DhfUVmt2gL-e92zHzdahApcoipNYyvvlvd-z3wchr4ERK83iPNTVWIGDUrOwiLkKlQJ7I4t0HEkT5fs1OzlPTi_Ty3WsqskuaeVI_bozr-R_pArXQK6YJfsPkvUvhQtwDvKFI0gYjn8lY6SQP21SCnJPbI6lMJX3RglmYIMYBormylRzKGeufLOpyXRYmxCOEVDJq_awNCsEZe9tplPOqk9hXQKTwc3SphnY5YbMhcL2Y2kuvn0-nk2-rEOLa5dxt3ZEfc6LUzM91QSORpiktgnJSFvVmXMWpsw2VOl0a5xvYMgqysjmRTubG9liGBvq3K4s-NmAcxfxGzSgWAV1ujZh3bb9Lcvm4w1T23rwPtmJOY_SAdmZTGcX015Z-chtN9mJuhzLKPvgr70r4vdu4j9xGBcx_3BZreAHa2xnlA0jb5jL2WPyyLkcdGLx84Tc0_OnZOgSVuhb2hModTJ4Rr532KI9bNFb2KKLhnps0XZByxm12KKILeqwRRFbtKQfaQ9Z1CJrl5wfH519OgldU45QATVtQ_B41TjRVcGlxDxsncsaWGWhVFNnOTi4LJe51E0l4U-PdJNmtayyrKoUjGacsedkMF_M9QtCdSNrBp82AjOTpIpXPFFMxaAqlFRFUgeEdZ9ZKFexHhun_BBdaCIKR6BwRBELIxwxDUjon1raii1b7uedBIVjnZZNCkDhlid3UeB-lqNT8C2A8QVkeAMB_gYHw4Dsd4gQTpuswAUH3wvIfJYHZO-O4QIsLdrPgLzyo2AJcHuvmuvF9UqkHHdKk-jllun3yAMbc47riPtk0F5d6yEw61YeuP_iwKwi_QZTsMtq
linkProvider Library Specific Holdings
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Applying+informetric+characteristics+of+databases+to+IR+system+file+design.+Part+I+%3A+Informetric+models&rft.jtitle=Information+processing+%26+management&rft.au=WOLFRAM%2C+D&rft.date=1992&rft.pub=Elsevier+Science&rft.issn=0306-4573&rft.eissn=1873-5371&rft.volume=28&rft.issue=1&rft.spage=121&rft.epage=133&rft_id=info:doi/10.1016%2F0306-4573%2892%2990098-K&rft.externalDBID=n%2Fa&rft.externalDocID=5018740
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0306-4573&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0306-4573&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0306-4573&client=summon