Relevancy and pertinency in indexing

Underlying all types of subject analysis—descriptors, uniterms, subject headings, telegraphic ing, etc.—is the fundamental problem of selection of significant concepts and characteristics from a document to be recorded as reference points for use in future retrieval operations. Faced with several th...

Full description

Saved in:
Bibliographic Details
Published inAmerican Documentation Vol. 13; no. 1; pp. 93 - 94
Main Author Rees, Alan M.
Format Journal Article
LanguageEnglish
Published New York Wiley Subscription Services, Inc., A Wiley Company 01.01.1962
American Documentation Institute
Wiley Periodicals Inc
Subjects
Online AccessGet full text
ISSN0096-946X
0002-8231
1936-6108
1097-4571
DOI10.1002/asi.5090130113

Cover

Loading…
More Information
Summary:Underlying all types of subject analysis—descriptors, uniterms, subject headings, telegraphic ing, etc.—is the fundamental problem of selection of significant concepts and characteristics from a document to be recorded as reference points for use in future retrieval operations. Faced with several thousands of words normally found in a typical document, the analyst selects those words and ideas which seem significant, based upon his subjective knowledge of the subject matter. Such a selection is conditioned by his academic training, observance of the frequency of occurrence of certain words, knowledge of the pattern of use of the literature, acquaintance with the terminology used in the phrasing of questions to be put to the file, and comparative knowledge and ignorance of the association of ideas or relationships between the concepts recorded in the document. Pertinency is therefore in the eyes of the beholder and is relevant to the state of knowledge at any given time. The difficulty in subject analysis is one of recording characteristics for retrieval at a later date when the implications inherent in future requests are unknown at the time of recording and when the terminology has not yet crystallized into any standardized form. In the absence of a permanent description, information requests can either be translated into the archaic language and frozen concepts of the file or the file itself can be updated to match modern concepts and associations and to bring out implications subsequently made apparent by continually evolving technology. The continuous shift in traditional interests is illustrated in the current awareness type of literature search where the constant rearrangement of concepts is seen in the attempt to define interests whose relevance is not yet established. Superimposed on this is the problem of finding suitable words which characterize these shifting concepts. The words of the document are not necessarily those which are in current use, nor will they always be the same words used to characterize an information request put to the file at a later date. Thus it is necessary to use an artificial language (code, authority, list, notation, etc.) into which the natural language of the text and the languge of the request can be converted. This language should be such that it would serve as a more permanent and regularized language which would cut through the tangle of synonyms and infinity of syntactic structures. The coded thesaurus is suggested as a means of providing for this intermediary language at the same time as performing the function of being a means of bringing into coincidence the vocabularies of the future searches and retrieval system and indicate networks of related meaning and associated ideas. The association of ideas in the semantic code is suggested as a yardstick of predetermined relevancy. Experimental data will be presented to facilitate the establishment of objective criteria of relevancy and pertinency in searching operations.
Bibliography:Presented at the Annual Meeting of the ADI, Boston, Nov. 5-8, 1961.
ark:/67375/WNG-W40Z1D3M-7
ArticleID:ASI5090130113
istex:F9A5FA7BD73EDE841000A8FF03CFAC69417FFD5D
Presented at the Annual Meeting of the ADI, Boston, Nov. 5‐8, 1961.
ObjectType-Article-2
SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 14
ObjectType-Article-1
ISSN:0096-946X
0002-8231
1936-6108
1097-4571
DOI:10.1002/asi.5090130113