Relevancy and pertinency in indexing
Published in | American Documentation Vol. 13; no. 1; pp. 93 - 94 |
---|---|
Format | Journal Article |
Language | English |
Published | New York; Wiley Subscription Services, Inc., A Wiley Company; 01.01.1962; American Documentation Institute; Wiley Periodicals Inc |
ISSN | 0096-946X 0002-8231 1936-6108 1097-4571 |
DOI | 10.1002/asi.5090130113 |
Summary: | Underlying all types of subject analysis (descriptors, uniterms, subject headings, telegraphic abstracting, etc.) is the fundamental problem of selecting significant concepts and characteristics from a document to be recorded as reference points for use in future retrieval operations. Faced with several thousands of words normally found in a typical document, the analyst selects those words and ideas which seem significant, based upon his subjective knowledge of the subject matter. Such a selection is conditioned by his academic training, observance of the frequency of occurrence of certain words, knowledge of the pattern of use of the literature, acquaintance with the terminology used in the phrasing of questions to be put to the file, and comparative knowledge and ignorance of the association of ideas or relationships between the concepts recorded in the document. Pertinency is therefore in the eyes of the beholder and is relevant to the state of knowledge at any given time.
The difficulty in subject analysis is one of recording characteristics for retrieval at a later date when the implications inherent in future requests are unknown at the time of recording and when the terminology has not yet crystallized into any standardized form. In the absence of a permanent description, information requests can either be translated into the archaic language and frozen concepts of the file or the file itself can be updated to match modern concepts and associations and to bring out implications subsequently made apparent by continually evolving technology. The continuous shift in traditional interests is illustrated in the current awareness type of literature search where the constant rearrangement of concepts is seen in the attempt to define interests whose relevance is not yet established.
Superimposed on this is the problem of finding suitable words which characterize these shifting concepts. The words of the document are not necessarily those which are in current use, nor will they always be the same words used to characterize an information request put to the file at a later date. Thus it is necessary to use an artificial language (code, authority, list, notation, etc.) into which the natural language of the text and the language of the request can be converted. This language should serve as a more permanent and regularized language which cuts through the tangle of synonyms and the infinity of syntactic structures. The coded thesaurus is suggested as a means of providing this intermediary language while also bringing into coincidence the vocabularies of future searches and of the retrieval system and indicating networks of related meaning and associated ideas. The association of ideas in the semantic code is suggested as a yardstick of predetermined relevancy. Experimental data will be presented to facilitate the establishment of objective criteria of relevancy and pertinency in searching operations. |
---|---|
Bibliography: | Presented at the Annual Meeting of the ADI, Boston, Nov. 5-8, 1961. ark:/67375/WNG-W40Z1D3M-7 ArticleID:ASI5090130113 istex:F9A5FA7BD73EDE841000A8FF03CFAC69417FFD5D |
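
The intermediary-language idea described in the summary can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the thesaurus entries, the codes (`V01`, `M10`), the association table, and the function names `encode` and `relevance` are all hypothetical and are not taken from the article, whose actual semantic code is not reproduced in this record. The sketch converts both document text and request text into the same codes, then reports direct code matches and matches reached through associated codes.

```python
# Hypothetical coded thesaurus: synonyms from documents and requests
# map to one shared code. All entries are invented for illustration.
THESAURUS = {
    "automobile": "V01",
    "car": "V01",
    "motorcar": "V01",
    "engine": "M10",
    "motor": "M10",
}

# Hypothetical network of associated ideas between codes.
RELATED = {
    "V01": {"M10"},
    "M10": {"V01"},
}


def encode(text):
    """Convert natural-language text into the set of thesaurus codes it contains."""
    words = (w.strip(".,;:?!").lower() for w in text.split())
    return {THESAURUS[w] for w in words if w in THESAURUS}


def relevance(document, request):
    """Compare a document and a request in the intermediary code language.

    'direct' holds codes shared by both; 'associated' holds document codes
    that are only linked to the request through the RELATED network.
    """
    doc_codes = encode(document)
    req_codes = encode(request)
    direct = doc_codes & req_codes
    # Expand the request with every code associated with its own codes.
    expanded = set().union(*(RELATED.get(c, set()) for c in req_codes))
    associated = (doc_codes & expanded) - direct
    return {"direct": direct, "associated": associated}


if __name__ == "__main__":
    # Older document wording and newer request wording meet on the same code.
    print(relevance("The motorcar was repaired.", "car maintenance"))
    # No shared code, but the association network links the two.
    print(relevance("A new motor design", "automobile"))
```

In this sketch the document's "motorcar" and the request's "car" never match as words, yet they coincide once both are converted to the shared code, which is the role the summary assigns to the artificial language. The `RELATED` table stands in, very loosely, for the association of ideas that the summary proposes as a yardstick of predetermined relevancy.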