A Natural Language Thesaurus for the Humanities: The Need for a Database Search Aid
Database searching presents special difficulties for humanists because many subjects may be covered, many synonyms may be used to describe a single concept, and terms may vary in precision. Databases may be searched by using controlled vocabularies, free-text (natural language) terms, or a combinati...
Saved in:
Published in | The Library quarterly (Chicago) Vol. 68; no. 4; pp. 406 - 430 |
---|---|
Main Authors | , , |
Format | Journal Article |
Language | English |
Published |
Chicago, Il
University of Chicago Press
01.10.1998
University of Chicago, acting through its Press |
Subjects | |
Online Access | Get full text |
ISSN | 0024-2519 1549-652X |
DOI | 10.1086/603001 |
Cover
Loading…
Abstract | Database searching presents special difficulties for humanists because many subjects may be covered, many synonyms may be used to describe a single concept, and terms may vary in precision. Databases may be searched by using controlled vocabularies, free-text (natural language) terms, or a combination of both. A significant cause of recall failure in a free-text search is the inability of the searcher to think of all the terms an author may have used. The current study was undertaken to determine the potential value to humanists of a thesaurus integrating free-text terms from the humanities and social sciences. In the first part of the study, a sample of common-noun subject headings from the "Humanities Index" was analyzed to determine how many have at least quasi-synonymous terms. The subject headings were compared to terms in "The Contemporary Thesaurus of Social Science Terms and Synonyms: A Guide for Natural Language Computer Searching" to determine the overlap of terminology between the humanities and social sciences. The results indicate a high degree of overlap, suggesting that a thesaurus integrating terms from the humanities and the social sciences would be of value to scholars in both disciplines. Results also demonstrate that a high proportion of common-noun subject headings have at least quasi-synonymous terms useful for searching. In the second part of the study, searches for humanities scholars were conducted on controlled-vocabulary databases, using both controlled vocabulary and free-text terms to determine whether the latter retrieved additional relevant records not retrieved by the controlled vocabulary. The results indicate that combining both approaches yields more relevant items and higher recall than either method alone. Searchers need tools to identify both controlled-vocabulary terms and free-text terms. The proposed free-text thesaurus will complement controlled-vocabulary thesauri. |
---|---|
AbstractList | Database searching presents special difficulties for humanists because many subjects may be covered, many synonyms may be used to describe a single concept, and terms may vary in precision. Databases may be searched by using controlled vocabularies, free text (natural language) terms, or a combination of both. A significant cause of recall failure in a free text search is the inability of the searcher to think of all the terms an author may have used. Describes a study to determine the potential value to humanists of a thesaurus integrating free text terms from the humanities and social sciences. (Original abstract - amended) Database searching presents special difficulties for humanists because many subjects may be covered, many synonyms may be used to describe a single concept, and terms may vary in precision. Databases may be searched by using controlled vocabularies, free-text (natural language) terms, or a combination of both. A significant cause of recall failure in a free-text search is the inability of the searcher to think of all the terms an author may have used. The current study was undertaken to determine the potential value to humanists of a thesaurus integrating free-text terms from the humanities and social sciences. In the first part of the study, a sample of common-noun subject headings from the "Humanities Index" was analyzed to determine how many have at least quasi-synonymous terms. The subject headings were compared to terms in "The Contemporary Thesaurus of Social Science Terms and Synonyms: A Guide for Natural Language Computer Searching" to determine the overlap of terminology between the humanities and social sciences. The results indicate a high degree of overlap, suggesting that a thesaurus integrating terms from the humanities and the social sciences would be of value to scholars in both disciplines. Results also demonstrate that a high proportion of common-noun subject headings have at least quasi-synonymous terms useful for searching. In the second part of the study, searches for humanities scholars were conducted on controlled-vocabulary databases, using both controlled vocabulary and free-text terms to determine whether the latter retrieved additional relevant records not retrieved by the controlled vocabulary. The results indicate that combining both approaches yields more relevant items and higher recall than either method alone. Searchers need tools to identify both controlled-vocabulary terms and free-text terms. The proposed free-text thesaurus will complement controlled-vocabulary thesauri. Database searching presents special difficulties for humanists because many subjects may be covered, many synonyms may be used to describe a single concept, and terms may vary in precision. Databases may be searched by using controlled vocabularies, free-text (natural language) terms, or a combination of both. A study to determine the value to humanists of a thesaurus integrating free-text terms from the humanities and social sciences found a high degree of overlap, suggesting that such a thesaurus would be useful to scholars in both disciplines. Results also demonstrated that combining controlled vocabulary and free-text term searching yielded more relevant records than either method alone. (PEN) |
Author | Laura B. Cohen Sara D. Knapp D. R. Juedes |
Author_xml | – sequence: 1 fullname: Knapp, Sara D – sequence: 2 fullname: Cohen, Laura B – sequence: 3 fullname: Juedes, D. R |
BackLink | http://eric.ed.gov/ERICWebPortal/detail?accno=EJ579998$$DView record in ERIC http://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=1655887$$DView record in Pascal Francis |
BookMark | eNp10U1v1DAQBmALFYltgV_AwQLEiYDt-JPbqi0UtCqHFolbNHHGXa-ySbGdA_--2e6qlYrwxYf30TsjzTE5GsYBCXnN2SfOrP6sWc0Yf0YWXElXaSV-H5EFY0JWQnH3ghznvGHz01ouyNWSXkKZEvR0BcPNBDdIr9eYYUpTpmFMtKyRXkxbGGKJmL_sUnqJ2N2HQM-gQAsZ6RVC8mu6jN1L8jxAn_HV4T8hv76eX59eVKuf376fLleVr40tledBi8CtdLJzTppguW2NChg8tLUMzssOFfOtabXvOGulkEZ4I1p02rK2PiEf9r23afwzYS7NNmaPfQ8DjlNulJFaMWtm-PYJ3IxTGubdGj4Pno3aoXf_RTsiDK_5rN4fFGQPfUgw-Jib2xS3kP42XCtl7ye-2TNM0T-k5z-Ucc7Zx819GnNOGB4LWLM7YrM_4gw_PoE-FihxHEqC2P_LD2M3uYzpoVTWzAmp6ztJmKNd |
CODEN | LIBQAS |
CitedBy_id | crossref_primary_10_1016_j_acalib_2004_12_006 crossref_primary_10_1080_00048623_2003_10755223 crossref_primary_10_1108_JD_05_2021_0094 crossref_primary_10_1111_lic3_12507 crossref_primary_10_1590_S0100_19652002000100005 crossref_primary_10_3233_EFI_230037 crossref_primary_10_1002_asi_20319 crossref_primary_10_1080_00048623_2004_10755265 crossref_primary_10_3743_KOSIM_2010_27_2_225 crossref_primary_10_1108_JD_12_2019_0231 |
ContentType | Journal Article |
Copyright | Copyright 1998 The University of Chicago 1999 INIST-CNRS Copyright University of Chicago, acting through its Press Oct 1998 |
Copyright_xml | – notice: Copyright 1998 The University of Chicago – notice: 1999 INIST-CNRS – notice: Copyright University of Chicago, acting through its Press Oct 1998 |
DBID | AAYXX CITATION 7SW BJH BNH BNI BNJ BNO ERI PET REK WWN IQODW EOLOZ FUVTR IOIBA K30 PAAUG PAWHS PAWZZ PAXOH PBHAV PBQSW PBYQZ PCIWU PCMID PCZJX PDGRG PDWWI PETMR PFVGT PGXDX PIHIL PISVA PJCTQ PJTMS PLCHJ PMHAD PNQDJ POUND PPLAD PQAPC PQCAN PQCMW PQEME PQHKH PQMID PQNCT PQNET PQSCT PQSET PSVJG PVMQY PZGFC E3H F2A |
DOI | 10.1086/603001 |
DatabaseName | CrossRef ERIC ERIC (Ovid) ERIC ERIC ERIC (Legacy Platform) ERIC( SilverPlatter ) ERIC ERIC PlusText (Legacy Platform) Education Resources Information Center (ERIC) ERIC Pascal-Francis Periodicals Index Online Segment 01 Periodicals Index Online Segment 06 Periodicals Index Online Segment 29 Periodicals Index Online Primary Sources Access—Foundation Edition (Plan E) - West Primary Sources Access (Plan D) - International Primary Sources Access & Build (Plan A) - MEA Primary Sources Access—Foundation Edition (Plan E) - Midwest Primary Sources Access—Foundation Edition (Plan E) - Northeast Primary Sources Access (Plan D) - Southeast Primary Sources Access (Plan D) - North Central Primary Sources Access—Foundation Edition (Plan E) - Southeast Primary Sources Access (Plan D) - South Central Primary Sources Access & Build (Plan A) - UK / I Primary Sources Access (Plan D) - Canada Primary Sources Access (Plan D) - EMEALA Primary Sources Access—Foundation Edition (Plan E) - North Central Primary Sources Access—Foundation Edition (Plan E) - South Central Primary Sources Access & Build (Plan A) - International Primary Sources Access—Foundation Edition (Plan E) - International Primary Sources Access (Plan D) - West Periodicals Index Online Segments 1-50 Primary Sources Access (Plan D) - APAC Primary Sources Access (Plan D) - Midwest Primary Sources Access (Plan D) - MEA Primary Sources Access—Foundation Edition (Plan E) - Canada Primary Sources Access—Foundation Edition (Plan E) - UK / I Primary Sources Access—Foundation Edition (Plan E) - EMEALA Primary Sources Access & Build (Plan A) - APAC Primary Sources Access & Build (Plan A) - Canada Primary Sources Access & Build (Plan A) - West Primary Sources Access & Build (Plan A) - EMEALA Primary Sources Access (Plan D) - Northeast Primary Sources Access & Build (Plan A) - Midwest Primary Sources Access & Build (Plan A) - North Central Primary Sources Access & Build (Plan A) - Northeast Primary Sources Access & Build (Plan A) - South Central Primary Sources Access & Build (Plan A) - Southeast Primary Sources Access (Plan D) - UK / I Primary Sources Access—Foundation Edition (Plan E) - APAC Primary Sources Access—Foundation Edition (Plan E) - MEA Library & Information Sciences Abstracts (LISA) Library & Information Science Abstracts (LISA) |
DatabaseTitle | CrossRef ERIC Periodicals Index Online Segments 1-50 Periodicals Index Online Segment 06 Periodicals Index Online Periodicals Index Online Segment 29 Periodicals Index Online Segment 01 Library and Information Science Abstracts (LISA) |
DatabaseTitleList | Library and Information Science Abstracts (LISA) Library and Information Science Abstracts (LISA) ERIC |
Database_xml | – sequence: 1 dbid: ERI name: ERIC url: https://eric.ed.gov/ sourceTypes: Index Database |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Library & Information Science |
EISSN | 1549-652X |
ERIC | EJ579998 |
EndPage | 430 |
ExternalDocumentID | 40563962 1655887 EJ579998 10_1086_603001 4309246 |
Genre | Feature |
GroupedDBID | -ET -~X .4I 07C 0R~ 29L 2AX 4.4 5.N 5GY 77K 85S 90D AAEDO AAHCP AAXPP AAYOK ABABT ABBHK ABDBF ABECW ABPEO ABWJO ACGFS ACHQT ACNCT ACNXV ADMHG ADPTO ADTZG ADULT AEGXH AEUPB AFFNX AHEXP ALMA_UNASSIGNED_HOLDINGS ASUFR AS~ BKOMP CS3 DU5 EBS EJD ELW EZTEY F5P GPZZG HVGLF HZ~ H~9 IPSME JAAYA JAB JBMMH JENOY JHFFW JKQEH JLEZI JLXEF JPL JST L7B MS~ NHB O9- P-O P2P PQQKQ RCP SA0 TAE TN5 UFCQG UHB UPT WH7 YQT YZZ ZCG ~45 123 AAYXX ABCQX ABKTN ACIOK ACREJ AHKVK AIATT CITATION DGPHC ECVKH HF~ HYQOX LPU MVM O-F PMFND PMKZF PVKVW Q5E QZG XSW ~P4 ~P5 77I 7SW BJH BNH BNI BNJ BNO ERI PET REK WWN 0B8 AZRUE BHNFS IQODW JSODD VQA VXZ EOLOZ FUVTR IOIBA K30 PAAUG PAWHS PAWZZ PAXOH PBHAV PBQSW PBYQZ PCIWU PCMID PCZJX PDGRG PDWWI PETMR PFVGT PGXDX PIHIL PISVA PJCTQ PJTMS PLCHJ PMHAD PNQDJ POUND PPLAD PQAPC PQCAN PQCMW PQEME PQHKH PQMID PQNCT PQNET PQSCT PQSET PSVJG PVMQY PZGFC E3H F2A |
ID | FETCH-LOGICAL-c378t-c1f62f18494d9947f818b75fefcab34f9c4de50cb7b6cd10b42472c72be9680b3 |
ISSN | 0024-2519 |
IngestDate | Fri Sep 05 04:43:24 EDT 2025 Fri Jul 25 05:50:04 EDT 2025 Fri Jul 25 23:53:42 EDT 2025 Sun Feb 16 07:29:56 EST 2025 Tue Sep 02 19:20:35 EDT 2025 Sat Jul 26 02:22:16 EDT 2025 Thu Apr 24 23:00:05 EDT 2025 Thu Jul 03 21:14:30 EDT 2025 |
IsDoiOpenAccess | false |
IsOpenAccess | false |
IsPeerReviewed | true |
IsScholarly | true |
Issue | 4 |
Keywords | Humanities Terminology Controlled vocabulary Thesaurus Subject heading Database Information retrieval Natural language Full text Comparative study Information language |
Language | English |
License | CC BY 4.0 |
LinkModel | OpenURL |
MergedId | FETCHMERGED-LOGICAL-c378t-c1f62f18494d9947f818b75fefcab34f9c4de50cb7b6cd10b42472c72be9680b3 |
Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 ObjectType-Feature-1 content type line 23 |
PQID | 1750827131 |
PQPubID | 1817283 |
PageCount | 25 |
ParticipantIDs | proquest_miscellaneous_57465087 proquest_journals_194750857 proquest_journals_1750827131 pascalfrancis_primary_1655887 eric_primary_EJ579998 crossref_primary_10_1086_603001 crossref_citationtrail_10_1086_603001 jstor_primary_4309246 |
PublicationCentury | 1900 |
PublicationDate | 1998-10-01 |
PublicationDateYYYYMMDD | 1998-10-01 |
PublicationDate_xml | – month: 10 year: 1998 text: 1998-10-01 day: 01 |
PublicationDecade | 1990 |
PublicationPlace | Chicago, Il |
PublicationPlace_xml | – name: Chicago, Il – name: Chicago |
PublicationTitle | The Library quarterly (Chicago) |
PublicationYear | 1998 |
Publisher | University of Chicago Press University of Chicago, acting through its Press |
Publisher_xml | – name: University of Chicago Press – name: University of Chicago, acting through its Press |
SSID | ssj0000664 |
Score | 1.5136994 |
Snippet | Database searching presents special difficulties for humanists because many subjects may be covered, many synonyms may be used to describe a single concept,... A study to determine the value to humanists of a thesaurus integrating free-text terms from the humanities and social sciences found a high degree of overlap,... |
SourceID | proquest pascalfrancis eric crossref jstor |
SourceType | Aggregation Database Index Database Enrichment Source Publisher |
StartPage | 406 |
SubjectTerms | Citation searching Construction Controlled vocabularies Controlled vocabulary Exact sciences and technology Free Text Searching Full text searching Humanities Information and communication sciences Information processing and retrieval Information Retrieval Information science. Documentation Language Logical and linguistic tools Natural Language Needs Assessment Online Searching Relevance (Information Retrieval) Scholarship Sciences and techniques of general use Search terms Searches Social Sciences Subject headings Subject terms Synonyms Terminology Terminology. Lexicons. Thesaurus Terms Thesauri |
Title | A Natural Language Thesaurus for the Humanities: The Need for a Database Search Aid |
URI | https://www.jstor.org/stable/4309246 http://eric.ed.gov/ERICWebPortal/detail?accno=EJ579998 https://www.proquest.com/docview/1750827131 https://www.proquest.com/docview/194750857 https://www.proquest.com/docview/57465087 |
Volume | 68 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1Lb9QwELZge-GCeBSxtFt8AC5Vqmzi2DG3lLZaLY8LW6m3yHZsqVKVrprkwq9n_MhjtUUCLtHK9jpWvvF4bH8zg9CHSoMtJJYkEnEmIqJhKkrNCEy8tBKcmVRKe97x_QddXZP1TXYzJrxz3iWtPFO_HvUr-R9UoQxwtV6y_4Ds0CkUwG_AF56AMDz_CuPCco1d3Ixv4djRMiga0T10zcAfXPkQF7ee_LZxnEawMh17EkBvhV3ITj3t-LS43UncaVsHzwbrfunpn-7i1zM4JucIX2ux3fZnzCOPeHD_sA7YYszxvO505TXUxVkgLVaDM17PYht9AUhkfV-n-pTmE7khE-VIXGyBfaUduzskCuom9L0bAPtynTEwY_On6CCBvUA8QwfF-cX51WTBpSHYth_MJIWU79TFHfCd7JgfgeHuiaiWFSsamBjGZzTZW5ydxbF5gZ6HrQIuPO4v0RNdv0KLAAf-hIMnmRUvHFT0a_SzwEEmcC8TeJAJDH_AIBN4lInPthZbiXCVAvcSgb1EYJCIQ3R9dbn5sopC4oxIpSxvI7U0NDGwd-ek4pwwA1aZZJnRRgmZEsMVqXQWK8kkVdUyliQhLFEskZrTPJbpGzSr72v9FmFOCVeJ0HlFDYFecy6oyDOuDDdJtSRz9LH_nqUKUeVtcpO70rEbclp6CObo_dBu6-Oo7LU4tHAMtT1iUO7wGSpIGvOE0Dla7OA1dkuzDJbOOTru8SvDnG1KMJbB5mXLFF539Eg1fKzM5nyA4Q61oG_tJZqo9X3XlBkjdlPD3v1huEfo2ThTjtGsfej0AuzWVp4EuT1xZzS_AYBNk5k |
linkProvider | EBSCOhost |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=A+Natural+Language+Thesaurus+for+the+Humanities%3A+The+Need+for+a+Database+Search+Aid&rft.jtitle=The+Library+quarterly+%28Chicago%29&rft.au=Knapp%2C+Sara+D&rft.au=Cohen%2C+Laura+B&rft.au=Juedes%2C+D.+R&rft.date=1998-10-01&rft.issn=0024-2519&rft.volume=68&rft.issue=4&rft.spage=406&rft_id=info:doi/10.1086%2F603001&rft.externalDocID=EJ579998 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0024-2519&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0024-2519&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0024-2519&client=summon |