A Consideration of a Methodology of the Object-oriented Term Weighting Using Hierarchical Term Classification for Medical Document Analysis

In this paper, we consider a methodology of the object-oriented term weighting, by using a hierarchical structure of terms in medical documents according to analytical purposes. The hierarchical term classification exploits logical negation and medical information of ranking corresponding to ICD-10...

Full description

Saved in:
Bibliographic Details
Published inJapan Journal of Medical Informatics Vol. 38; no. 2; pp. 69 - 79
Main Authors Matsuo, R, TB, Ho, Ikeda, M, Tanaka, K, Chen, W
Format Journal Article
LanguageJapanese
Published Japan Association for Medical Informatics 15.06.2018
Subjects
Online AccessGet full text

Cover

Loading…
Abstract In this paper, we consider a methodology of the object-oriented term weighting, by using a hierarchical structure of terms in medical documents according to analytical purposes. The hierarchical term classification exploits logical negation and medical information of ranking corresponding to ICD-10 codes and consists of the category of terms as the nodes. It is employed to generate weighting rules for the object-oriented term weighting and we capture the order relation among the categories by giving the weights based on analytical purposes to the categories. Specifically, we generate three weighting rules from two features of the hierarchical term classification: the term hierarchy and the exploitation of medical information of ranking, and give higher weight to terms where it is located at the deep layer, non-negative terms’ categories and the higher rank in the hierarchy as important terms for a certain analytical purpose. The experimental results on mortality prediction which is one of the analytical purposes have indicated the effectiveness of the object-oriented term weighting. Therefore, it was suggested that the proposed methodology of the object-oriented term weighting is effective. Although, we regard the terms which correspond to ICD-10 as the dependent and important terms for analytical purposes, we considered the exploitation of machine learning techniques to capture the similar dependencies regarding analytical purposes for the terms which do not correspond to ICD-10. The proposed methodology and the order relation among terms’ categories derived from the weighting rules, a dictionary of terms and the weights have potential to contribute to the enhancement of knowledge acquisition support by big data analysis in medical domain.
AbstractList In this paper, we consider a methodology of the object-oriented term weighting, by using a hierarchical structure of terms in medical documents according to analytical purposes. The hierarchical term classification exploits logical negation and medical information of ranking corresponding to ICD-10 codes and consists of the category of terms as the nodes. It is employed to generate weighting rules for the object-oriented term weighting and we capture the order relation among the categories by giving the weights based on analytical purposes to the categories. Specifically, we generate three weighting rules from two features of the hierarchical term classification: the term hierarchy and the exploitation of medical information of ranking, and give higher weight to terms where it is located at the deep layer, non-negative terms’ categories and the higher rank in the hierarchy as important terms for a certain analytical purpose. The experimental results on mortality prediction which is one of the analytical purposes have indicated the effectiveness of the object-oriented term weighting. Therefore, it was suggested that the proposed methodology of the object-oriented term weighting is effective. Although, we regard the terms which correspond to ICD-10 as the dependent and important terms for analytical purposes, we considered the exploitation of machine learning techniques to capture the similar dependencies regarding analytical purposes for the terms which do not correspond to ICD-10. The proposed methodology and the order relation among terms’ categories derived from the weighting rules, a dictionary of terms and the weights have potential to contribute to the enhancement of knowledge acquisition support by big data analysis in medical domain.
Author Matsuo, R
Tanaka, K
TB, Ho
Ikeda, M
Chen, W
Author_xml – sequence: 1
  fullname: Matsuo, R
  organization: Research Center for Service Science, Japan Advanced Institute of Science and Technology
– sequence: 2
  fullname: TB, Ho
  organization: School of Knowledge Science, Japan Advanced Institute of Science and Technology
– sequence: 3
  fullname: Ikeda, M
  organization: Research Center for Service Science, Japan Advanced Institute of Science and Technology
– sequence: 4
  fullname: Tanaka, K
  organization: School of Knowledge Science, Japan Advanced Institute of Science and Technology
– sequence: 5
  fullname: Chen, W
  organization: School of Knowledge Science, Japan Advanced Institute of Science and Technology
BookMark eNo9kE1OwzAQRi1UJErpigv4AilOnLjOgkVVfopU1E0Ry8ixx4mjxEa2WfQMXJqkRWy-kd58eiPNLZpZZwGh-5Ss0rzM-UMnBrOifMXKKzTPUs4TnrNyhuYk42XCSVHcoGUIpiaErIuU5GSOfjZ462wwCryIxlnsNBb4HWLrlOtdc5pAbAEf6g5kTJw3YCMofAQ_4E8wTRuNbfBHmHJnRo2XrZGivzS2vRgv6hGc7dr5Ua7O-ycnv4dRhjdW9Kdgwh261qIPsPybC3R8eT5ud8n-8Pq23eyTjtM8UWvJZS14mnKqKTDJIWc60yzTpWKMEp3SIpU5K5TSQElWck2zmkE2dhTRdIEeL9ouRNFA9eXNIPypEj4a2UM1vbGivMqmYOU_l63wFVj6C4Rnc5g
ContentType Journal Article
Copyright 2018 Japan Association for Medical Informatics
Copyright_xml – notice: 2018 Japan Association for Medical Informatics
DOI 10.14948/jami.38.69
DatabaseTitleList
DeliveryMethod fulltext_linktorsrc
EISSN 2188-8469
EndPage 79
ExternalDocumentID article_jami_38_2_38_69_article_char_en
GroupedDBID ALMA_UNASSIGNED_HOLDINGS
JSF
KQ8
OK1
RJT
ID FETCH-LOGICAL-j834-d7c8cba81183f3e6c8e46f2f62f9d6630f1351c465ddfe30298f32b6e22f6d0f3
ISSN 0289-8055
IngestDate Wed Apr 05 07:11:52 EDT 2023
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed false
IsScholarly true
Issue 2
Language Japanese
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-j834-d7c8cba81183f3e6c8e46f2f62f9d6630f1351c465ddfe30298f32b6e22f6d0f3
OpenAccessLink https://www.jstage.jst.go.jp/article/jami/38/2/38_69/_article/-char/en
PageCount 11
ParticipantIDs jstage_primary_article_jami_38_2_38_69_article_char_en
PublicationCentury 2000
PublicationDate 2018/06/15
PublicationDateYYYYMMDD 2018-06-15
PublicationDate_xml – month: 06
  year: 2018
  text: 2018/06/15
  day: 15
PublicationDecade 2010
PublicationTitle Japan Journal of Medical Informatics
PublicationTitleAlternate Japan Journal of Medical Informatics
PublicationYear 2018
Publisher Japan Association for Medical Informatics
Publisher_xml – name: Japan Association for Medical Informatics
References 1) Salton G, Buckley C. Term-weighting approaches in automatic text retrieval. Information Processing & Management 1988 ; 24, 5 : 513-523.
13) Martineau J, Finin T. Delta TFIDF : An improved feature space for sentiment analysis. In Proceedings of the Third AAAI International Conference on Weblogs and Social Media 2009.
28) Pedregosa, F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, et al. Scikit-learn: Machine learning in Python. Journal of Machine Learning Research 2011 ; 12(Oct) : 2825-2830.
2) Ramos J. Using tf-idf to determine word relevance in document queries. Technical report, Department of Computer Science, Rutgers University 2003.
4) Zhang X, Jing L, Hu X, Ng M, Jiangxi JX, Zhou X. Medical document clustering using ontology-based term similarity measures. International Journal of Data Warehousing and Mining (IJDWM) 2008 ; 4, 1 : 62-73.
5) Zakos J, Verma B. Concept-based term weighting for web information retrieval. International Journal of Computational Intelligence and Applications 2006 ; 6, 2 : 193-207.
30) Garla VN, Brandt C. Ontology-guided feature engineering for clinical text classification. J Biomed Inform 2012 Oct ; 45, 5 : 992-998.
27) 松尾亮輔,Ho Tu Bao. 重症度を考慮した医学単語重み付け手法による死亡予測.第30回人工知能学会全国大会 2016 ; 4D1-4in2.
10) Jing L, Zhou L, Ng MK, Huang JZ. Ontology-based distance measure for text clustering. In : Proceedings of SIAM SDM workshop on text mining 2006.
20) Gruber TR. A translation approach to portable ontology specifications. Knowledge Acquisition 1993 ; 5, 2 : 199-220.
22) Aronson AR. Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program. In: Proceedings of the AMIA Symposium 2001 ; 17-21.
25) Kramers, PG. The ECHI project: health indicators for the European Community. The European Journal of Public Health 2003 ; 13 : 101-106.
19) World Health Organization. International statistical classification of diseases and related health problems. 2004 ; 1.
8) Sureka V, Punitha S. Approaches to ontology based algorithms for clustering text documents. International Journal of Computer Technology and Applications 2012 ; 3, 5 : 1813-1817.
18) The National Library of Medicine (NLM). Medical Subject Headings (MeSH). [https://www.nlm.nih.gov/mesh/(cited 2017-Feb-24)].
3) Zhang X, Jing L, Hu X, Ng M, Zhou X. A comparative study of ontology based term similarity measures on PubMed document clustering. International Conference on Database Systems for Advanced Applications 2007 ; 115-126.
17) Bodenreider O. The Unified Medical Language System (UMLS) : integrating biomedical terminology. Nucleic Acids Research 2004 ; 32(suppl 1) : D267-D270.
6) Sakre MM, Kouta MM, Allam AM. Weighting query terms using wordnet ontology. International Journal of Computer Science and Network Security 2009 ; 9, 4 : 349-358.
7) Tar HH, Nyunt TTS. Ontology-based concept weighting for text documents. World Academy of Science, Engineering and Technology 2011 ; 57 : 249-253.
9) Varelas G, Voutsakis E, Raftopoulou P, Petrakis EG, Milios EE. Semantic similarity methods in wordNet and their application to information retrieval on the web. In : Proceedings of the 7th Annual ACM International Workshop on Web Information and Data Management 2005 ; 10-16.
23) Noy NF, Shah NH, Whetzel PL, Dai B, Dorf M, Griffith N, et al. BioPortal: ontologies and integrated data resources at the click of a mouse. Nucleic Acids Research 2009 ; 37(suppl 2) : W170-W173.
14) Luo Q, Chen E, Xiong H. A semantic term weighting scheme for text categorization. Expert Systems with Applications 2011 ; 38, 10 : 12708-12716.
29) Mikolov T, Sutskever I, Chen K, Corrado G, Dean J. Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems 2013 ; 3111-3119.
12) Ko Y. A new term-weighting scheme for text classification using the odds of positive and negative class probabilities. Journal of the Association for Information Science and Technology 2015 ; 66, 12 : 2553-2565.
15) Yu H, Cao YG. Using the weighted keyword models to improve information retrieval for answering biomedical questions. In : Summit on Translational Bioinformatics 2009 ; 143-147.
11) Lan M, Tan CL, Low HB. Proposing a new term weighting scheme for text categorization. In : AAAI 2006 ; 6 : 763-768.
16) Zhu W, Xu X, Hu X, Song IY, Allen RB. Using UMLS-based re-weighting terms as a query expansion strategy. In : IEEE International Conference on Granular Computing 2006 ; 217-222.
21) 溝口理一郎.オントロジー工学.オーム社,2005.
24) Murphy SL, Xu J, Kochanek KD. Deaths : final data for 2010. NVSR 2013 ; 61, 4 : 1-118.
26) Saeed M, Villarroel M, Reisner AT, Clifford G, Lehman LW, Moody G, et al. Multiparameter Intelligent Monitoring in Intensive Care Ⅱ(MIMIC-Ⅱ) : A public-access intensive care unit database. Critical Care Medicine 2011 ; 39, 5 : 952-960.
References_xml
SSID ssib000751040
ssib005879655
ssib031782423
ssib007482534
ssj0002505442
ssib000994792
Score 2.1623342
Snippet In this paper, we consider a methodology of the object-oriented term weighting, by using a hierarchical structure of terms in medical documents according to...
SourceID jstage
SourceType Publisher
StartPage 69
SubjectTerms Artificial intelligence
ICD
Knowledge bases
Medical informatics applications
Natural language processing
Title A Consideration of a Methodology of the Object-oriented Term Weighting Using Hierarchical Term Classification for Medical Document Analysis
URI https://www.jstage.jst.go.jp/article/jami/38/2/38_69/_article/-char/en
Volume 38
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
ispartofPNX Japan Journal of Medical Informatics, 2018/06/15, Vol.38(2), pp.69-79
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3NbtQwELaWcuGCqADxW_nAbZUljR3HOS5V0Qq0IKQgeoscx5a6RRsE2QuvwOPxQszYTuIUDqUXa2XZUZz51vN5PD-EvALGbxnLZNLKgiW8NTJpuEbPHIPXULzhLn3x9oPYfObvLvKLxeJ35LV06JuV_vnPuJLbSBX6QK4YJfsfkh0fCh3wG-QLLUgY2hvJeD3W2xyJn1puXU1on1opOAB8bNDaknSY0xgZZgXb8fKLM4qipcC7DWwuMRbZlUb56ke4gpnoSjQ5JA73OqCbDs6NYEhqMiO5oID3y4jqDrNC7FMfedhvVf_j0M08F6s3Th92I2qvTKtmhttK7dWVmoy0wWpxKtG7ysdteo3iXiRC4GwN19_Gb4dwNAR96pP6rozrA34iE2BQZbyfMxnhNos25zDMq3lfwuYvBYLZcnzlgssVk6thyiwjd5B3jYNqJusMG1HWQz_GywE875C7Gex-uO2-_yRjjgZH4IijlSUvoqvtXBZlHCxccDjETzn8gO_JkQMj3UAOy12RqPELhThUXMvraSXAsXZw4hi8FR2Bqh6Q-wEOdO1f_5gsduoh-bWmMwjTzlJFIwhjB0CYXoMwRYDSEcLUQZjGEPYj5hCmIHAaxE8HCNMBwo9I9fa8OtskoT5IspOMJ22hpW6UhCMys8wILQ0XNrMis2ULRDq1WH1Sc5G3rTUMaw1YljXCZDCmTS17TI723d48IbTU8KCsyVWuFU9lo9JSSnGqU24LVQr7lAj_6epvPgdMfUMMPLvtxOfk3vS3eUGO-u8H8xLob9-cODj9AUy1tWs
link.rule.ids 315,783,787,27936,27937
linkProvider Colorado Alliance of Research Libraries
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=A+Consideration+of+a+Methodology+of+the+Object-oriented+Term+Weighting+Using+Hierarchical+Term+Classification+for+Medical+Document+Analysis&rft.jtitle=Japan+Journal+of+Medical+Informatics&rft.au=Matsuo%2C+R&rft.au=TB%2C+Ho&rft.au=Ikeda%2C+M&rft.au=Tanaka%2C+K&rft.date=2018-06-15&rft.pub=Japan+Association+for+Medical+Informatics&rft.issn=0289-8055&rft.eissn=2188-8469&rft.volume=38&rft.issue=2&rft.spage=69&rft.epage=79&rft_id=info:doi/10.14948%2Fjami.38.69&rft.externalDocID=article_jami_38_2_38_69_article_char_en
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0289-8055&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0289-8055&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0289-8055&client=summon