A Consideration of a Methodology of the Object-oriented Term Weighting Using Hierarchical Term Classification for Medical Document Analysis
In this paper, we consider a methodology of the object-oriented term weighting, by using a hierarchical structure of terms in medical documents according to analytical purposes. The hierarchical term classification exploits logical negation and medical information of ranking corresponding to ICD-10...
Saved in:
Published in | Japan Journal of Medical Informatics Vol. 38; no. 2; pp. 69 - 79 |
---|---|
Main Authors | , , , , |
Format | Journal Article |
Language | Japanese |
Published |
Japan Association for Medical Informatics
15.06.2018
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | In this paper, we consider a methodology of the object-oriented term weighting, by using a hierarchical structure of terms in medical documents according to analytical purposes. The hierarchical term classification exploits logical negation and medical information of ranking corresponding to ICD-10 codes and consists of the category of terms as the nodes. It is employed to generate weighting rules for the object-oriented term weighting and we capture the order relation among the categories by giving the weights based on analytical purposes to the categories. Specifically, we generate three weighting rules from two features of the hierarchical term classification: the term hierarchy and the exploitation of medical information of ranking, and give higher weight to terms where it is located at the deep layer, non-negative terms’ categories and the higher rank in the hierarchy as important terms for a certain analytical purpose. The experimental results on mortality prediction which is one of the analytical purposes have indicated the effectiveness of the object-oriented term weighting. Therefore, it was suggested that the proposed methodology of the object-oriented term weighting is effective. Although, we regard the terms which correspond to ICD-10 as the dependent and important terms for analytical purposes, we considered the exploitation of machine learning techniques to capture the similar dependencies regarding analytical purposes for the terms which do not correspond to ICD-10. The proposed methodology and the order relation among terms’ categories derived from the weighting rules, a dictionary of terms and the weights have potential to contribute to the enhancement of knowledge acquisition support by big data analysis in medical domain. |
---|---|
AbstractList | In this paper, we consider a methodology of the object-oriented term weighting, by using a hierarchical structure of terms in medical documents according to analytical purposes. The hierarchical term classification exploits logical negation and medical information of ranking corresponding to ICD-10 codes and consists of the category of terms as the nodes. It is employed to generate weighting rules for the object-oriented term weighting and we capture the order relation among the categories by giving the weights based on analytical purposes to the categories. Specifically, we generate three weighting rules from two features of the hierarchical term classification: the term hierarchy and the exploitation of medical information of ranking, and give higher weight to terms where it is located at the deep layer, non-negative terms’ categories and the higher rank in the hierarchy as important terms for a certain analytical purpose. The experimental results on mortality prediction which is one of the analytical purposes have indicated the effectiveness of the object-oriented term weighting. Therefore, it was suggested that the proposed methodology of the object-oriented term weighting is effective. Although, we regard the terms which correspond to ICD-10 as the dependent and important terms for analytical purposes, we considered the exploitation of machine learning techniques to capture the similar dependencies regarding analytical purposes for the terms which do not correspond to ICD-10. The proposed methodology and the order relation among terms’ categories derived from the weighting rules, a dictionary of terms and the weights have potential to contribute to the enhancement of knowledge acquisition support by big data analysis in medical domain. |
Author | Matsuo, R Tanaka, K TB, Ho Ikeda, M Chen, W |
Author_xml | – sequence: 1 fullname: Matsuo, R organization: Research Center for Service Science, Japan Advanced Institute of Science and Technology – sequence: 2 fullname: TB, Ho organization: School of Knowledge Science, Japan Advanced Institute of Science and Technology – sequence: 3 fullname: Ikeda, M organization: Research Center for Service Science, Japan Advanced Institute of Science and Technology – sequence: 4 fullname: Tanaka, K organization: School of Knowledge Science, Japan Advanced Institute of Science and Technology – sequence: 5 fullname: Chen, W organization: School of Knowledge Science, Japan Advanced Institute of Science and Technology |
BookMark | eNo9kE1OwzAQRi1UJErpigv4AilOnLjOgkVVfopU1E0Ry8ixx4mjxEa2WfQMXJqkRWy-kd58eiPNLZpZZwGh-5Ss0rzM-UMnBrOifMXKKzTPUs4TnrNyhuYk42XCSVHcoGUIpiaErIuU5GSOfjZ462wwCryIxlnsNBb4HWLrlOtdc5pAbAEf6g5kTJw3YCMofAQ_4E8wTRuNbfBHmHJnRo2XrZGivzS2vRgv6hGc7dr5Ua7O-ycnv4dRhjdW9Kdgwh261qIPsPybC3R8eT5ud8n-8Pq23eyTjtM8UWvJZS14mnKqKTDJIWc60yzTpWKMEp3SIpU5K5TSQElWck2zmkE2dhTRdIEeL9ouRNFA9eXNIPypEj4a2UM1vbGivMqmYOU_l63wFVj6C4Rnc5g |
ContentType | Journal Article |
Copyright | 2018 Japan Association for Medical Informatics |
Copyright_xml | – notice: 2018 Japan Association for Medical Informatics |
DOI | 10.14948/jami.38.69 |
DatabaseTitleList | |
DeliveryMethod | fulltext_linktorsrc |
EISSN | 2188-8469 |
EndPage | 79 |
ExternalDocumentID | article_jami_38_2_38_69_article_char_en |
GroupedDBID | ALMA_UNASSIGNED_HOLDINGS JSF KQ8 OK1 RJT |
ID | FETCH-LOGICAL-j834-d7c8cba81183f3e6c8e46f2f62f9d6630f1351c465ddfe30298f32b6e22f6d0f3 |
ISSN | 0289-8055 |
IngestDate | Wed Apr 05 07:11:52 EDT 2023 |
IsDoiOpenAccess | true |
IsOpenAccess | true |
IsPeerReviewed | false |
IsScholarly | true |
Issue | 2 |
Language | Japanese |
LinkModel | OpenURL |
MergedId | FETCHMERGED-LOGICAL-j834-d7c8cba81183f3e6c8e46f2f62f9d6630f1351c465ddfe30298f32b6e22f6d0f3 |
OpenAccessLink | https://www.jstage.jst.go.jp/article/jami/38/2/38_69/_article/-char/en |
PageCount | 11 |
ParticipantIDs | jstage_primary_article_jami_38_2_38_69_article_char_en |
PublicationCentury | 2000 |
PublicationDate | 2018/06/15 |
PublicationDateYYYYMMDD | 2018-06-15 |
PublicationDate_xml | – month: 06 year: 2018 text: 2018/06/15 day: 15 |
PublicationDecade | 2010 |
PublicationTitle | Japan Journal of Medical Informatics |
PublicationTitleAlternate | Japan Journal of Medical Informatics |
PublicationYear | 2018 |
Publisher | Japan Association for Medical Informatics |
Publisher_xml | – name: Japan Association for Medical Informatics |
References | 1) Salton G, Buckley C. Term-weighting approaches in automatic text retrieval. Information Processing & Management 1988 ; 24, 5 : 513-523. 13) Martineau J, Finin T. Delta TFIDF : An improved feature space for sentiment analysis. In Proceedings of the Third AAAI International Conference on Weblogs and Social Media 2009. 28) Pedregosa, F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, et al. Scikit-learn: Machine learning in Python. Journal of Machine Learning Research 2011 ; 12(Oct) : 2825-2830. 2) Ramos J. Using tf-idf to determine word relevance in document queries. Technical report, Department of Computer Science, Rutgers University 2003. 4) Zhang X, Jing L, Hu X, Ng M, Jiangxi JX, Zhou X. Medical document clustering using ontology-based term similarity measures. International Journal of Data Warehousing and Mining (IJDWM) 2008 ; 4, 1 : 62-73. 5) Zakos J, Verma B. Concept-based term weighting for web information retrieval. International Journal of Computational Intelligence and Applications 2006 ; 6, 2 : 193-207. 30) Garla VN, Brandt C. Ontology-guided feature engineering for clinical text classification. J Biomed Inform 2012 Oct ; 45, 5 : 992-998. 27) 松尾亮輔,Ho Tu Bao. 重症度を考慮した医学単語重み付け手法による死亡予測.第30回人工知能学会全国大会 2016 ; 4D1-4in2. 10) Jing L, Zhou L, Ng MK, Huang JZ. Ontology-based distance measure for text clustering. In : Proceedings of SIAM SDM workshop on text mining 2006. 20) Gruber TR. A translation approach to portable ontology specifications. Knowledge Acquisition 1993 ; 5, 2 : 199-220. 22) Aronson AR. Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program. In: Proceedings of the AMIA Symposium 2001 ; 17-21. 25) Kramers, PG. The ECHI project: health indicators for the European Community. The European Journal of Public Health 2003 ; 13 : 101-106. 19) World Health Organization. International statistical classification of diseases and related health problems. 2004 ; 1. 8) Sureka V, Punitha S. Approaches to ontology based algorithms for clustering text documents. International Journal of Computer Technology and Applications 2012 ; 3, 5 : 1813-1817. 18) The National Library of Medicine (NLM). Medical Subject Headings (MeSH). [https://www.nlm.nih.gov/mesh/(cited 2017-Feb-24)]. 3) Zhang X, Jing L, Hu X, Ng M, Zhou X. A comparative study of ontology based term similarity measures on PubMed document clustering. International Conference on Database Systems for Advanced Applications 2007 ; 115-126. 17) Bodenreider O. The Unified Medical Language System (UMLS) : integrating biomedical terminology. Nucleic Acids Research 2004 ; 32(suppl 1) : D267-D270. 6) Sakre MM, Kouta MM, Allam AM. Weighting query terms using wordnet ontology. International Journal of Computer Science and Network Security 2009 ; 9, 4 : 349-358. 7) Tar HH, Nyunt TTS. Ontology-based concept weighting for text documents. World Academy of Science, Engineering and Technology 2011 ; 57 : 249-253. 9) Varelas G, Voutsakis E, Raftopoulou P, Petrakis EG, Milios EE. Semantic similarity methods in wordNet and their application to information retrieval on the web. In : Proceedings of the 7th Annual ACM International Workshop on Web Information and Data Management 2005 ; 10-16. 23) Noy NF, Shah NH, Whetzel PL, Dai B, Dorf M, Griffith N, et al. BioPortal: ontologies and integrated data resources at the click of a mouse. Nucleic Acids Research 2009 ; 37(suppl 2) : W170-W173. 14) Luo Q, Chen E, Xiong H. A semantic term weighting scheme for text categorization. Expert Systems with Applications 2011 ; 38, 10 : 12708-12716. 29) Mikolov T, Sutskever I, Chen K, Corrado G, Dean J. Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems 2013 ; 3111-3119. 12) Ko Y. A new term-weighting scheme for text classification using the odds of positive and negative class probabilities. Journal of the Association for Information Science and Technology 2015 ; 66, 12 : 2553-2565. 15) Yu H, Cao YG. Using the weighted keyword models to improve information retrieval for answering biomedical questions. In : Summit on Translational Bioinformatics 2009 ; 143-147. 11) Lan M, Tan CL, Low HB. Proposing a new term weighting scheme for text categorization. In : AAAI 2006 ; 6 : 763-768. 16) Zhu W, Xu X, Hu X, Song IY, Allen RB. Using UMLS-based re-weighting terms as a query expansion strategy. In : IEEE International Conference on Granular Computing 2006 ; 217-222. 21) 溝口理一郎.オントロジー工学.オーム社,2005. 24) Murphy SL, Xu J, Kochanek KD. Deaths : final data for 2010. NVSR 2013 ; 61, 4 : 1-118. 26) Saeed M, Villarroel M, Reisner AT, Clifford G, Lehman LW, Moody G, et al. Multiparameter Intelligent Monitoring in Intensive Care Ⅱ(MIMIC-Ⅱ) : A public-access intensive care unit database. Critical Care Medicine 2011 ; 39, 5 : 952-960. |
References_xml | |
SSID | ssib000751040 ssib005879655 ssib031782423 ssib007482534 ssj0002505442 ssib000994792 |
Score | 2.1623342 |
Snippet | In this paper, we consider a methodology of the object-oriented term weighting, by using a hierarchical structure of terms in medical documents according to... |
SourceID | jstage |
SourceType | Publisher |
StartPage | 69 |
SubjectTerms | Artificial intelligence ICD Knowledge bases Medical informatics applications Natural language processing |
Title | A Consideration of a Methodology of the Object-oriented Term Weighting Using Hierarchical Term Classification for Medical Document Analysis |
URI | https://www.jstage.jst.go.jp/article/jami/38/2/38_69/_article/-char/en |
Volume | 38 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
ispartofPNX | Japan Journal of Medical Informatics, 2018/06/15, Vol.38(2), pp.69-79 |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3NbtQwELaWcuGCqADxW_nAbZUljR3HOS5V0Qq0IKQgeoscx5a6RRsE2QuvwOPxQszYTuIUDqUXa2XZUZz51vN5PD-EvALGbxnLZNLKgiW8NTJpuEbPHIPXULzhLn3x9oPYfObvLvKLxeJ35LV06JuV_vnPuJLbSBX6QK4YJfsfkh0fCh3wG-QLLUgY2hvJeD3W2xyJn1puXU1on1opOAB8bNDaknSY0xgZZgXb8fKLM4qipcC7DWwuMRbZlUb56ke4gpnoSjQ5JA73OqCbDs6NYEhqMiO5oID3y4jqDrNC7FMfedhvVf_j0M08F6s3Th92I2qvTKtmhttK7dWVmoy0wWpxKtG7ysdteo3iXiRC4GwN19_Gb4dwNAR96pP6rozrA34iE2BQZbyfMxnhNos25zDMq3lfwuYvBYLZcnzlgssVk6thyiwjd5B3jYNqJusMG1HWQz_GywE875C7Gex-uO2-_yRjjgZH4IijlSUvoqvtXBZlHCxccDjETzn8gO_JkQMj3UAOy12RqPELhThUXMvraSXAsXZw4hi8FR2Bqh6Q-wEOdO1f_5gsduoh-bWmMwjTzlJFIwhjB0CYXoMwRYDSEcLUQZjGEPYj5hCmIHAaxE8HCNMBwo9I9fa8OtskoT5IspOMJ22hpW6UhCMys8wILQ0XNrMis2ULRDq1WH1Sc5G3rTUMaw1YljXCZDCmTS17TI723d48IbTU8KCsyVWuFU9lo9JSSnGqU24LVQr7lAj_6epvPgdMfUMMPLvtxOfk3vS3eUGO-u8H8xLob9-cODj9AUy1tWs |
link.rule.ids | 315,783,787,27936,27937 |
linkProvider | Colorado Alliance of Research Libraries |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=A+Consideration+of+a+Methodology+of+the+Object-oriented+Term+Weighting+Using+Hierarchical+Term+Classification+for+Medical+Document+Analysis&rft.jtitle=Japan+Journal+of+Medical+Informatics&rft.au=Matsuo%2C+R&rft.au=TB%2C+Ho&rft.au=Ikeda%2C+M&rft.au=Tanaka%2C+K&rft.date=2018-06-15&rft.pub=Japan+Association+for+Medical+Informatics&rft.issn=0289-8055&rft.eissn=2188-8469&rft.volume=38&rft.issue=2&rft.spage=69&rft.epage=79&rft_id=info:doi/10.14948%2Fjami.38.69&rft.externalDocID=article_jami_38_2_38_69_article_char_en |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0289-8055&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0289-8055&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0289-8055&client=summon |