Association Rule for Classification of Type-2 Diabetic Patients

Bibliographic Details
Published in 2010 Second International Conference on Machine Learning and Computing, pp. 330-334
Main Authors Patil, B M, Joshi, R C, Toshniwal, Durga
Format Conference Proceeding
Language English
Published IEEE 01.02.2010
Summary: Discovering knowledge in medical databases is important for effective medical diagnosis. The aim of data mining is to extract information from a database and generate clear, understandable descriptions of patterns. In this study we introduce a new approach to generating association rules on numeric data. We propose a modified equal-width binning interval approach to discretize continuous-valued attributes. The approximate width of the desired intervals is chosen based on the opinion of a medical expert and is provided as an input parameter to the model. First, we convert numeric attributes into categorical form using this technique. The Apriori algorithm, which is usually used for market basket analysis, is then used to generate rules on the Pima Indian diabetes data. The data set was taken from the UCI Machine Learning Repository and contains 768 instances and 8 numeric attributes. We find that the often-neglected pre-processing steps in knowledge discovery are the most critical elements in determining the success of a data mining application. Lastly, we generate association rules that are useful for identifying general associations in the data and for understanding the relationship between the measured fields and whether the patient goes on to develop diabetes. We present a step-by-step approach to help doctors explore their data and better understand the discovered rules.
ISBN: 1424460069, 9781424460069
DOI:10.1109/ICMLC.2010.67
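
The pipeline described in the summary (discretize numeric attributes into fixed-width intervals, then mine class association rules with Apriori) can be illustrated with a minimal Python sketch. This is not the authors' implementation: it uses plain equal-width binning with a caller-supplied width rather than the paper's modified variant, and the function names, the bin origins and widths, the thresholds, and the six toy records are all assumptions made up for illustration. They are not taken from the Pima Indian diabetes data.

```python
from itertools import combinations

def bin_label(name, value, low, width):
    # Equal-width binning: intervals [low + k*width, low + (k+1)*width).
    # `width` stands in for the expert-chosen interval width.
    k = int((value - low) // width)
    start = low + k * width
    return f"{name}=[{start:g},{start + width:g})"

def to_transactions(rows, bins):
    # Turn (numeric attributes, class label) pairs into sets of categorical items.
    transactions = []
    for attrs, label in rows:
        items = {bin_label(n, v, *bins[n]) for n, v in attrs.items()}
        items.add(f"class={label}")
        transactions.append(frozenset(items))
    return transactions

def apriori(transactions, min_support):
    # Level-wise search: keep frequent k-itemsets, join them into (k+1)-candidates.
    n = len(transactions)
    level = {frozenset([i]) for t in transactions for i in t}
    frequent, k = {}, 1
    while level:
        current = {}
        for cand in level:
            support = sum(cand <= t for t in transactions) / n
            if support >= min_support:
                current[cand] = support
        frequent.update(current)
        k += 1
        # Join step, then prune candidates that have an infrequent subset.
        joined = {a | b for a, b in combinations(list(current), 2) if len(a | b) == k}
        level = {c for c in joined
                 if all(frozenset(s) in current for s in combinations(c, k - 1))}
    return frequent

def class_rules(frequent, min_confidence):
    # Keep rules of the form {binned attributes} -> class label.
    rules = []
    for itemset, support in frequent.items():
        classes = [i for i in itemset if i.startswith("class=")]
        if len(classes) == 1 and len(itemset) > 1:
            antecedent = itemset - set(classes)
            confidence = support / frequent[antecedent]
            if confidence >= min_confidence:
                rules.append((sorted(antecedent), classes[0], support, confidence))
    return rules

# Hypothetical toy records (invented values, not the real data set).
rows = [
    ({"glucose": 150, "bmi": 34}, "positive"),
    ({"glucose": 160, "bmi": 36}, "positive"),
    ({"glucose": 155, "bmi": 27}, "positive"),
    ({"glucose": 95, "bmi": 24}, "negative"),
    ({"glucose": 90, "bmi": 22}, "negative"),
    ({"glucose": 145, "bmi": 23}, "negative"),
]
# Assumed (origin, width) per attribute; in the paper the width comes from a medical expert.
bins = {"glucose": (0, 40), "bmi": (0, 10)}

transactions = to_transactions(rows, bins)
rules = class_rules(apriori(transactions, min_support=0.3), min_confidence=0.9)
for antecedent, label, support, confidence in rules:
    print(antecedent, "->", label, f"support={support:.2f}", f"confidence={confidence:.2f}")
```

Restricting the consequent to the class item is one simple way to obtain rules usable for classification. The choice of interval width directly changes which items become frequent, which is consistent with the summary's point that pre-processing largely determines the outcome.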