Nowadays, Diabetes has become a constant chronic disease affecting the mankind. Various causes such as
bacterial or viral infection, toxic or chemical contents mix with the food, auto immune reaction, obesity, bad diet, change in
lifestyles, eating habit, environment pollution, etc. are responsible in increasing number of victims suffering from Diabetes.
Hence, It would be very helpful in predicting this disease at early stage and diagnosing the disease effectively. In health
care, this process is carried out using machine learning algorithms to analyze medical data to build to carry out medical
diagnoses. Diabetes Mellitus or Diabetes is a serious chronic disease which results in increase of blood sugar. It has always
been tedious to identify diabetes, but with emergence of machine learning the identification process has become simpler.
Three machine learning algorithms namely SVM, Decision Tree and Naive Bayes are used to detect Diabetes in earlier
stages. Algorithms are experimented and evaluated on measures like precision, Accuracy-measure and Recall. The results
obtained show Naive Bayes performs better with 76.30% compared to other algorithms. These results are verified using
Receiver Operating Characteristic (ROC) curves in a proper and systematic manner.
Published In:IJCSN Journal Volume 8, Issue 4
Date of Publication : August 2019
Pages : 387-397
Figures :14
Tables : --
Ruchita Gudipati :
currently studying 4th year of engineering in
Computer Science from GITAM University, Hyderabad. I have an
inclination towards the field of research and towards contributing
my part of knowledge especially to the domain of Machine
Learning.
Muvva Vennela Sai :
currently in 4th year pursuing engineering in
Computer Science from GITAM University, Hyderabad. Apart from
my various interests in this field of study, the subject of Machine
Learning has inspired me to research further.
K Radha :
completed her BTECH,MTECH at JNTUH. Pursuing
PhD in KL University,Guntur. She has 12 years of Teaching
Experience and 3 Years of Research Experience. She has applied
for DST Research Projects. She has published 25 papers in
International Journals, SCOPUS journals and Springer and IEEE
Conferences. Her Research Interests are Cloud Computing, Big
Data Analytics, Machine Learning, Deep Learning, and Artificial
Intelligence.
Diabetes has become a chronic disease claiming many
lives hence, detection of it in early stages is vital. In the
conducted study, initiatives are taken to predict diabetes.
Machine learning classification algorithms were used and
Naive Bayes was found to outperform the remaining two
classification algorithms with accuracy over 76.30%.
Results were obtained from Pima Indians Diabetes
Database and classification algorithms. The above results
indicate a promising results and the scope of fiels like
Machine learning and Data mining. In near future,
machine learning classification algorithms can be used to
predict and diagnose other diseases. Automation can be
used to perform better diabetes analysis.
[1]
Kumar,D.A.,Govindasamy,R.,2015.PerformanceandEv
aluationofClassificationDataMiningTechniquesinDiabe
tes.InternationalJournalofComputerScienceandInformat
ionTechnologies,6,1312-1319.
[2] Iyer, A., S, J., Sumbaly, R., 2015. Diagnosis of
Diabetes Using Classification Mining Techniques.
International Journal of Data Mining & Knowledge Management Process 5, 1-14.
doi:10.5121/ijdkp.2015.5101, arXiv:1502.03774.
[3] Fatima, M., Pasha, M., 2017. Survey of Machine
Learning Algorithms for Disease Diagnostic. Journal of
Intelligent Learning Systems and Applications 09, 1-
16. doi:10.4236/jilsa.2017.91001.
[4]
Orabi,K.M.,Kamal,Y.M.,Rabah,T.M.,2016.EarlyPredic
tiveSystemforDiabetesMellitusDisease,in:IndustrialCo
nferenceonDataMining,Springer.Springer.pp.420-427.
[5] Aishwarya, R., Gayathri, P., Jaisankar, N., 2013. A
Method for Classification Using Machine Learning
Technique for Diabetes. International Journal of
Engineering and Technology (IJET) 5, 2903-2908.
[6] Dhomse Kanchan B., M.K.M., 2016. Study of Machine
Learning Algorithms for Special Disease Prediction
using Principal of Component Analysis, in: 2016
International Conference on Global Trends in Signal
Processing, Information Computing and
Communication, IEEE. pp. 5-10.
[7] Symposium on Data Mining Applications,
SDMA2016, 30 March 2016, Riyadh, Saudi Arabia,
"Performance Analysis of Data Mining Classification
Techniques to Predict Diabetes", Procedia Computer
Science 82 ( 2016 ) 115 - 121.
[8] Harleen,Dr. Pankaj Bhambri "A Prediction Technique
in Data Mining for Diabetes Mellitus", Journal of
Management Sciences and Technology, 4 (1), October
- 2016 ISSN -2347-5005.
[9] Misra, B.B. G. (2007). "Simplified Polynomial Neural
Network for classification task in data mining".
International Conf. on Evolutionary Computation,
2007, pp 721 - 728.
[10]
Perveen,S.,Shahbaz,M.,Guergachi,A.,Keshavjee,K.,20
16.PerformanceAnalysisofDataMiningClassificationTe
chniquestoPredictDiabetes.ProcediaComputerScience8
2,115-121.doi:10.1016/j.procs.2016.04.016.Issue 1, pp
10, 2010.
[11]
NaiArun,N.,Sittidech,P.,2014.EnsembleLearningModel
forDiabetesClassification.AdvancedMaterialsResearch
931-932,1427-
1431.doi:10.4028/www.scientific.net/AMR.931-
932.1427.
[12] Bamnote, M.P., G.R., 2014. Design of Classifier for
Detection of Diabetes Mellitus Using Genetic
Programming. Advances in Intelligent Systems and
Computing 1, 763-770. doi:10.1007/978-3-319-11933-
5.
[13]
Priyam,A.,Gupta,R.,Rathee,A.,Srivastava,S.,2013.Com
parativeAnalysisofDecisionTreeClassificationAlgorith
ms.InternationalJournalofCurrentEngineeringandTechn
ologyVol.3,334-337.doi:JUNE2013,arXiv:ISSN2277-
4106.
[14] Esposito, F., Malerba, D., Semeraro, G., Kay, J., 1997.
A comparative analysis of methods for pruning
decision trees. IEEE Transactions on Pattern Analysis
and Machine Intelligence 19, 476-491.
doi:10.1109/34.589207.
[15] Bamnote, M.P., G.R., 2014. Design of Classifier for
Detection of Diabetes Mellitus Using Genetic
Programming. Advances in Intelligent Systems and
Computing 1, 763-770. doi:10.1007/978-3-319-11933-
5.
[16]
Sharief,A.A.,Sheta,A.,2014.DevelopingaMathematical
ModeltoDetectDiabetesUsingMultigeneGeneticProgra
mming.InternationalJournalofAdvancedResearchinArti
ficialIntelligence(IJARAI)3,54-
59.doi:doi:10.14569/IJARAI.2014.031007.
[17]
Pradhan,P.M.A.,Bamnote,G.R.,Tribhuvan,V.,Jadhav,K
.,Chabukswar,V.,Dhobale,V.,2012.AGeneticProgramm
ingApproachforDetectionofDiabetes.InternationalJourn
alOfComputationalEngineeringResearch2,91-94.
[18]
TarikA.Rashid,S.M.A.,Abdullah,R.M.,Abstract,2016.A
nIntelligentApproachforDiabetesClassification,Predicti
onandDescription.AdvancesinIntelligentSystemsandCo
mputing424,323-335.doi:10.1007/978-3-319-28031-8
[19] Han, J., Rodriguez, J.C., Beheshti, M., 2008.
Discovering decision tree based diabetes prediction
model, in: International Conference on Advanced
Software Engineering and Its Applications, Springer.
pp. 99-109.
[20]
NaiArun,N.,Moungmai,R.,2015.ComparisonofClassifie
rsfortheRiskofDiabetesPrediction.ProcediaComputerSc
ience69,132-142.doi:10.1016/j.procs.2015.10.014.
[21]
Sisodia,D.,Singh,L.,Sisodia,S.,2014.FastandAccurateF
aceRecognitionUsingSVMandDCT,in:Proceedingsofth
eSecondInternationalConferenceonSoftComputingforP
roblemSolving(SocProS2012),December28-
30,2012,Springer.pp.1027-1038.