Data clustering is an important data analysis and data mining technique, used in a variety of disciplines such as machine learning, pattern recognition, and bioinformatics. K-Means is a popular clustering algorithm, but it depends on its initial centroid locations, which can trap the algorithm in local optima. Bio-inspired algorithms are powerful at searching for globally optimal solutions. In this paper, four recent bio-inspired algorithms (Crow Search, Whale Optimization, Grasshopper Optimization, and Salp Swarm) are each integrated into the K-Means algorithm to overcome this drawback. The proposed techniques are implemented and applied to eight numerical UCI datasets. Experimental results show that the proposed algorithms can find initial centroid locations that achieve better clustering integrity. Moreover, the results show that the integration of K-Means with the Crow Search algorithm is superior to its integration with the other bio-inspired algorithms.
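The dependence on initial centroids can be illustrated with a small sketch. It is not the paper's implementation: the helper names (`sq_dist`, `sse`, `kmeans`), the toy data, and the two hand-picked initializations are assumptions made for illustration, and the Sum of Squared Errors (SSE) is assumed to be the fitness that a bio-inspired optimizer would minimize when proposing initial centroids.

```python
from typing import List, Sequence, Tuple

Point = Sequence[float]


def sq_dist(a: Point, b: Point) -> float:
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def sse(points: List[Point], centroids: List[Point]) -> float:
    """Sum of squared distances from each point to its nearest centroid."""
    return sum(min(sq_dist(p, c) for c in centroids) for p in points)


def kmeans(points: List[Point], init_centroids: List[Point],
           max_iter: int = 100) -> Tuple[List[List[float]], float]:
    """Standard K-Means iterations started from the given centroids."""
    centroids = [list(c) for c in init_centroids]
    for _ in range(max_iter):
        # Assignment step: each point joins its nearest centroid.
        clusters: List[List[Point]] = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)),
                          key=lambda j: sq_dist(p, centroids[j]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its members.
        # An empty cluster keeps its previous centroid (an assumption).
        new_centroids = [
            [sum(dim) / len(members) for dim in zip(*members)]
            if members else centroids[j]
            for j, members in enumerate(clusters)
        ]
        if new_centroids == centroids:
            break
        centroids = new_centroids
    return centroids, sse(points, centroids)


# Toy data (hypothetical): three well-separated groups of four points.
data = [(0, 0), (0, 1), (1, 0), (1, 1),
        (10, 0), (10, 1), (11, 0), (11, 1),
        (5, 10), (5, 11), (6, 10), (6, 11)]

# One initial centroid per group versus two centroids in the same group.
_, good_sse = kmeans(data, [(0, 0), (10, 0), (5, 10)])
_, bad_sse = kmeans(data, [(0, 0), (1, 1), (8, 5)])
print(good_sse, bad_sse)
```

On this toy data the first initialization recovers the three natural groups, while the second converges to a poorer solution with a much larger SSE. Under the assumption stated above, a metaheuristic such as Crow Search would treat each candidate set of initial centroids as one search agent and score it with the SSE returned by `kmeans`.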
Published In: IJCSN Journal, Volume 7, Issue 6
Date of Publication: December 2018
Pages: 361-373
Figures: 12
Tables: 14
Doaa Abdullah Abdel-Mohsen :
is a teaching assistant in the Computer Science Department, Faculty of Computers and Information, Helwan University, Cairo, Egypt. She holds a bachelor's degree in Computer Science with honors.
Dr. Hala Abdel-Galil :
is an associate professor of Computer Science and head of the Computer Science Department, Faculty of Computers and Information, Helwan University, Cairo, Egypt. Her skills and expertise include Image Processing, Pattern Recognition, Classification, Neural Networks and Artificial Intelligence, Computational Intelligence, Pattern Classification, Applied Artificial Intelligence, and Machine Intelligence.
Dr. Ensaf Hussein Mohamed :
received her Ph.D. in Computer Science from the Faculty of Computers and Information, Helwan University, Cairo, Egypt, in 2013. Her recent research focuses on Natural Language Processing, Text Mining, and Machine Learning. She is currently an assistant professor at the Faculty of Computers and Information, Helwan University, Cairo, Egypt.
Crow Search Algorithm, Whale Optimization Algorithm, Salp Swarm Algorithm, Grasshopper Optimization Algorithm, K-Means Clustering Algorithm, Sum of Squared Errors (SSE).
In this paper, we presented an integration of the K-Means algorithm with each of four recent bio-inspired algorithms, to overcome the K-Means drawback of falling into local optima and to maximize cluster integrity. The C-Crow search algorithm, C-Salp algorithm, C-Whale search algorithm, and C-Grasshopper optimization algorithm are proposed and validated over eight datasets. Six evaluation criteria are adopted in this study: the best, the worst, and the mean fitness value, the mean rank, the standard deviation (SD), and the accuracy. The experimental results show that the proposed algorithms outperform the standard K-Means algorithm in terms of the best, the worst, and the mean fitness value.
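As a rough sketch of how several of these criteria could be computed, the following assumes that each algorithm is run several times per dataset and that the fitness is an SSE-like value where lower is better. The function names (`summarize`, `mean_rank`), the use of the sample standard deviation, and the tie-free ranking are assumptions, and the numbers are arbitrary placeholders, not results from the paper.

```python
from statistics import mean, stdev
from typing import Dict, List


def summarize(run_fitness: List[float]) -> Dict[str, float]:
    """Best, worst, mean, and SD of the fitness over independent runs.

    Lower fitness is assumed to be better, so "best" is the minimum.
    """
    return {
        "best": min(run_fitness),
        "worst": max(run_fitness),
        "mean": mean(run_fitness),
        "sd": stdev(run_fitness),  # sample SD; the paper's choice is not stated
    }


def mean_rank(scores: Dict[str, Dict[str, float]]) -> Dict[str, float]:
    """Average rank of each algorithm across datasets.

    scores[dataset][algorithm] holds a fitness value; rank 1 is the lowest.
    Ties are not handled specially in this sketch.
    """
    ranks: Dict[str, List[int]] = {}
    for per_algorithm in scores.values():
        ordered = sorted(per_algorithm, key=per_algorithm.get)
        for position, algorithm in enumerate(ordered, start=1):
            ranks.setdefault(algorithm, []).append(position)
    return {algorithm: mean(r) for algorithm, r in ranks.items()}


# Placeholder values only (hypothetical, not taken from the paper).
runs = [6.0, 6.0, 255.0, 6.0]
stats = summarize(runs)

table = {
    "dataset_1": {"algo_x": 1.0, "algo_y": 2.0},
    "dataset_2": {"algo_x": 3.0, "algo_y": 2.5},
}
average_ranks = mean_rank(table)
print(stats, average_ranks)
```

Accuracy is omitted here because it requires ground-truth class labels and a mapping from clusters to classes, which the text above does not specify.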