Home
Call For Papers
Submission
Author
Registration
Publications
About
Contact Us

  Enhancing Speed of XML Data Mining on GPU  
  Authors : Deepak V. Mangela; Manoj A. Kale; Sandip M. Walunj
  Cite as:

 

We propose a system for data mining dynamic XML document using a GPU. As a GPU is capable of processing in parallel in real time, it promises on increasing the speed of data discovery process. Nvidia has provided its CUDA platform to utilize its graphics processor for implementing non-graphical algorithms on it. The WWW is structured over the Extensive Markup Language (XML) and hence provides access to the data structured over the WWW. The main objective is to find association rules by extracting items on XML nodes into the XML documents. The use of GPU comes in picture in the pre-processing phase and sorting phase when the item set is being shortened over the occurrences of the items, as this process takes a lot of time the use of a GPU promises a very significant decrease in the pre-processing time.

 

Published In : IJCSN Journal Volume 5, Issue 2

Date of Publication : April 2016

Pages : --

Figures :01

Tables : --

Publication Link : Enhancing Speed of XML Data Mining on GPU

 

 

 

Deepak V. Mangela : B.E Computer Department Sandip Institute of Technology and Research Center, SPPU University Nashik, India

Manoj A. Kale : B.E Computer Department Sandip Institute of Technology and Research Center, SPPU University Nashik, India

Sandip M. Walunj : B.E Computer Department Sandip Institute of Technology and Research Center, SPPU University Nashik, India

 

 

 

 

 

 

 

GPU, Data Mining, XML, CUDA, Parallel Processing

The findings of this paper suggests that the only use of graphics processors is not only just implementing graphical algorithms but their parallel processing architecture can be made use to enhance the speeds of other non-graphical algorithms too. In the near future the graphics processors would not be known for just increasing the visual experience and imaging but would also be known in every field where there is a need of fast processing and computational power.

 

 

 

 

 

 

 

 

 

[1] S.Rathi, C.Dhote, V. Bangera, Accelerating XML Mining using Graphic Processors International Conference on Control, Instrumentation, Communication and Computational Technologies (ICCICCT) 2014 [2] J.W. Wan and G. Dobbie, Extracting Association Rules from XML documents using XQuery, 2003 [3] C. Zheng, Y. Fang and Y. Shen, Web Mining Based on XML Technology, Computational Intelligence and Security, 2009. CIS '09. International Conference on (Volume:1 ) [4] R.Porkodi, V.Bhuvaneswari, R.Rajesh, T.Amudha, \emph{An Improved Association Rule Mining Technique for Xml Data Using Xquery and Apriori Algorithm}, 2009 IEEE International Advance Computing Conference (IACC 2009) Patiala, India, 978-1-4244-1888-6/08 2008 IEEE. [5] Liu, Md. Sumon Shahriar and Jixue. \emph{On Mining Association Rules with Semantic Constraints in XML}, ICDIM, pp.1-5, 978-1-4577-1539-6/11 IEEE, 2011. [6] Zhi-gang Wang, Chi-she Wang. \emph{A parallel association-rule mining algorithm}. In Proceedings of the 2012 International conference on Web Information Systems and Mining, WISM'12, Springer, pp. 125-129. [7] Han J, Pei J, Yin Y, \emph{Mining frequent patterns without candidate generation}, In: Proc. of the ACM SIGM0D Conference on Management of Data. Dallas, TX, 2000.2 [8] Fang R., He B., Lu M., Yang K., Govindaraju N. K., Luo Q., Sander P. V.: \emph{GPUQP:query co-processing using graphics processors}. In ACM SIGMOD International Conference on Managament of Data, pp. 10611063, New York, NY, USA, 2007. ACM. [9] Braga, A. Campi, S. Ceri, M. Klemettinen, and P. L. Lanzi. \emph{Mining association rules from xml data}. In Proc.Of 4th InternationalConference on DataWarehousing and Knowledge Discovery(DaWaK’02), volume 2454 of LNCS. Springer, pp.21-30,2002. [10] S. Che, M. Boyer, J. Meng, D. Tarjan, J. W. Sheaffer, and K. Skadron. \emph{A performance study of general-purpose applications on graphics processors using cuda}. J. Parallel Distrib. Comput.,68(10):1370-1380, 2008. [11] J. Dean and S. Ghemawat. Mapreduce: \emph{Simplified data processing on large clusters}. Commun. ACM, 51(1):107-113, 2008 [12] W. Fang, K. K. Lau, M. Lu, X. Xiao, C. K. Lam, P. Y. Yang, B. Hel, Q. Luo, P. V. Sander, and K. Yang.\emph{ Parallel data mining on graphics processors.} Technical report, Hong Kong University of Science and Technology,2008 [13] CUDA C/C++ Basics – Nvidia www.nvidia.com/docs/IO/116711/sc11-cuda-c-basics.pdf