Home
Call For Papers
Submission
Author
Registration
Publications
About
Contact Us

  An Efficient Indexing Approach on Hidden Web for AJAX Applications  
  Authors : Beena Mahar; Dr. C K Jha
  Cite as:

 

In AJAX search engine, indexing techniques for the hidden web document is the major issue to optimize speed and performance to find out the related hidden data for a search query. This paper is based on indexing the hidden web documents for AJAX web search engine to emphasis on how to utilize the resources to fulfill the extent to save time and cost as compare to the existing frameworks for the organization details to understanding of the steps to be taken both in term of business process and technical implementation, evaluation and analysis. The searching process is basically directly based on the indexing technique. In web there are many indexing technique which index the web content retrieved by a general purpose of web crawler. But it’s not necessary that all the indexing techniques are good for the hidden web. So that there are needs for the hidden web content must be indexed efficiently. In this research paper proposed a designed an efficient indexing technique. The main aim of this indexing technique is to reduce the query processing time based on domain and give more specific result html pages to the use’s query.

 

Published In : IJCSN Journal Volume 5, Issue 6

Date of Publication : December 2016

Pages : 924-929

Figures :03

Tables : --

 

Beena Mahar : Research Scholar, AIM & ACT Department Banasthali Vidyapeeth, Rajisthan, India.

Dr. C K Jha : HOD, AIM & ACT Department Banasthali Vidyapeeth, Rajisthan, India.

 

 

 

 

 

 

 

Hidden Web, AJAX, Deep Web, Indexing Technique, Web Index, Indexer, Crawler

The Hidden Web is important because it retrieves highquality information. Therefore there is a need to implement an indexing technique to be more efficient to index the high quality data. Many research focus on the crawling and indexing algorithms for client side as well as server side DOM state changes, some are the hidden web behind forms, text and search query with their own advantages and disadvantage. In this paper proposed a framework for indexing technique which is performed by the various modules. This is a simple and efficient indexing technique for improving the access of hidden web documents for the AJAX search engines. This research can also be expanded through adding multiple domain and their corresponding sub domains. Along with modification in implementation, new algorithms for better understanding and quick results of multiphase dynamic queries can be introduced.

 

[1] J. P. Lage, A.S.da Silva, P.B. Golgher and A.H.F. Laender, “Automatic generation of agents for collecting hidden web pages for data extraction” In Data & Knowledge Engineering, volume 49,issue 2, pages 177-196,2004. [2] L. Barbosa and J. Freire,”An adaptive crawler for location hidden-web entry points” In Proc. of the 16th Int. Conf. on World Wide Web, pages 441-450, ACM-2007. [3] S. Raghavan and H. Garcia-Moline, “Crawling the hidden web” In Proc. of the Conf. on Very Large Data Bases, pages 129-138, 2001. [4] Hasan Mahmud, Moumie Soulemane and Mohammad Rafiuzzaman, “ A framework for dynamic indexing from hidden web” In the Int .Journal of Computer Science, Volume 8, Issue 2, 2011 [5] Eibe Frank, Gordon W. Paynter, Ian H. Witten, Carl Gutwin and Craig G. Nevill-Manning, ”Domain Specific Key phrase Extraction” ” In the Proc. of the 16th Int. Conf. on Artificial Intelligence IJCAI, volume 2, pages 668-673, ACM-1999 [6] Moumie Soulemane, Mohammad Rafiuzzaman, Hasan Mahmud,”Crawling the Hidden web:An approach to dynamic web indexing”, In the Int. journal of Computer Application, volume 55-no.1, Snn no: 0975-8887, 2012 [7] Steven s Skiena,”The Algorithm design”, Manual 2nd edition , Springer, Verlag London , 2008 [8] Rahul Kumar, Anurag Jain and Chetan Agarwl,”Survey of web crawling algorithms”, In the Int. Journal of Advances in Visions Computing(AVC), volume 1, Issue 2/3, 2014 [9] Aviral Nigam, NIT-Calicut, “Web Crawling Algorithms”, In the International Journal of Computer Science and Artificial Intelligence, volume 4, Issue 3,Pages 63-67, 2014 [10] Justin Zobel and Alistair Moffat,”Inverted Files for Text Search Engines”, In the Journal of ACM Computing Surveys(CSUR), Volume 38, Issue 2, July 2006 [11] Priyanka Gupta, Komal Bhatia and Kalpna Gupta,”Optimized method for indexing the Hidden web data”, In the Int.Journal of Inforatmation Technology and knowledge Management, Volume 4, Issue 2, pages 673-678, July 2011 [12] Anjali Ganesh Jivani, “A Comparative Study of Stemming Algorithims” In the Int. Journal of Computer Technology and Applciation (IJCTA), Volume 2, Issue 6, Pages 1930-1938, 2011 [13] Hao Yah, Shuai Ding and Torsten Suel “Inverted Index Compression and Query Processing with Optimized Document Ordering”, In the Int. WWW Conf. Committee –IW3C2 Madrid, Spain, Pages 20- 24, ACM 2009 [14] Farhi Marir and lamel Houam,”RST Index: indexing and retrieving web document using computational and linguistic techniques” ” In the Proc. of the 3rd Int. Conf. on Intelligent Data Engineering and Automated Learning, UK, pages 135-140, 2002 Available at http://portal.acm.org/citation.cfm?id=646288.686474. [15] Jagannathan Srinivasan, Ravi Murthy, Seema Sundara, Nipun Aggarwal and Samuel DeFazio,”Extensible Indexing : A framework for integrating domain specific indexing schemes into Oracle 8i”, In the Proc. of the 16th Int. Conf. on Data Engineering IEEE-2000 [16] A.Mesbah, A..an Deursen and S. Lenselink, ”Crawling AJAX Based web applications through dynamic analysis of uder interface state changes”, In ACM Transaction on the web- TWEB, volume 6, Issue 1, 2012 [17] C Duda, G. Frey, D. Kossmann, R. Matter and C Zhou,”AJAX Crawl: making Ajax applications searchable” In the Proc. of the Int. Conf. on Data Engineering, pages 78-78, 2009 [18] A. Bergholz, B. Chidlovskii, “Crawling for Domain- Specific Hidden Web Resources” In the Proc. of the 4th Int. Conf. on Web Information System Engineering,2003 [19] A. Ntoulas, P. Zerfos and J. Cho, ”Downloading Textual Hidden Web Content through Keyword Queries” In the Proc. of the 5th ACM/IEEE Joint Conf. on Digital Libraris,2005 [20] S. Liddle, D. Embley, Del Scott and S. Ho Yau, ” Extracting Data Behind Web Forms” In the Proc. of the 28th Int. Conf. on Very Large Data Bases, China, 2005 [21] Manuel Alvarez, Juan Raposo, Alberto Pan, Fidel Cacheda, Femando Bellas and Victor Cameiro, ”Crawling the Content Hidden Behind Web forms ” Department of Information and Communications Technologies, University of A Coruna, 15071, Spain, [22] Ritu Shandilya, Sugam Sharma and Shamimul Qamar,”A Domain Specific Indexing Technique for Hidden Web Documents”, In CISME, volume 2, issue 2, pages 37-41, 2012. Available at: www.jcisme.org [23] A. K. Sharma and Komal Kumar Bhatia,”Merging ”query interfaces in domain specific hidden web databases” In Int. Journal of Computer Science,2008 [24] A. Mesbah, E. Bozdag and A.V. Deursen, “Crawling AJAX by inferring user interface state changes”, In the Proc. Of 8th Int. Conf. on Web Engineering (ICWE), Washington DC,USA, IEEsE-CSI , pages 122-134, 2008 [25] Zahra Behfarshad and Ali Mesbah. “Hidden-Web Induced by Client-Side Scripting: An Empirical Study”, Springer Berlin Heidelberg, In Proceedings International Conference (ICWE 2013) Aalborg, Denmark, pages 52-67, 2013. [26] Li Jie Cui, Hui He and Hong Wei Xuan ”Analysis and Implementation of an Ajax-enabled Web Crawler” ,In the Int .Journal of Future Generation Communication and Networking , Volume 6, Issue 2, April 2013 [27] Paul Suganthan G. C.,”AJAX Crawler”, In the Int. Conf. on Data Science and Engineering,(ICDSE), IEEE- 2012 [28] A. Mesbah, A. Van Deursen and S. Lenselink, “Crawling Ajax based web applications through dynamic analysis of user interface state changes”, In ACM Transaction on the web- TWEB, volume 6, Issue 1,page 3, 2012 [29] A Rosaline Mary, B Visvanath, ”Evaluation of web search engine-a comparative study”, In Research Gate 2015 [30] Bhupendra Singh, Shashank Sahu,”Model for performance testing of AJAX based web applications”, In Int. Journal of Research in Engineering and Technology, volume 3, Issue 4, eISSN 2319-1163,pISSN 2321-7308, 2014 [31] Bhupendra Singh, Shashank Sahu,”A Noval approach for evaluation of applying AJAX in the web site”, In Int. Journal of Research in Engineering and Technology, volume 3, Issue 8, eISSN 2319- 1163,pISSN 2321-7308, 2014.