In AJAX search engine, indexing techniques for
the hidden web document is the major issue to optimize
speed and performance to find out the related hidden data
for a search query. This paper is based on indexing the
hidden web documents for AJAX web search engine to
emphasis on how to utilize the resources to fulfill the extent
to save time and cost as compare to the existing frameworks
for the organization details to understanding of the steps to
be taken both in term of business process and technical
implementation, evaluation and analysis. The searching
process is basically directly based on the indexing technique.
In web there are many indexing technique which index the
web content retrieved by a general purpose of web crawler.
But it’s not necessary that all the indexing techniques are
good for the hidden web. So that there are needs for the
hidden web content must be indexed efficiently. In this
research paper proposed a designed an efficient indexing
technique. The main aim of this indexing technique is to
reduce the query processing time based on domain and give
more specific result html pages to the use’s query.
Published In:IJCSN Journal Volume 5, Issue 6
Date of Publication : December 2016
Pages : 924-929
Figures :03
Tables : --
Beena Mahar : Research Scholar, AIM & ACT Department
Banasthali Vidyapeeth, Rajisthan, India.
Dr. C K Jha : HOD, AIM & ACT Department
Banasthali Vidyapeeth, Rajisthan, India.
Hidden Web, AJAX, Deep Web, Indexing
Technique, Web Index, Indexer, Crawler
The Hidden Web is important because it retrieves highquality
information. Therefore there is a need to
implement an indexing technique to be more efficient to
index the high quality data. Many research focus on the
crawling and indexing algorithms for client side as well
as server side DOM state changes, some are the hidden
web behind forms, text and search query with their own
advantages and disadvantage. In this paper proposed a
framework for indexing technique which is performed by
the various modules. This is a simple and efficient
indexing technique for improving the access of hidden web documents for the AJAX search engines. This
research can also be expanded through adding multiple
domain and their corresponding sub domains. Along with
modification in implementation, new algorithms for
better understanding and quick results of multiphase
dynamic queries can be introduced.
[1] J. P. Lage, A.S.da Silva, P.B. Golgher and A.H.F.
Laender, “Automatic generation of agents for
collecting hidden web pages for data extraction” In
Data & Knowledge Engineering, volume 49,issue 2,
pages 177-196,2004.
[2] L. Barbosa and J. Freire,”An adaptive crawler for
location hidden-web entry points” In Proc. of the 16th
Int. Conf. on World Wide Web, pages 441-450,
ACM-2007.
[3] S. Raghavan and H. Garcia-Moline, “Crawling the
hidden web” In Proc. of the Conf. on Very Large Data
Bases, pages 129-138, 2001.
[4] Hasan Mahmud, Moumie Soulemane and
Mohammad Rafiuzzaman, “ A framework for
dynamic indexing from hidden web” In the Int
.Journal of Computer Science, Volume 8, Issue 2,
2011
[5] Eibe Frank, Gordon W. Paynter, Ian H. Witten, Carl
Gutwin and Craig G. Nevill-Manning, ”Domain
Specific Key phrase Extraction” ” In the Proc. of the
16th Int. Conf. on Artificial Intelligence IJCAI,
volume 2, pages 668-673, ACM-1999
[6] Moumie Soulemane, Mohammad Rafiuzzaman,
Hasan Mahmud,”Crawling the Hidden web:An
approach to dynamic web indexing”, In the Int.
journal of Computer Application, volume 55-no.1,
Snn no: 0975-8887, 2012
[7] Steven s Skiena,”The Algorithm design”, Manual 2nd
edition , Springer, Verlag London , 2008
[8] Rahul Kumar, Anurag Jain and Chetan
Agarwl,”Survey of web crawling algorithms”, In the
Int. Journal of Advances in Visions
Computing(AVC), volume 1, Issue 2/3, 2014
[9] Aviral Nigam, NIT-Calicut, “Web Crawling
Algorithms”, In the International Journal of Computer
Science and Artificial Intelligence, volume 4, Issue
3,Pages 63-67, 2014
[10] Justin Zobel and Alistair Moffat,”Inverted Files for
Text Search Engines”, In the Journal of ACM
Computing Surveys(CSUR), Volume 38, Issue 2, July
2006
[11] Priyanka Gupta, Komal Bhatia and Kalpna
Gupta,”Optimized method for indexing the Hidden
web data”, In the Int.Journal of Inforatmation
Technology and knowledge Management, Volume 4,
Issue 2, pages 673-678, July 2011
[12] Anjali Ganesh Jivani, “A Comparative Study of
Stemming Algorithims” In the Int. Journal of
Computer Technology and Applciation (IJCTA),
Volume 2, Issue 6, Pages 1930-1938, 2011
[13] Hao Yah, Shuai Ding and Torsten Suel “Inverted
Index Compression and Query Processing with
Optimized Document Ordering”, In the Int. WWW
Conf. Committee –IW3C2 Madrid, Spain, Pages 20-
24, ACM 2009
[14] Farhi Marir and lamel Houam,”RST Index: indexing
and retrieving web document using computational and
linguistic techniques” ” In the Proc. of the 3rd Int.
Conf. on Intelligent Data Engineering and Automated
Learning, UK, pages 135-140, 2002 Available at
http://portal.acm.org/citation.cfm?id=646288.686474.
[15] Jagannathan Srinivasan, Ravi Murthy, Seema
Sundara, Nipun Aggarwal and Samuel
DeFazio,”Extensible Indexing : A framework for
integrating domain specific indexing schemes into
Oracle 8i”, In the Proc. of the 16th Int. Conf. on Data
Engineering IEEE-2000
[16] A.Mesbah, A..an Deursen and S. Lenselink,
”Crawling AJAX Based web applications through
dynamic analysis of uder interface state changes”, In
ACM Transaction on the web- TWEB, volume 6,
Issue 1, 2012
[17] C Duda, G. Frey, D. Kossmann, R. Matter and C
Zhou,”AJAX Crawl: making Ajax applications
searchable” In the Proc. of the Int. Conf. on Data
Engineering, pages 78-78, 2009
[18] A. Bergholz, B. Chidlovskii, “Crawling for Domain-
Specific Hidden Web Resources” In the Proc. of the
4th Int. Conf. on Web Information System
Engineering,2003
[19] A. Ntoulas, P. Zerfos and J. Cho, ”Downloading
Textual Hidden Web Content through Keyword
Queries” In the Proc. of the 5th ACM/IEEE Joint
Conf. on Digital Libraris,2005
[20] S. Liddle, D. Embley, Del Scott and S. Ho Yau, ”
Extracting Data Behind Web Forms” In the Proc. of
the 28th Int. Conf. on Very Large Data Bases, China,
2005
[21] Manuel Alvarez, Juan Raposo, Alberto Pan, Fidel
Cacheda, Femando Bellas and Victor Cameiro,
”Crawling the Content Hidden Behind Web forms ”
Department of Information and Communications
Technologies, University of A Coruna, 15071, Spain,
[22] Ritu Shandilya, Sugam Sharma and Shamimul
Qamar,”A Domain Specific Indexing Technique for
Hidden Web Documents”, In CISME, volume 2, issue
2, pages 37-41, 2012. Available at: www.jcisme.org
[23] A. K. Sharma and Komal Kumar Bhatia,”Merging
”query interfaces in domain specific hidden web
databases” In Int. Journal of Computer Science,2008
[24] A. Mesbah, E. Bozdag and A.V. Deursen, “Crawling
AJAX by inferring user interface state changes”, In
the Proc. Of 8th Int. Conf. on Web Engineering
(ICWE), Washington DC,USA, IEEsE-CSI , pages
122-134, 2008
[25] Zahra Behfarshad and Ali Mesbah. “Hidden-Web
Induced by Client-Side Scripting: An Empirical
Study”, Springer Berlin Heidelberg, In Proceedings
International Conference (ICWE 2013) Aalborg,
Denmark, pages 52-67, 2013. [26] Li Jie Cui, Hui He and Hong Wei Xuan ”Analysis and
Implementation of an Ajax-enabled Web Crawler” ,In
the Int .Journal of Future Generation Communication
and Networking , Volume 6, Issue 2, April 2013
[27] Paul Suganthan G. C.,”AJAX Crawler”, In the Int.
Conf. on Data Science and Engineering,(ICDSE),
IEEE- 2012
[28] A. Mesbah, A. Van Deursen and S. Lenselink,
“Crawling Ajax based web applications through
dynamic analysis of user interface state changes”, In
ACM Transaction on the web- TWEB, volume 6,
Issue 1,page 3, 2012
[29] A Rosaline Mary, B Visvanath, ”Evaluation of web
search engine-a comparative study”, In Research Gate
2015
[30] Bhupendra Singh, Shashank Sahu,”Model for
performance testing of AJAX based web
applications”, In Int. Journal of Research in
Engineering and Technology, volume 3, Issue 4,
eISSN 2319-1163,pISSN 2321-7308, 2014
[31] Bhupendra Singh, Shashank Sahu,”A Noval approach
for evaluation of applying AJAX in the web site”, In
Int. Journal of Research in Engineering and
Technology, volume 3, Issue 8, eISSN 2319-
1163,pISSN 2321-7308, 2014.