Focused Web Crawler and its Approaches

Authors

  • Jay Sampat, Department of Computer Science, Dwarkadas J Sanghvi College of Engineering, Vile Parle (W), Mumbai, India
  • Anmol Jain, Department of Computer Science, Dwarkadas J Sanghvi College of Engineering, Vile Parle (W), Mumbai, India
  • Dharmeshkumar Mistry, Department of Computer Science, Dwarkadas J Sanghvi College of Engineering, Vile Parle (W), Mumbai, India

Keywords:

web crawlers, focused crawlers, web pages, priority based, contextual based, indexing.

Abstract

The World Wide Web has grown rapidly, to a scale beyond our imagination, and finding relevant information in it has become a challenge. Search engines are used to surmount this challenge. One of the most important types of crawler is the focused crawler, which indexes information according to a particular topic. To maximize the likelihood of downloading relevant documents, a focused crawler predicts the visiting priority of hyperlinks, which in turn reduces the downloading of irrelevant documents and saves considerable network and hardware resources. Instead of keywords, topics are specified using exemplary documents. One of the most important features of this type of web crawler is collecting and indexing accessible web documents. The crawler mainly analyzes its crawl boundary to decide which URLs to search. In this paper we present a clear-cut comparison between focused and standard web crawlers, as well as various approaches to focused crawling, such as contextual and priority-based crawling.
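
The idea of predicting a visiting priority for each hyperlink can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes an in-memory toy web (TOY_WEB) in place of real downloads, a topic described by a few exemplary documents (EXEMPLARS), and a priority estimated as the cosine similarity between the topic's term counts and a link's anchor text. All names, pages, and the scoring rule are hypothetical and chosen only for illustration.

```python
import heapq
import re
from collections import Counter
from math import sqrt


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two term-count vectors (Counters)."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def focused_crawl(web, seeds, exemplars, max_pages=3):
    """Visit pages in order of predicted relevance to the exemplary documents.

    `web` maps a URL to (page_text, [(anchor_text, target_url), ...]) and
    stands in for real HTTP downloads (an assumption of this sketch).
    """
    # The topic is specified by exemplary documents rather than by keywords.
    topic = Counter(t for doc in exemplars for t in tokenize(doc))

    # heapq is a min-heap, so scores are negated to pop the best link first.
    frontier = [(-1.0, url) for url in seeds]
    heapq.heapify(frontier)
    seen = set(seeds)
    visited = []

    while frontier and len(visited) < max_pages:
        _, url = heapq.heappop(frontier)
        _text, links = web[url]  # "download" the page
        visited.append(url)
        for anchor, target in links:
            if target in seen or target not in web:
                continue
            seen.add(target)
            # Predict the priority before downloading, from anchor text only.
            score = cosine(topic, Counter(tokenize(anchor)))
            heapq.heappush(frontier, (-score, target))
    return visited


# Hypothetical toy web used only for illustration.
TOY_WEB = {
    "seed": ("start page", [("python web crawler tutorial", "crawl"),
                            ("cheap flight deals", "travel")]),
    "crawl": ("crawler tutorial", [("focused crawler indexing", "focused"),
                                   ("celebrity gossip", "gossip")]),
    "travel": ("flight deals", []),
    "focused": ("focused crawling", []),
    "gossip": ("gossip page", []),
}

EXEMPLARS = [
    "A focused web crawler downloads pages relevant to a topic",
    "crawler indexing of web pages",
]

if __name__ == "__main__":
    print(focused_crawl(TOY_WEB, ["seed"], EXEMPLARS, max_pages=3))
```

With a budget of three pages, this sketch visits "seed", "crawl", and "focused" and never downloads the off-topic "travel" and "gossip" pages. A standard crawler visiting links in discovery order would spend part of the same budget on "travel".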

Published

2014-10-31

Section

Articles

How to Cite

Focused Web Crawler and its Approaches. (2014). International Journal of Current Engineering and Technology, 4(5), 3121-3124. https://ijcet.evegenis.org/index.php/ijcet/article/view/1180