Focused Web Crawler and its Approaches

Jay Sampat; Anmol Jain; Dharmeshkumar Mistry

Authors

Jay Sampat Department of Computer Science, Dwarkadas J Sanghvi College of Engineering, Vile Parle(W), Mumbai, India Author
Anmol Jain Department of Computer Science, Dwarkadas J Sanghvi College of Engineering, Vile Parle(W), Mumbai, India Author
Dharmeshkumar Mistry Department of Computer Science, Dwarkadas J Sanghvi College of Engineering, Vile Parle(W), Mumbai, India Author

Keywords:

web crawlers, focused crawlers, web pages, priority based, contextual based, indexing.

Abstract

There has been a rapid growth of the world-wide web which has scaled beyond our imaginations. To surmount these challenges search engines are used. One of the most important type of crawler is Focused crawler which is used to index information according to a particular topic. To maximize the possibility of downloading relevant documents focused crawler makes a prediction of hyperlinks visiting priority which in turn helps to reduce downloading of irrelevant documents and drastically saves network resources and hardware. Instead of using keywords topics are specified by using commendable documents. One of the most important feature of this type of web crawler is collecting and indexing all accessible web credentials. This crawler mainly diagnosis its crawl boundary to search different URLs. In this paper we’ll illustrate a clear cut comparison between focused and standard web crawlers as well as various approaches of focused crawling like contextual and priority based crawling.

Focused Web Crawler and its Approaches

Authors

Keywords:

Abstract

References

Downloads

Published

Issue

Section

How to Cite

journal_details

IMPACT METRIC: 8.7

Information

call_for_papers

Make a Submission

indexed_in

facts_and_figures

Share