Classifying Web Spam Using Block-based TrustRank

Authors

  • M Sreevani Dept of CSE, MGIT, Gandipet, Hyderabad -500075 Author
  • R Bhramaramba Dept of IT, GITAM University, Vizag-530045 Author
  • D Vasumati Dept of CSE, JNTUH, Hyderabad-500032 Author
  • O.Yaswanth Babu CMC Limited(A TATA Enterprise), Hyderabad- Author

Keywords:

spam, page segmentation, TrustRank.

Abstract

Web spamming refers to actions intended to mislead search engines into ranking some pages higher than they deserve. TrustRank is a recent algorithm that can combat web spam. However, the seed set used by TrustRank may not be sufficiently representative to cover well the different topics on the Web. In this paper, We propose the use of Combined page segmentation for selecting seed set in TrustRank algorithm and uses Block-level retrieval to rank the seed pages so that we can use highly multiple–topic ranked pages as seed set. Experimental results show that our approach deals effectively with the problem of multiple drifting topics and identify highly desirable pages for seed set and thus improve the performance of TrustRank.

References

Downloads

Published

2012-12-31

Issue

Section

Articles

How to Cite

Classifying Web Spam Using Block-based TrustRank. (2012). International Journal of Current Engineering and Technology, 2(4), 369-373. https://ijcet.evegenis.org/index.php/ijcet/article/view/36