Survey on Clustering of Text using COATES Methodology

Authors

  • Sneha S. Bhatkulkar, Department of Information Technology, Shri Guru Gobind Singhji Institute of Engg & Technology, Nanded, India-431606
  • M. V. Vaidya, Department of Information Technology, Shri Guru Gobind Singhji Institute of Engg & Technology, Nanded, India-431606

DOI:

https://doi.org/10.14741/

Abstract

In many text mining applications, side-information is available along with the text documents. Such side-information may include document provenance information, links in the document, user-access behavior from web logs, or other non-textual attributes embedded in the documents. These attributes can lead to better clustering results. However, the relative importance of this side-information may be difficult to estimate, especially when some of it is noisy. A principled way to perform the mining process is therefore required in order to maximize the advantages of side-information. In this paper, we design an algorithm which combines classical partitioning algorithms with probabilistic models in order to create an effective clustering approach.
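
The combination described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes documents are given as term-count vectors, that side-information is a list of binary attributes per document, and that a cosine-similarity partitioning step simply alternates with a naive-Bayes-style reassignment over the side attributes. All names (`content_step`, `side_step`, `cluster`, `seeds`) and the toy data are hypothetical.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0

def centroids_of(vectors, labels, k):
    """Mean term vector of each cluster; an empty cluster gets a zero vector."""
    dim = len(vectors[0])
    sums = [[0.0] * dim for _ in range(k)]
    counts = [0] * k
    for v, c in zip(vectors, labels):
        counts[c] += 1
        for j, x in enumerate(v):
            sums[c][j] += x
    return [[s / counts[c] if counts[c] else 0.0 for s in sums[c]] for c in range(k)]

def content_step(vectors, centroids):
    """Partitioning step: assign each document to its most similar centroid."""
    return [max(range(len(centroids)), key=lambda c: cosine(v, centroids[c]))
            for v in vectors]

def side_step(side, labels, k):
    """Probabilistic step (assumed form): reassign each document to the cluster
    with the highest posterior given its binary side attributes, using
    Laplace-smoothed estimates taken from the current assignment."""
    n, m = len(side), len(side[0])
    counts = [0] * k
    on = [[0] * m for _ in range(k)]
    for s, c in zip(side, labels):
        counts[c] += 1
        for j, bit in enumerate(s):
            on[c][j] += bit
    new_labels = []
    for s in side:
        scores = []
        for c in range(k):
            log_p = math.log((counts[c] + 1) / (n + k))      # smoothed prior
            for j, bit in enumerate(s):
                p = (on[c][j] + 1) / (counts[c] + 2)         # smoothed P(attr=1 | c)
                log_p += math.log(p if bit else 1.0 - p)
            scores.append(log_p)
        new_labels.append(max(range(k), key=lambda c: scores[c]))
    return new_labels

def cluster(vectors, side, seeds, iterations=5):
    """Alternate side-information and content steps, starting from seed documents."""
    k = len(seeds)
    centroids = [list(vectors[i]) for i in seeds]
    labels = content_step(vectors, centroids)
    for _ in range(iterations):
        labels = side_step(side, labels, k)
        centroids = centroids_of(vectors, labels, k)
        labels = content_step(vectors, centroids)
    return labels

# Toy data: four documents over four terms, each with two binary side attributes.
vectors = [[2, 1, 0, 0], [1, 2, 0, 0], [0, 0, 1, 2], [0, 0, 2, 1]]
side = [[1, 0], [1, 0], [0, 1], [0, 1]]
print(cluster(vectors, side, seeds=[0, 2]))
```

In this sketch the side attributes only influence the result through the centroids recomputed after each probabilistic step, which is one simple way to keep noisy side-information from overriding the text content entirely.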

Published

2016-06-30

Section

Articles

How to Cite

Survey on Clustering of Text using COATES Methodology. (2016). International Journal of Current Engineering and Technology, 6(3), 900-903. https://doi.org/10.14741/