The distributed data mining paradigm is an active research area because of the enormous volume of data that must be processed across a wide cluster of data nodes. Document clustering algorithms are widely applied in a variety of distributed environments, such as peer-to-peer networks and wireless sensor networks. This paper presents a comprehensive review of most of the recent distributed document clustering algorithms, which are having a substantial impact on the technological realm. These algorithms are analysed on a few pivotal elements: clustering quality, scale-up, speed-up and accuracy. Recent advances in technology have produced MapReduce-based distributed document clustering algorithms, which show dramatic improvements in these analytical elements. Based on the review, informed discussions are presented for algorithm development and implementation.
Keywords: distributed document clustering; speed-up; scale-up; MapReduce.
Reference to this paper should be made as follows: Judith, J.E. and Jayakumari, J. (2015) 'Distributed document clustering algorithms: a recent survey', Int. J. Enterprise Network Management, Vol. 6, No. 3, pp.207–221.
Biographical notes: J.E. Judith received her BE in Computer Science and Engineering from Manonmaniam Sundaranar University in 2003 and her ME in Computer Science and Engineering from Karunya University in 2006. Currently, she is working as an Assistant Professor at the Department of Computer Science and Engineering
International Journal of Enterprise Network Management – Inderscience Publishers
Published: Jan 1, 2015