Access the full text.
Sign up today, get DeepDyve free for 14 days.
A. Jumani, M. Mahar, F. Khoso, M. Memon (2018)
Online Text Categorization System Using Support Vector MachineSindh University Research Journal, 50
H. Hashimi, Alaaeldin Hafez, H. Mathkour (2015)
Selection criteria for text mining approachesComput. Hum. Behav., 51
B. Sharef, N. Omar, Zeyad Sharef (2014)
An automated arabic text categorization based on the frequency ratio accumulationInt. Arab J. Inf. Technol., 11
N. Ranjan, R. Prasad (2018)
LFNN: Lion fuzzy neural network-based evolutionary model for text classification using context and sense based featuresAppl. Soft Comput., 71
V. Korde, C. Mahender (2012)
TEXT CLASSIFICATION AND CLASSIFIERS: A SURVEYInternational Journal of Artificial Intelligence & Applications, 3
S. Chander, P. Vijaya, P. Dhyani (2017)
Multi kernel and dynamic fractional lion optimization algorithm for data clusteringalexandria engineering journal, 57
Z. Elberrichi, Abdellatif Rahmoun, Mohamed Bentaallah (2008)
Using WordNet for Text CategorizationInt. Arab J. Inf. Technol., 5
Bo Tang, S. Kay, Haibo He (2016)
Toward Optimal Feature Selection in Naive Bayes for Text CategorizationIEEE Transactions on Knowledge and Data Engineering, 28
D. Lewis (1996)
Challenges in machine learning for text classification
F. Camastra, Gennaro Razi (2020)
Italian Text Categorization with Lemmatization and Support Vector Machines
Chuan Liu, Wenyong Wang, Guanghui Tu, Yu Xiang, Siyang Wang, Fengmao Lv (2017)
A new Centroid-Based Classification model for text categorizationKnowl. Based Syst., 136
L. Fu, Hui-Hunag Hsu, J. Príncipe (1994)
A knowledge-based approach to supervised incremental learningProceedings of 1994 IEEE International Conference on Neural Networks (ICNN'94), 3
Tianyi Ma, G. Motta, Kaixu Liu (2017)
Delivering Real-Time Information Services on Public Transit: A FrameworkIEEE Transactions on Intelligent Transportation Systems, 18
(2014)
SR-K-Means clustering algorithm for semantic information retrieval
Shengli Song, Xiao Qiao, Ping Chen (2009)
Hierarchical Text Classification Incremental Learning
Z. Pawlak (1998)
Rough Set Theory and its Applications to Data AnalysisCybern. Syst., 29
Kui Xie, G. Tosto, Lin Lu, Youngsuk Cho (2018)
Detecting leadership in peer-moderated online collaborative learning through text mining and social network analysisInternet High. Educ., 38
R. Silva, Túlio Alberto, Tiago Almeida, A. Yamakami (2017)
Towards filtering undesired short text messages using an online learning approach with semantic indexingExpert Syst. Appl., 83
Gaige Wang (2018)
Moth search algorithm: a bio-inspired metaheuristic algorithm for global optimization problemsMemetic Computing, 10
(1999)
Text Mining: the state of the art and the challenges”
M. Ghiassi, M. Olschimke, B. Moon, P. Arnaudo (2012)
Automated text classification using a dynamic artificial neural network modelExpert Syst. Appl., 39
M. Beno, I. Valarmathi, S. Swamy, B. Rajakumar (2014)
Threshold prediction for segmenting tumour from brain MRI scansInternational Journal of Imaging Systems and Technology, 24
Renjith Thomas, M. Rangachar (2017)
Fractional Bat and Multi-Kernel-Based Spherical SVM for Low Resolution Face RecognitionInt. J. Pattern Recognit. Artif. Intell., 31
Yanfang Zhang, Yan Wan (2017)
How to find valuable references? Application of text mining in abstract clustering2017 13th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD)
ZhiHang Chen, Liping Huang, Y. Murphey (2007)
Incremental Learning for Text Document Classification2007 International Joint Conference on Neural Networks
Oswaldo Ludwig, U. Nunes, R. Araújo (2014)
Eigenvalue decay: A new method for neural network regularizationNeurocomputing, 124
Hidayet Takçi, Tunga Güngör (2012)
A high performance centroid-based classification approach for language identificationPattern Recognit. Lett., 33
G. Berge, Ole-Christoffer Granmo, T. Tveit, Morten Goodwin, Lei Jiao, B. Matheussen (2018)
Using the Tsetlin Machine to Learn Human-Interpretable Rules for High-Accuracy Text Categorization With Medical ApplicationsIEEE Access, 7
Text mining has been used for various knowledge discovery based applications, and thus, a lot of research has been contributed towards it. Latest trending research in the text mining is adopting the incremental learning data, as it is economical while dealing with large volume of information.Design/methodology/approachThe primary intention of this research is to design and develop a technique for incremental text categorization using optimized Support Vector Neural Network (SVNN). The proposed technique involves four major steps, such as pre-processing, feature selection, classification and feature extraction. Initially, the data is pre-processed based on stop word removal and stemming. Then, the feature extraction is done by extracting semantic word-based features and Term Frequency and Inverse Document Frequency (TF-IDF). From the extracted features, the important features are selected using Bhattacharya distance measure and the features are subjected as the input to the proposed classifier. The proposed classifier performs incremental learning using SVNN, wherein the weights are bounded in a limit using rough set theory. Moreover, for the optimal selection of weights in SVNN, Moth Search (MS) algorithm is used. Thus, the proposed classifier, named Rough set MS-SVNN, performs the text categorization for the incremental data, given as the input.FindingsFor the experimentation, the 20 News group dataset, and the Reuters dataset are used. Simulation results indicate that the proposed Rough set based MS-SVNN has achieved 0.7743, 0.7774 and 0.7745 for the precision, recall and F-measure, respectively.Originality/valueIn this paper, an online incremental learner is developed for the text categorization. The text categorization is done by developing the Rough set MS-SVNN classifier, which classifies the incoming texts based on the boundary condition evaluated by the Rough set theory, and the optimal weights from the MS. The proposed online text categorization scheme has the basic steps, like pre-processing, feature extraction, feature selection and classification. The pre-processing is carried out to identify the unique words from the dataset, and the features like semantic word-based features and TF-IDF are obtained from the keyword set. Feature selection is done by setting a minimum Bhattacharya distance measure, and the selected features are provided to the proposed Rough set MS-SVNN for the classification.
Data Technologies and Applications – Emerald Publishing
Published: Nov 2, 2020
Keywords: Incremental learning; Text mining; Support Vector Neural Network; Rough set theory; Moth search algorithm
Read and print from thousands of top scholarly journals.
Already have an account? Log in
Bookmark this article. You can see your Bookmarks on your DeepDyve Library.
To save an article, log in first, or sign up for a DeepDyve account if you don’t already have one.
Copy and paste the desired citation format or use the link below to download a file formatted for EndNote
Access the full text.
Sign up today, get DeepDyve free for 14 days.
All DeepDyve websites use cookies to improve your online experience. They were placed on your computer when you launched this website. You can change your cookie settings through your browser.