Access the full text.
Sign up today, get DeepDyve free for 14 days.
—In this article we consider a fundamentally new information-theoretic approach to the classification of scientific texts based on compression algorithms. An analysis using the example of the comparative classification of full-text documents from arXiv.org and short annotations from Scopus showed that the accuracy of the proposed method is 87–92% and, in general, is not inferior to the existing ones. These conclusions were confirmed by an expert assessment.
Automatic Documentation and Mathematical Linguistics – Springer Journals
Published: Jul 1, 2021
Keywords: text classification methods; data compression algorithms; scientific texts; arXiv.org; Scopus; k-nearest neighbors; logistic regression; random forests; naive Bayesian classification; support vector machines
Read and print from thousands of top scholarly journals.
Already have an account? Log in
Bookmark this article. You can see your Bookmarks on your DeepDyve Library.
To save an article, log in first, or sign up for a DeepDyve account if you don’t already have one.
Copy and paste the desired citation format or use the link below to download a file formatted for EndNote
Access the full text.
Sign up today, get DeepDyve free for 14 days.
All DeepDyve websites use cookies to improve your online experience. They were placed on your computer when you launched this website. You can change your cookie settings through your browser.