Access the full text.
Sign up today, get DeepDyve free for 14 days.
Zahn Bozanic, Lin Cheng, Tzachi Zach (2016)
Soft Information in Loan AgreementsJournal of Accounting, Auditing & Finance, 33
Tomas Mikolov, Ilya Sutskever, Kai Chen, G. Corrado, J. Dean (2013)
Distributed Representations of Words and Phrases and their Compositionality
Sean Humpherys, Kevin Moffitt, M. Burns, J. Burgoon, William Felix (2011)
Identification of fraudulent financial statements using linguistic credibility analysisDecis. Support Syst., 50
Thomas Dietterich (1998)
Approximate Statistical Tests for Comparing Supervised Classification Learning AlgorithmsNeural Computation, 10
S. Bonsall, B. Miller (2017)
The impact of narrative disclosure readability on bond ratings and the cost of debtReview of Accounting Studies, 22
Zan Huang, Hsinchun Chen, Chia-Jung Hsu, Wun-Hwa Chen, Soushan Wu (2004)
Credit rating analysis with support vector machines and neural networks: a market comparative studyDecis. Support Syst., 37
Kyoung-jae Kim, Hyunchul Ahn (2012)
A corporate credit rating model using multi-class support vector machines with an ordinal pairwise partitioning approachComput. Oper. Res., 39
C. Frost (2006)
Credit Rating Agencies in Capital Markets: A Review of Research Evidence on Selected Criticisms of the AgenciesJournal of Accounting, Auditing & Finance, 22
Wun-Hwa Chen, J. Shih (2006)
A study of Taiwan's issuer credit rating systems using support vector machinesExpert Syst. Appl., 30
C. Luo, Desheng Wu, Dexiang Wu (2017)
A deep learning approach for credit scoring using credit default swapsEng. Appl. Artif. Intell., 65
P. Hájek, R. Henriques (2017)
Mining corporate annual reports for intelligent detection of financial statement fraud - A comparative study of machine learning methodsKnowl. Based Syst., 128
Z. Bozanic, P. Kraft (2014)
Qualitative disclosure and credit analysts' soft rating adjustments
Wei Dong, S. Liao, Liang Liang (2016)
Financial Statement Fraud Detection using Text Mining: a Systemic Functional Linguistics Theory Perspective
Ronen Feldman, S. Govindaraj, J. Livnat, Benjamin Segal (2009)
Management's Tone Change, Post Earnings Announcement Drift and AccrualsAccounting
C. Yeh, F. Lin, C. Hsu (2012)
A hybrid KMV model, random forests and rough set theory approach for credit ratingKnowl. Based Syst., 33
Jey Lau, Timothy Baldwin (2016)
An Empirical Evaluation of doc2vec with Practical Insights into Document Embedding Generation
Tomas Mikolov, Wen-tau Yih, G. Zweig (2013)
Linguistic Regularities in Continuous Space Word Representations
Management Science, 61
Ronen Feldman, S. Govindaraj, J. Livnat, Benjamin Segal (2010)
Management’s tone change, post earnings announcement drift and accrualsReview of Accounting Studies, 15
J. Horrigan (1966)
DETERMINATION OF LONG-TERM CREDIT STANDING WITH FINANCIAL RATIOSJournal of Accounting Research, 4
Stephen Brown, J. Tucker (2011)
Large-Sample Evidence on Firms’ Year-Over-Year MD&A ModificationsJournal of Accounting Research, 49
Quoc Le, Tomas Mikolov (2014)
Distributed Representations of Sentences and Documents
Ming-Feng Tsai, Chuan-Ju Wang, Po-Chuan Chien (2016)
Discovering Finance Keywords via Continuous-Space Language ModelsACM Trans. Manag. Inf. Syst., 7
Sunita Goel, Özlem Uzuner (2016)
Do Sentiments Matter in Fraud Detection? Estimating Semantic Orientation of Annual ReportsIntell. Syst. Account. Finance Manag., 23
Liang Yao, Yin Zhang, Baogang Wei, Zherong Li, Xiangzhou Huang (2016)
Traditional Chinese medicine clinical records classification using knowledge-powered document embedding2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)
Volkan Muslu, S. Radhakrishnan, K. Subramanyam, D. Lim (2015)
Forward-Looking MD&A Disclosures and the Information EnvironmentCapital Markets: Market Efficiency eJournal
Donghwa Kim, Deokseong Seo, Suhyoun Cho, Pilsung Kang (2019)
Multi-co-training for document classification using various document representations: TF-IDF, LDA, and Doc2VecInf. Sci., 477
B. Lehmann (2003)
Is it Worth the While? The Relevance of Qualitative Information in Credit RatingBanking & Financial Institutions eJournal
You-Shyang Chen, Ching-Hsue Cheng (2013)
Hybrid models based on rough set classifiers for setting credit rating decision rules in the global banking industryKnowl. Based Syst., 39
William Mayew, Mani Sethuraman, M. Venkatachalam (2015)
MD&A Disclosure and the Firm's Ability to Continue as a Going ConcernThe Accounting Review, 90
Orie Barron, Charles Kile, T. O'keefe (1999)
MD&A Quality as Measured by the SEC and Analysts' Earnings Forecasts*Contemporary Accounting Research, 16
R. West (1970)
An Alternative Approach to Predicting Corporate Bond RatingsJournal of Accounting Research, 8
Tomas Mikolov, Kai Chen, G. Corrado, J. Dean (2013)
Efficient Estimation of Word Representations in Vector Space
Standard & Poor's (2018)
Guide to credit rating essentials
P. Hájek, Krzysztof Michalak (2013)
Feature selection in corporate credit rating predictionKnowl. Based Syst., 51
Young-Chan Lee (2007)
Application of support vector machines to corporate credit rating predictionExpert Syst. Appl., 33
Fernando Enríquez, J. Jiménez, Tomás López-Solaz (2016)
An approach to the use of word embeddings in an opinion classification taskExpert Syst. Appl., 66
Feng Li (2010)
The Information Content of Forward-Looking Statements in Corporate Filings—A Naïve Bayesian Machine Learning ApproachJournal of Accounting Research, 48
R. Kaplan, Gabriel Urwitz (1979)
Statistical Models of Bond Ratings: A Methodological InquiryThe Journal of Business, 52
The purpose of this study is to investigate the effectiveness of qualitative information extracted from firm’s annual report in predicting corporate credit rating. Qualitative information represented by published reports or management interview has been known as an important source in addition to quantitative information represented by financial values in assigning corporate credit rating in practice. Nevertheless, prior studies have room for further research in that they rarely employed qualitative information in developing prediction model of corporate credit rating.Design/methodology/approachThis study adopted three document vectorization methods, Bag-Of-Words (BOW), Word to Vector (Word2Vec) and Document to Vector (Doc2Vec), to transform an unstructured textual data into a numeric vector, so that Machine Learning (ML) algorithms accept it as an input. For the experiments, we used the corpus of Management’s Discussion and Analysis (MD&A) section in 10-K financial reports as well as financial variables and corporate credit rating data.FindingsExperimental results from a series of multi-class classification experiments show the predictive models trained by both financial variables and vectors extracted from MD&A data outperform the benchmark models trained only by traditional financial variables.Originality/valueThis study proposed a new approach for corporate credit rating prediction by using qualitative information extracted from MD&A documents as an input to ML-based prediction models. Also, this research adopted and compared three textual vectorization methods in the domain of corporate credit rating prediction and showed that BOW mostly outperformed Word2Vec and Doc2Vec.
Data Technologies and Applications – Emerald Publishing
Published: Jun 2, 2020
Keywords: Corporate credit rating; Qualitative information; MD&A; Document vectorization; Machine learning; Predictive model
Read and print from thousands of top scholarly journals.
Already have an account? Log in
Bookmark this article. You can see your Bookmarks on your DeepDyve Library.
To save an article, log in first, or sign up for a DeepDyve account if you don’t already have one.
Copy and paste the desired citation format or use the link below to download a file formatted for EndNote
Access the full text.
Sign up today, get DeepDyve free for 14 days.
All DeepDyve websites use cookies to improve your online experience. They were placed on your computer when you launched this website. You can change your cookie settings through your browser.