Predicting corporate credit rating based on qualitative information of MD&A transformed using document vectorization techniques

Jinwook Choi; Yongmoo Suh; Namchul Jung

doi:10.1108/dta-08-2019-0127

Publisher: Emerald Publishing
Copyright: © Emerald Publishing Limited
ISSN: 2514-9288
DOI: 10.1108/dta-08-2019-0127

Abstract

Purpose: This study investigates how effectively qualitative information extracted from firms' annual reports predicts corporate credit ratings. In practice, qualitative information such as published reports or management interviews is known to be an important input to rating assignment, alongside quantitative information such as financial values. Nevertheless, prior studies have rarely used qualitative information when developing corporate credit rating prediction models, which leaves room for further research.

Design/methodology/approach: This study adopted three document vectorization methods, Bag-Of-Words (BOW), Word to Vector (Word2Vec) and Document to Vector (Doc2Vec), to transform unstructured textual data into numeric vectors that Machine Learning (ML) algorithms can accept as input. The experiments used a corpus of Management's Discussion and Analysis (MD&A) sections from 10-K financial reports, together with financial variables and corporate credit rating data.

Findings: Results from a series of multi-class classification experiments show that predictive models trained on both financial variables and vectors extracted from MD&A data outperform benchmark models trained only on traditional financial variables.

Originality/value: This study proposed a new approach to corporate credit rating prediction that uses qualitative information extracted from MD&A documents as an input to ML-based prediction models. It also adopted and compared three textual vectorization methods in the domain of corporate credit rating prediction and showed that BOW mostly outperformed Word2Vec and Doc2Vec.
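The design described in the abstract, turning MD&A text into a numeric vector and feeding it to a model together with financial variables, can be illustrated with a minimal bag-of-words sketch. This is not the authors' implementation: the tokenizer, the raw-count weighting, the toy sentences, the two financial ratios and all function names are assumptions made for illustration only.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and keep runs of ASCII letters as tokens."""
    return re.findall(r"[a-z]+", text.lower())


def build_vocabulary(documents):
    """Map every distinct token in the corpus to a column index.

    Tokens are sorted so the column order is deterministic.
    """
    tokens = sorted({tok for doc in documents for tok in tokenize(doc)})
    return {tok: i for i, tok in enumerate(tokens)}


def bow_vector(text, vocabulary):
    """Return raw term counts for one document, in vocabulary column order."""
    counts = Counter(tokenize(text))
    return [counts.get(tok, 0) for tok in vocabulary]


def combine(financial, text_vector):
    """Concatenate financial variables with the text vector into one feature row."""
    return list(financial) + list(text_vector)


# Hypothetical toy inputs: two MD&A snippets and two made-up financial ratios each.
mdna = [
    "Liquidity improved and debt declined.",
    "Debt increased; liquidity risk remains.",
]
financials = [[0.42, 1.8], [0.77, 0.9]]

vocab = build_vocabulary(mdna)
features = [combine(f, bow_vector(d, vocab)) for f, d in zip(financials, mdna)]

for row in features:
    print(row)
```

Each row of `features` could then be passed to any multi-class classifier. A benchmark in the spirit of the abstract would train the same classifier on the `financials` columns alone and compare the two.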

Journal

Data Technologies and Applications – Emerald Publishing

Published: Jun 2, 2020

Keywords: Corporate credit rating; Qualitative information; MD&A; Document vectorization; Machine learning; Predictive model
