Using machine learning methods to predict financial performance: Does disclosure tone matter?

Research output: Contribution to journalArticlepeer-review

Standard Standard

Using machine learning methods to predict financial performance: Does disclosure tone matter? / Mousa, Gehan ; Elamir, Elsayed; Hussainey, Khaled.
In: International Journal of Disclosure and Governance , Vol. 19, 03.2022, p. 93–112.

Research output: Contribution to journalArticlepeer-review

HarvardHarvard

Mousa, G, Elamir, E & Hussainey, K 2022, 'Using machine learning methods to predict financial performance: Does disclosure tone matter?', International Journal of Disclosure and Governance , vol. 19, pp. 93–112. https://doi.org/10.1057/s41310-021-00129-x

APA

CBE

MLA

Mousa, Gehan , Elsayed Elamir, and Khaled Hussainey. "Using machine learning methods to predict financial performance: Does disclosure tone matter?". International Journal of Disclosure and Governance . 2022, 19. 93–112. https://doi.org/10.1057/s41310-021-00129-x

VancouverVancouver

Mousa G, Elamir E, Hussainey K. Using machine learning methods to predict financial performance: Does disclosure tone matter? International Journal of Disclosure and Governance . 2022 Mar;19:93–112. Epub 2021 Sept 5. doi: 10.1057/s41310-021-00129-x

Author

Mousa, Gehan ; Elamir, Elsayed ; Hussainey, Khaled. / Using machine learning methods to predict financial performance: Does disclosure tone matter?. In: International Journal of Disclosure and Governance . 2022 ; Vol. 19. pp. 93–112.

RIS

TY - JOUR

T1 - Using machine learning methods to predict financial performance: Does disclosure tone matter?

AU - Mousa, Gehan

AU - Elamir, Elsayed

AU - Hussainey, Khaled

PY - 2022/3

Y1 - 2022/3

N2 - We use three supervised machine learning methods, namely linear discriminant analysis, quadratic discriminant analysis, and random forest, to predict corporate financial performance. We use a sample of 63 listed banks from eight emerging markets, covering 10 years from 2008 to 2017, using earning per share as a measure of performance. We use the design science research (DSR) framework to examine whether the textual contents of annual reports in previous years contain value-relevant information to predict future performance; thus, these contents can improve the accuracy and quality of predictive models. We combine two groups of variables in the proposed models. The first group is the sentiment analysis of disclosure tone in annual report narratives using the Loughran and McDonald dictionary (J Finance 66:35–65, 2011), while the second group is the quantitative properties of banks which consist of five variables, namely size, financial leverage, age, market-to-book ratio, and risk. Our analysis suggests that the random forest method provides the best predictive model. We also provide evidence on the accuracy and performance of predictive models that can be increased by incorporating disclosure tone variables as non-financial variables with financial variables. Interestingly, we find that the uncertainty variable is the most important disclosure tone variable. Finally, we find that size is the most important variable related to banks’ quantitative characteristics. Our study suggests that the analysis of tone through corporate narrative disclosures can be used as a complementary or diagnostic approach rather than an alternative in making decisions by different stakeholders such as analysts, investors, and auditors.

AB - We use three supervised machine learning methods, namely linear discriminant analysis, quadratic discriminant analysis, and random forest, to predict corporate financial performance. We use a sample of 63 listed banks from eight emerging markets, covering 10 years from 2008 to 2017, using earning per share as a measure of performance. We use the design science research (DSR) framework to examine whether the textual contents of annual reports in previous years contain value-relevant information to predict future performance; thus, these contents can improve the accuracy and quality of predictive models. We combine two groups of variables in the proposed models. The first group is the sentiment analysis of disclosure tone in annual report narratives using the Loughran and McDonald dictionary (J Finance 66:35–65, 2011), while the second group is the quantitative properties of banks which consist of five variables, namely size, financial leverage, age, market-to-book ratio, and risk. Our analysis suggests that the random forest method provides the best predictive model. We also provide evidence on the accuracy and performance of predictive models that can be increased by incorporating disclosure tone variables as non-financial variables with financial variables. Interestingly, we find that the uncertainty variable is the most important disclosure tone variable. Finally, we find that size is the most important variable related to banks’ quantitative characteristics. Our study suggests that the analysis of tone through corporate narrative disclosures can be used as a complementary or diagnostic approach rather than an alternative in making decisions by different stakeholders such as analysts, investors, and auditors.

U2 - 10.1057/s41310-021-00129-x

DO - 10.1057/s41310-021-00129-x

M3 - Article

VL - 19

SP - 93

EP - 112

JO - International Journal of Disclosure and Governance

JF - International Journal of Disclosure and Governance

SN - 1741-3591

ER -