News Sentiment Analysis and Credit Risk
Do we still need humans to monitor and interpret qualitative data for our risk models?

Recent technological advances increase the possibility of using qualitative data in risk models to ensure a timelier recognition of threats. News articles, which can be seen as a type of unstructured data, are flooding the world every day. However, one can imagine the time it would take to manually process all this information. Recent developments in natural language processing (NLP) show some very promising results in automating that task by a computer. We assess the possibilities of these recent advances within credit risk management.
Monitoring borrowers is one of the most important tasks for financial institutions to correctly assess credit risk. This is often done using credit ratings, that have been developed using quantitative methods on financial data. Since key financial data is often updated yearly through annual reports, the question arises of whether threats can be recognized on time. News headlines appear on a much more frequent basis than annual reports, with Bloomberg bureaus alone covering around 10000 news messages daily.
Keeping in mind that news must be carefully filtered from noise, the use of news headlines sentiment in a credit risk setting is an example of how unstructured data can be used. The tone of news headlines says a lot about how investors perceive the market. The field of behavioral finance shows us how important this perception can be in the decision-making process of investors. They often act upon perceptions gained from consuming media. This makes news headlines incredibly powerful, as they not only influence our opinion, but also indirectly influence reality itself. Sentiment can therefore be described as the way that the information is perceived by the market. A timely assessment of this sentiment is useful for pricing financial instruments, a better understanding of the state of the economy and of the risk associated with a counterparty. The rise in interest in NLP models of the past decade plays an important role in retrieving a sentiment value, as the more sophisticated models can interpret complicated news headlines.
Measuring News Sentiment with NLP Methods
One of the most traditional approaches in extracting sentiment from text is the lexicon-based method (LM). In this method, a labeled dictionary with positive and negative words is applied to a snippet of text and the sum of those values classifies this snippet as positive or negative. In a financial context, such a dictionary is provided by Loughran and McDonald (2011) and contains more than 4000 labeled words constructed from a financial point of view.
Although the method described above is simple and easy to use, its disadvantage is that it does not capture the full nuance of words. Linguistics is a very complicated subject that takes the human brain years to master. The most intuitive example is the fact that the words 'dog' and 'cat' would be considered as two completely different words within the lexicon method, whereas we would like to at least consider the fact that they are both animals and are in that sense similar to each other. Deep learning methods can help in understanding this with the use of so-called embeddings. These are representations for text where words that have the same meaning have a similar representation. Combining this with an NLP method such as Google's BERT (Bidirectional Encoder Representations from Transformers), one can combine this broader knowledge of the language with sentiment weights for sentences so that the model learns when to interpret a certain sentence as positive and when as negative, using neural networks. A neural network is a series of algorithms that endeavors to recognize underlying relationships in a dataset and mimics the functions of a human brain. Neural networks are even more powerful in computational terms than the human brain and can learn a language in half an hour rather than taking many years.
News Sentiment as Early Warning System
News sentiment is already being used a lot in stock price forecasting. Studies have shown that there exists some correlation between the sentiment of de-noised news and the direction of stock prices. Compared to equity markets, the use of NLP within risk management is relatively new. A lot of financial institutions focus on a stable outlook of creditworthiness of a counterparty using so called Through-The-Cycle models. As these models do not fluctuate according to current changes in macro-economic circumstances, they provide a long-term outlook of creditworthiness. The main drawback of these models is that they are infrequently updated. If a company defaults during a year where the credit rating indicated a low probability of default, the question arises if such an event could be foreseen. Forward looking models could give an indication of the current fluctuations in creditworthiness, however it is difficult to assess if these fluctuations are rational or caused by irrational behavior in the market. An early warning system that could interpret current fluctuations and indicate where these fluctuations originate could reduce the number of unforeseen defaults.
The application of NLPs to real-time news items can be used as such an early warning system. By automatically processing thousands of news items for all counterparties, we can distinguish the importance of the accompanied sentiment over a given time span for relevant news messages. When a counterparty received a lot of negative news coverage, indicating a reduction in creditworthiness, an early warning system could give a sign to further investigate the current rating and, if necessary, advise that it should be adjusted accordingly. This solution is particularly useful for financial institutions with a large portfolio of counterparties, where not every single party can be manually assessed.
Conclusion
The possibilities to use unstructured data with the surge in NLP technology are endless. With more advanced techniques being introduced that are able to understand the complexity of financial news, a timelier assessment of credit risk with the use of an early warning system is a great example of how these techniques can be helpful. Although we are far away from complete automation of risk detection, such an early warning system in combination with expert judgement is a great next step.

Sentiment analysis and Zanders
Zanders has been researching the use of forward-looking credit risk models for some time now. Recent research showed promising results in forecasting credit rating changes with the sentiment analysis as described above. A tool has been developed that uses Google's BERT model to process news headlines for individual counterparties and convert them to a sentiment score.
New research is currently being conducted, building on the findings that there is some correlation between aggregated sentiment for a certain counterparty and a transition in credit rating. This research focuses on developing an industry sentiment measure such that counterparties that are not mentioned in the news headlines can still receive a sentiment score and an associated early warning indicator.
Contact
We are looking forward to sharing the insights on to how to use this innovative technique within credit risk models. For more information, please contact Martijn de Groot or Kilian Wessels at +31 35 692 89 89.
Share this article: