Risk
Apr 15, 2025
Learn how Credolab’s ML credit scoring models use alternative data to improve credit scoring
MD Americas, Chief Strategy Officer
In Part 1, we explored the limitations of traditional credit scoring and the rise of alternative data as a complementary way to improve risk assessment. However, data asymmetry still poses a challenge, creating blind spots that limit predictive accuracy.
In Part 2, we will explore how Credolab’s innovative solutions use alternative data and machine learning (ML) to modernise credit scoring.
Lenders can bridge the gap using device and behavioural ML-driven insights to improve risk visibility and refine real-time borrower assessments.
Using alternative data in credit scoring can identify more nuanced behavioural patterns that predict an applicant’s willingness to repay a loan. A risk score built on alternative data, such as Credolab's behavioural risk score, can help improve the assessment of the first two questions in the Five Cs of Credit framework:
1. Character
2. Capacity
The table below illustrates how behavioural indicators (BIs) from alternative data can more effectively assess Character and Capacity. It also examines how BIs affect traditional methods compared to modern tools, providing a deeper and more holistic understanding of borrower risk.
In addition to BIs, Credolab uses Statistical indicators (SIs), features engineered from about 80,000 data points (containing raw metadata) collected by Credolab with the user’s consent and transformed into nearly 11 million features through a proprietary feature engine. These features provide quantitative proof that these behavioural patterns statistically predict defaults.
Furthermore, Credolab's data modelling pipeline filters 11 million features to identify the top 30 to 50 with the highest predictive power for defaults. Net Logistic Regression finalises the analysis by ranking features by Information Value (IV), correlation with each other, and stability over time.
This approach ensures that only the best features, consistently predictive of repayment behaviour across populations, are included in the final alternative score. Meanwhile, metrics like the Gini Coefficient, Kolmogorov-Smirnov (KS) statistic, and AUC/ROC confirm the model’s ability to distinguish “Good” vs “Bad” borrowers.
Individually, BIs and SIs are powerful tools that can impact the assessment of a borrower’s Character and Capacity and the resulting credit decision. However, their combined power takes things a step further. BIs explain the intuitive “why” behind credit risk (e.g. capturing traits like reliability, integrity, and financial habits), and SIs prove the “how”, demonstrating that these behaviours have measurable, predictive value for defaults.
Together, they:
Even in the absence of credit history, Credolab’s behavioural data (device and behavioural metadata) can assess an applicant's financial responsibility and capacity to take on new debt, tackling Character and Capacity in the Five Cs framework, respectively. In doing so, Credolab helps lenders fairly assess every applicant, even those that traditional underwriting models would have rejected.
The ideal approach is a hybrid risk model that combines the best of both worlds and leverages Credolab’s alternative scores as input into a lender's general risk model.
Credolab’s unique methodology leverages proprietary SDKs embedded in the lender’s mobile app and online application form to collect privacy-consented, depersonalised and anonymised device and behavioural metadata. Transformed into features first and scores second, Credolab’s risk solutions supercharge traditional risk assessment with a 100% hit rate. By effectively scoring all applicants, including thin-files and new-to-credit individuals, Credolab can:
The ideal credit scoring model combines automation, diverse data sources, and adaptability to deliver accurate and inclusive risk assessment. It should also leverage machine learning to continuously learn, identify new patterns, and refine decision-making. However, the accuracy and reliability of an ML-driven model depend on the quality of the data used to train it.
This is where alternative data, such as Credolab’s device and behavioural data, plays a critical role. By complementing traditional data sources, alternative data enables deeper, more nuanced insights into borrower behaviour.
As Andre Ripla, PgCert, explains:
“By leveraging ML, natural language processing, and other AI technologies, organisations can process vast amounts of data, identify complex patterns, and make predictive analyses that were previously impossible or impractical using traditional methods.”
Credolab’s ML-driven credit scoring models are designed to help lenders succeed in today’s data-driven world. This proven approach ensures lenders can access scorecards tailored to each specific loan product and origination channel: Android app, iOS app, mobile web, and web.
ML applied to alternative data offers a novel approach to credit risk assessment that translates into tangible benefits for lenders.
With Credolab, lenders can identify hidden behavioural patterns and improve their accuracy in assessing risk for every borrower, not just thin-files. Here are case studies to prove it:
1. Neobank in The Philippines
2. BNPL in the United Kingdom
3. Short-term loans in Mexico
4. Short-term loans in Brazil
5. Consumer loans in Colombia
While Credolab’s solutions are often associated with risk assessment, their applications extend far beyond. Here are a few ways organisations can leverage Credolab’s products and solutions:
A case study example would be how a telecom company could use Credolab’s risk scores to identify customers likely to default on their bills, enabling proactive interventions to reduce losses. Similarly, an e-commerce platform could use these scores to offer tailored payment options, improving customer satisfaction and retention.
The future of risk management lies in data-driven decision-making. As traditional credit scoring methods show their limitations, organisations must embrace innovative solutions on top of existing models to stay competitive in a rapidly changing financial landscape.
Staying ahead in risk management demands smarter tools and better data sources in an era of rapid financial transformation. Using alternative data powered by Credolab’s proven technology offers a path to more inclusive, accurate, and efficient credit scoring.
Credolab, as a leading charge in this transformation, provides tools to minimise risks, reduce losses and costs and unlock opportunities in every market, paving the way for a more equitable financial future. By leveraging alternative data and ML, Credolab is redefining credit scoring and shaping the future of risk management.
Ready to modernise your risk management processes? Explore Credolab’s risk solutions today and see how alternative data and ML can help you excel in a rapidly changing financial landscape.