TARIFF ANALYSIS OF MOTOR INSURANCE USING GENERALIZED LINEAR MODEL (GLM) AND GRADIENT BOOSTING MACHINE (GBM)

Authors

  • Yunike Jemis Fifnelavindy Alsitaningtyas Dept. of Mathematics, Universitas Gadjah Mada, Yogyakarta, Indonesia
  • Hubbi Muhammad Department of Mathematics, Universitas Pamulang, Serang, Indonesia
  • Adhitya Ronnie Effendie Dept. of Mathematics, Universitas Gadjah Mada, Yogyakarta, Indonesia

DOI:

https://doi.org/10.25077/jmua.15.1.78-94.2026

Keywords:

Gradient Boosting Machine, Generalized Linear Model, Insurance Premium

Abstract

he insurance sector operates by managing the transfer of risk from policyholders to insurance providers, where premiums are charged as compensation for the assumed risk. Traditionally, premium determination in motor vehicle insurance relies on the Generalized Linear Model (GLM), which requires the response variable to follow a distribution from the exponential family and may have limitations in capturing non-linear relationships and complex interactions among rating factors. To address these limitations, this study compares the performance of the Generalized Linear Model (GLM) and the Gradient Boosting Machine (GBM) in modeling claim frequency and claim severity for motor vehicle insurance premiums. The analysis is conducted using an insurance dataset obtained from a public data repository, and both models are evaluated using K-Fold Cross Validation. Model performance is assessed based on the Root Mean Square Error (RMSE), which measures the average magnitude of prediction errors and is commonly used to evaluate predictive accuracy. The results indicate that the GBM consistently produces lower RMSE values than the GLM for both claim frequency and claim severity modeling, indicating superior predictive performance. However, despite its higher accuracy, the GBM model lacks the interpretability inherent in the GLM framework, which remains crucial for transparency and regulatory considerations in insurance premium determination. These findings suggest that while GBM is effective for improving prediction accuracy, GLM remains valuable for interpretability, and a complementary use of both approaches may provide optimal results in actuarial pricing applications

References

1] Agresti A, Foundations Linear Generalized Linear Models, Cambridge: John Wiley & Sons, Inc, 2015.

[2] Breiman, L., Friedman, J., Stone, C. J., & Olshen, R. A, Classification and regression trees, CRC press, 1984.

[3] Hastie, T., Tibshirani, R., & Friedman, J, The elements of statistical learning: Data mining, inference, and prediction, Springer Science & Business Media, 2009.

[4] Jong, P D, Heller GZ, Generalized Linear Models for Insurance Data, Cambridge: Cambridge University Press, 2008.

[5] Nelder, J.A., dan Verrall, R.J., Credibility Theory and Generalized Linear Models, Astin Bulletin 27(1), London, 1997.

[6] Ohlsson, E., Johanson, B., Non-Life Insurance Pricing with Generalized Linear Models, Springer, Berlin, 2010.

[7] Rosenlund, S., Integrating Ordinary GLM with Credibility in a Compound Poisson Model, Stockholm, Swedia, 2013.

[8] Tweedie M, An Index Which Distinguishes Between Some Important Exponential Families, The Statistics: Applications and New Directions, Di dalam: Proc. Indian statistical institute golden Jubilee Internasional conference, 1984.

[9] Garrido J, Genest C, Schulz J, Generalized Linear Models for Dependent Frequency and Severity of Insurance Claims, Insur Math Econ. 70:205–215, 2016.

[10] Henckaerts, R. dkk., Boosting insights in insurance tariff plans with tree-based machine learning method, North American Actuarial Journal. 25(2): 255-285, 2021.

[11] Li N, Peng X, Kawaguchi E, Suchard MA, Li G, A Scalable Surrogate L0 Sparse Regression Method for Generalized Linear Models with Applications to Large Scale Data, J Stat Plan Inference. 213:262–281, 2020.

[12] “Car Insurance Claim (Cleaned),” Data.World, 2025. [Online]. Available: https://data.world/saleem786/car-insurance-claims-analysis/workspace/file?filename=Car+Insurance+Claim+(Cleaned).xlsx. [Accessed: 08-02-2023]

Downloads

Published

26-01-2026

Issue

Section

Articles