Modeling auto insurance frequency using K-means and mixture regression

سال انتشار: 1402
نوع سند: مقاله ژورنالی
زبان: انگلیسی
مشاهده: 22

فایل این مقاله در 17 صفحه با فرمت PDF قابل دریافت می باشد

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

JR_JMMF-3-2_006

تاریخ نمایه سازی: 19 اسفند 1402

چکیده مقاله:

Given the importance of policyholder classification in helping to make a good decision in predicting optimal premiums for actuaries.This paper proposes, first, an optimal construction of policyholder classes. Second, Poisson-negative Binomial mixture regression model is proposed as an alternative to deal with the overdispersion of these classes.The proposed method is unique in that it takes Tunisian data and classifies the insured population based on the K-means approach which is an unsupervised machine learning algorithm. The choice of the model becomes extremely difficult due to the presence of zero mass in one of the classes and the significant degree of overdispersion. For this purpose, we proposed a mixture regression model that leads us to estimate the density of each class and to predict its probability distribution that allows us to understand the underlying properties of our data. In the learning phase, we estimate the values of the model parameters using the Expectation-Maximization algorithm. This allows us to determine the probability of occurrence of each new insured to create the most accurate classification. The goal of using mixed regression is to get as heterogeneous a classification as possible while having a better approximation. The proposed mixed regression model, which uses a number of factors, has been evaluated on different criteria, including mean square error, variance, chi-square test and accuracy. According to the experimental findings on several datasets, the approach can reach an overall accuracy of ۸۰%. Then, the application on real Tunisian data shows the effectiveness of using the mixed regression model.

نویسندگان

Maryem Jaziri

University of Sfax , Probability and Statistics Laboratory

Afif Masmoudi

University of Sfax , Probability and Statistics Laboratory

مراجع و منابع این مقاله:

لیست زیر مراجع و منابع استفاده شده در این مقاله را نمایش می دهد. این مراجع به صورت کاملا ماشینی و بر اساس هوش مصنوعی استخراج شده اند و لذا ممکن است دارای اشکالاتی باشند که به مرور زمان دقت استخراج این محتوا افزایش می یابد. مراجعی که مقالات مربوط به آنها در سیویلیکا نمایه شده و پیدا شده اند، به خود مقاله لینک شده اند :
  • A. E. Renshaw, Modelling the Claims Process in the Presence ...
  • A. Dempster, N. Laird, and D. Rubin, Maximum likelihood from ...
  • C. Akantziliotou, R. A. Rigby, and D. M. Stasinopoulos, A ...
  • S. Aryuyuen and W. Bodhisuwan, The negative binomial-generalized exponential (NB-GE)distribution, ...
  • J. Del Castillo and M. Perez-Casany ´ , Overdispersed and ...
  • T. C. Fung, A. L. Badescu, and X. S. Lin, ...
  • T. C. Fung, A. L. Badescu, and X. S. Lin, ...
  • G. Maclachlan and T. Krishnan, The EM Algorithm and Extensions, ...
  • J. Garrido, C. Genest, and J. Schulz, Generalized linear models ...
  • H. Zamani and N. Ismail, Negative Binomial-Lindley Distribution And Its ...
  • Mathematics And Statistics, ۱, ۴۹, (۲۰۱۰) ...
  • J. F. Lawless, Negative binomial and mixed Poisson regression, The ...
  • D. Lukasz, L. Mathias, and M. V. Wuthrich ¨ , ...
  • L. Simon, Fitting negative binomial distribution by the method of ...
  • Casualty Actuarial Society, ۱۷, ۴۵-۵۳, (۱۹۶۱) ...
  • L. Breiman, J. Friedman, R. Olshen, and C. Stone, Classification ...
  • B. G. Lindsay, Mixture Models: Theory, Geometry, and Applications, NSF-CBMS ...
  • L. Breiman, Random forests, Machine Learning, ۴۵, ۵-۳۲, (۲۰۰۱) ...
  • M. Aitkin, D. Anderson, B. Francis, and J. Hinde, Statistical ...
  • G. McLachlan and D. Peel, Finite Mixture Models, Wiley Series ...
  • M. Lichman, UCI Machine Learning Repository, http://archive.ics.uci.edu/ml, (۲۰۱۳) ...
  • N. E. Breslow, Extra-Poisson Variation in Log-Linear Models, Journal of ...
  • R. Verbelen, L. Gong, K. Antonio, A. Badescu, and S. ...
  • S. M. Goldfeld and R. E. Quandt, A Markov model ...
  • S. C. K. Lee and X. Sheldon Lin, Modeling Dependent ...
  • P. Shi and E. A. Valdez, Multivariate negative binomial models ...
  • K. F. Sellers and A. Raim, A flexible zero-inflated model ...
  • S. C. Tseung, A. Badescu, T. C. Fung, and X. ...
  • W. Jiang and M. A. Tanner, On the Approximation Rate ...
  • K. K. Yau, K. Wang, and A. H. Lee, Zero-Inflated ...
  • Y. Lv, S. Tang, and H. Zhao, Real-time Highway Traffic ...
  • H. Zamani, P. Faroughi, and N. Ismail, Bivariate generalized Poisson ...
  • P. Zhang, D. Pitt, and X. Wu, A new multivariate ...
  • نمایش کامل مراجع