Analysing Health Insurance Customer Dataset to Determine Cross-Selling Potential

Authors

  • Khulekani Mavundla
  • Surendra Thakur

Keywords:

Cross-Selling, Machine Learning algorithms, Health Insurance, Prediction, Model training

Abstract

Health insurance cross-selling refers to the practice of offering additional or complementary insurance products to existing policyholders. Insurance providers leverage cross-selling, offering customers additional policies like dental or life insurance when they already have a basic health insurance plan. This study is conducted to focus on the application of machine learning techniques to predict health insurance cross-selling opportunities among South African customers. The aim of this study is to develop a cross-selling predictive machine learning model that can assist health insurance companies to identify potential customers for cross-selling probabilities. To achieve this goal, a quantitative research methodology is adopted, focusing on extracting a comprehensive dataset of health insurance consumer information and employing various machine learning algorithms using the Python programming language, including Random Forest, K-Nearest Neighbours, XGBoost classifier, and Logistic Regression algorithms to build the cross-selling predictive machine learning model. The experimental results obtained demonstrate the accuracy scores of four different machine learning algorithms trained using 1,000,000 customer dataset with 17 features, logistic regression is considered as the top-performing model. It achieved an accuracy score of 0.83 and an F1 score of 0.91. The analysis indicates that customers aged 30-60, with prior insurance, and longer service history are more likely to buy additional health insurance products. The findings of this research can help health insurers boost revenue by improving their customer targeting and retention strategies.

https://doi.org/10.59200/ICARTI.2023.031

Downloads

Published

2023-12-10