Data Science For Non Life Insurance Assignmnet

Assignment Questions

You analyze the data set (in .csv) that is available on TOLEDO. This data set contains observations on the variables listed in the table printed below. Your report should document the following steps:

An exploratory data analysis.
The construction of a (technical) tariff structure for a car insurance product. Hereto you analyze both the frequency and severity information in the data with (at least) two of the methods/algorithms discussed in the lectures (GLM, GAM, regression tree, bagging, random forest, gradient boosting, : : :). You combine frequency and severity models appropriately into a technical pure premium. You compare the performance of the constructed models, based on your own defined set of criteria. You discuss the resulting (pure premium) pricing structure.
As an extra step you will discuss (and demonstrate) the calculation of a safety (or risk) loading on top of the pure premiums. To calculate these risk loadings you explore the literature on insurance pricing and propose a suitable strategy. Yang et al. (2020) is a useful starting point. There is no need to answer the above questions separately (question by question) in your report. A well structured text that covers the above items is preferred. Be creative and rigorous!

Overview of variables

ageph

age of the policyholder

CODPOSS

postal code in Belgium

duree

exposure, fraction of the year the insured is covered

lnexpo

log of exposure

nbrtotc

total number of claims during period of exposure

chargtot

total claim amount

agecar

age of the car: 0 − 1, 2 − 5, 6 − 10, > 10

sexp

sex of the policyholder: male or female

fuelc

type of fuel: petrol or gasoil

split

split of the premium: monthly, once, twice, three times per year

usec

use of the car: private or professional fleetc car belonging to a fleet: yes or no

sportc

sport car: yes or no

coverp

coverage: MTPL, MTPL+, MTPL+++

powerc

power of the car: < 66, 66-110, >110

L. Yang, Z. Li, and S. Meng. Risk loadings in classification ratemaking. https://arxiv.org/ abs/2002.01798, 2020.

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
shapefileBelgium		shapefileBelgium
.RData		.RData
.Rhistory		.Rhistory
.gitignore		.gitignore
2021-04-03 DSNonLife - Assignment 3 - Construct a motor insurance tariff.pdf		2021-04-03 DSNonLife - Assignment 3 - Construct a motor insurance tariff.pdf
Assignment.csv		Assignment.csv
Assignment.r		Assignment.r
README.md		README.md
Report.rmd		Report.rmd
Risk_Loading.pdf		Risk_Loading.pdf
Spatialized_Claim_Frequency.png		Spatialized_Claim_Frequency.png
agecar_bar.png		agecar_bar.png
ageph_hist.png		ageph_hist.png
assignment.Rproj		assignment.Rproj
belgium_shape_plot.png		belgium_shape_plot.png
coverp_bar.png		coverp_bar.png
fleetc_bar.png		fleetc_bar.png
freq_tables.rda		freq_tables.rda
fuelc_bar.png		fuelc_bar.png
inspost.xls		inspost.xls
lnamount_hist.png		lnamount_hist.png
lnchargtot_hist.png		lnchargtot_hist.png
nclaims_hist.png		nclaims_hist.png
powerc_bar.png		powerc_bar.png
sexp_bar.png		sexp_bar.png
split_bar.png		split_bar.png
sportc_bar.png		sportc_bar.png
usec_bar.png		usec_bar.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Data Science For Non Life Insurance Assignmnet

Assignment Questions

Overview of variables

ageph

CODPOSS

duree

lnexpo

nbrtotc

chargtot

agecar

sexp

fuelc

split

usec

sportc

coverp

powerc

About

Uh oh!

Releases

Packages

Languages

mvfti/DataScience

Folders and files

Latest commit

History

Repository files navigation

Data Science For Non Life Insurance Assignmnet

Assignment Questions

Overview of variables

ageph

CODPOSS

duree

lnexpo

nbrtotc

chargtot

agecar

sexp

fuelc

split

usec

sportc

coverp

powerc

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages