Categories
- ACTUARIAL DATA SCIENCE
- AFIR / ERM / RISK
- ASTIN / NON-LIFE
- BANKING / FINANCE
- CORONA SPECIAL
- DIVERSITY & INCLUSION
- EDUCATION
- HEALTH
- IACA / CONSULTING
- LIFE
- PENSIONS
- PROFESSIONALISM
- MISC
Cluster analysis is the task of grouping a set of objects (e.g., observations, policies, claims) in such a way that objects in the same group (called a cluster) are more similar to each other than to those in other groups. In contrast to simple segmentation (e.g. by geographical location only), clustering uses several features to differentiate among those groups. Potential applications are manifold and centred around questions such as, for example:
The course shows how different algorithms can be used to obtain a segmentation of insurance data. The methods covered range from centroid-based (k-means, k-prototypes) to probabilistic (Gaussian Mixture Models) and density-based (DBSCAN) approaches. We demonstrate how the clustering results can be visualized and evaluated. Moreover, it will be shown how the clustering results can be used to identify outliers in the data set. We also cover techniques that reduce the dimension of the data so that the segments can be computed either on aggregated information or using only a subset of the available information. The course puts an emphasis on the practical application and therefore showcases all concepts on an insurance data set.
0 Comments
There are no comments yet. Add a comment.