pc-pricing-tutorial icon indicating copy to clipboard operation
pc-pricing-tutorial copied to clipboard

Data granularity

Open kevinykuo opened this issue 7 years ago • 3 comments

Some of the exposures are large, but they might actually be individual policies with many vehicles. Will have to investigate/ask.

kevinykuo avatar Dec 17 '18 17:12 kevinykuo

I believe that every record is not an individual risk. Each row is the unique combination of vehicle_category_code, region_code, vehicle_code, sex_code, age_code, and vehicle_year.

image

TylerGrantSmith avatar Dec 19 '18 04:12 TylerGrantSmith

I agree. Also, each individual exposure can count to up to 0.5, as the database is for a 6-month period (even though the vast majority of auto policies in Brazil are annual.)

rafaelcosta1 avatar Dec 19 '18 04:12 rafaelcosta1

The aggregated rows shouldn't make a difference to the output of a GLM... What worries me a bit more is that some of the rows with the largest exposures seem to have a different premium rate than the single rows.

RonRichman avatar Dec 28 '18 12:12 RonRichman