cdcfluview icon indicating copy to clipboard operation
cdcfluview copied to clipboard

pi_mortality data is doubled

Open philipshirk opened this issue 2 years ago • 0 comments

each row in pi_mortality data is duplicated.

pimn <- cdcfluview::pi_mortality(coverage_area = 'national')
nrow(pimn) / sum(base::duplicated(pimn))
pimr <- cdcfluview::pi_mortality(coverage_area = 'region')
nrow(pimr) / sum(base::duplicated(pimr))
(pims <- cdcfluview::pi_mortality(coverage_area = 'state'))
nrow(pims) / sum(base::duplicated(pims))

returns 2 for each dataset.

The problem originates in CDC's data: https://gis.cdc.gov/grasp/flu7/GetPhase07InitApp?appVersion=Public which has 2 entries for each of nchs_mapcode 1 and 2.

And the warnings from left_join are hidden, so the user is not alerted to the issue.

philipshirk avatar Nov 17 '23 15:11 philipshirk