dataverse-client-r icon indicating copy to clipboard operation
dataverse-client-r copied to clipboard

Guidance on downloading and reading multiple files (shp)

Open kuriwaki opened this issue 2 years ago • 0 comments

Something to think about:

Shapefiles are a common example of case for the user needing to load multiple files at once. shapefiles come as a set of multiple files like this example:

image

(from https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/IU8RQK by @mmukaigawara

In R, we would read this in with sf::st_read("iraq_districts") to refer to the whole set of files that start in "iraq_districts". But doing this with dataverse::get_* in one step is not currently possible

Other datasets like VEST precinct data upload a zipped .zip file instead of the components separately.

What should the guidance be? It is probably too complicated to allow get_* to take a vector of filenames. So an alternative is a snippet that downloads multiple files into a tempfile(), and reads from that tempfile?

kuriwaki avatar Jan 04 '24 17:01 kuriwaki