datasets
datasets copied to clipboard
The primary repository for all of the CORGIS Datasets
This has always been a very difficult dataset to work with, from its inception. But the data quality could stand to be improved, I believe. In particular, I notice that...
I'm guessing we weren't building many hydropower dams in 19 AD. But someone should check and make sure.
I'm very suspicious of this dataset, not least because I don't really understand it. - Check that the values are in consistent units. - Check the actual possible values of...
This seems like a pretty promising dataset. https://www.bls.gov/developers/ Perhaps we should even be replacing the existing labor dataset with it? That one is pretty terrible and breaks in Java...
Identify when the primary occured, whether it was a primary or caucus, and possibly demographics on the location. Also, stack this dataset! And possibly indicate whether the candidate was actually...
Parse the "Artist Info.Years Living" field to extract out a birth year, death year, and age.
We have some fashion majors this semester. It'd be nice to see if any of these could bear fruit: http://mmlab.ie.cuhk.edu.hk/projects/DeepFashion.html http://www.st.ewi.tudelft.nl/~bozzon/fashion10000dataset/
2011-2015 drug delivery data https://www.cms.gov/Research-Statistics-Data-and-Systems/Statistics-Trends-and-Reports/Information-on-Prescription-Drugs/2015MedicareData.html Part D in particular seems nice. Might need to impute the data a little.
Real numbers are sometimes coded as Integers, because we only look at the first field. We should mimic the other builders and check the set of possible types and use...