quac
QUAC ("quantitative analysis of chatter" or any related acronym you like) is a package for acquiring and analyzing social Internet content. Docs are online at http://reidpr.github.io/quac.
Related to #110, we should add support for granularities other than hourly. The CDC data are daily. I think some low-hanging fruit would be to allow the user to provide...
We've received some (non-publicly available) CDC time series that include the number of hits per day per page per region. This directly relates to how the Wikipedia time series are...
Guaranteeing that all pagecount files which pass metadata will parse 100% correctly means excluding quite a lot of files, for example all of February and half of March 2013. For...
Because the raw Twitter data cannot be recovered if it is lost or corrupted, make it read-only so that accidental modification is more difficult.
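One way this could be done is sketched below in Python, assuming the raw data live as regular files under a single directory tree on a POSIX filesystem. The function name `make_read_only` is hypothetical and is not part of QUAC. The sketch only clears the write bits on files; directories stay writable, so files could still be deleted or replaced, and a privileged user can still write to them.

```python
import os
import stat
from pathlib import Path

# All three write bits: owner, group, and other.
WRITE_BITS = stat.S_IWUSR | stat.S_IWGRP | stat.S_IWOTH


def make_read_only(root):
    """Clear the write bits on every regular file under root.

    Hypothetical helper, not part of QUAC. Returns the number of files
    whose mode was changed. Symlinks are skipped so that nothing outside
    the tree is touched.
    """
    changed = 0
    for path in Path(root).rglob("*"):
        if path.is_symlink() or not path.is_file():
            continue
        mode = stat.S_IMODE(path.stat().st_mode)
        new_mode = mode & ~WRITE_BITS
        if new_mode != mode:
            os.chmod(path, new_mode)
            changed += 1
    return changed
```

Calling `make_read_only` on the directory that holds the raw tweets would be enough for a one-off pass. Running it again after each collection step would cover newly added files, and it is safe to repeat because files that are already read-only are left alone.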
The functionality for inference is present, but we need to add it to the pre-processing pipeline.
This PR tackles two related issues: #110 and #111. Not included in this PR (but it could be) is a script for migrating the v1 Wikipedia data to v2. This...