easydata
easydata copied to clipboard
A flexible template for doing reproducible data science in Python.
this text ``` Now would be a good time to initialize a git repo; i.e. >>> git init >>> git add . >>> git commit -m 'initial import' >>> git...
20_newsgroups is created on a make_test. Should be removed, or confined to CI only
right now it doesn't. Found out the hard way when implementing extra_base
Detect auto-unpacking (like with wine reviews dataset in Safari if there's a way?!). At least give a better message with the manual downloading instructions.
Give INFO level messages on which files are being downloaded to where. In the case that I have to download the files manually, it would be good if Dataset.load were...
Create an import statement that will automatically run all the "useful" things we want such as: ``` from IPython.core.display import display, HTML display(HTML(".container { width:100% !important; }")) ``` to cut...
If possible, catch import errors and remind users to `make update_environment`
e.g. ghego/travis_anaconda_jupyter: Testing Jupyter notebooks with Travis.ci https://github.com/ghego/travis_anaconda_jupyter
We need something better for updating a dataset recipe and it's hash...If I change something and I want to generate a new hash, I don't want to have to run...
There was a "bum" dataset created out of the moby_dick dataset when bs4 was not installed...how is this possible?