datacleaner
datacleaner copied to clipboard
A Python tool that automatically cleans data sets and readies them for analysis.
## What does this PR do? I add the parameters a fill_func for filling NA default is the same as the original and we can expand the function to fill...
## What does this PR do? I add the parameters a fill_func for filling NA default is the same as the original and we can expand the function to fill...
[provide general introduction to the issue and why it is relevant to this repository] ## Context of the issue CI/CD doens't work at all ## Process to reproduce the issue...
when running the script, my_data = pd.read_csv('test2.csv', sep=',',encoding='utf-8') my_clean_data = autoclean(my_data) my_data.to_csv('my_clean_data.csv') getting error '
The try except block starting at line 76 of datacleaner.py raises a ValueError in Python 2.7 when the column is of type object (string). Since the Python 2.7 icon is...
Add a easy-to-use handle that saves the mapping between features values to their categorical label.
## What does this PR do? ## Where should the reviewer start? ## How should this PR be tested? ## Any background context you want to provide? ## What are...
Thanks for this awesome tool! I was wondering if we could include some sanity checking/cleanup for badly behaved text (e.g. all those invalid unicode characters). Could be as simple as...
Write a wrapper for datacleaner that allows it to act as a scikit-learn transformer. See the [scikit-learn docs](http://scikit-learn.org/stable/developers/contributing.html#apis-of-scikit-learn-objects) for information on the transformer API.
Hi there, datacleaner seems quite interesting. Cleaning Data is always annoying and tools are missing. If I have seen it right, you impute NaNs. You could also consider to replace...