datacleaner icon indicating copy to clipboard operation
datacleaner copied to clipboard

A Python tool that automatically cleans data sets and readies them for analysis.

Results 12 datacleaner issues
Sort by recently updated
recently updated
newest added

## What does this PR do? I add the parameters a fill_func for filling NA default is the same as the original and we can expand the function to fill...

## What does this PR do? I add the parameters a fill_func for filling NA default is the same as the original and we can expand the function to fill...

[provide general introduction to the issue and why it is relevant to this repository] ## Context of the issue CI/CD doens't work at all ## Process to reproduce the issue...

when running the script, my_data = pd.read_csv('test2.csv', sep=',',encoding='utf-8') my_clean_data = autoclean(my_data) my_data.to_csv('my_clean_data.csv') getting error '

The try except block starting at line 76 of datacleaner.py raises a ValueError in Python 2.7 when the column is of type object (string). Since the Python 2.7 icon is...

bug
help wanted

Add a easy-to-use handle that saves the mapping between features values to their categorical label.

## What does this PR do? ## Where should the reviewer start? ## How should this PR be tested? ## Any background context you want to provide? ## What are...

Thanks for this awesome tool! I was wondering if we could include some sanity checking/cleanup for badly behaved text (e.g. all those invalid unicode characters). Could be as simple as...

enhancement

Write a wrapper for datacleaner that allows it to act as a scikit-learn transformer. See the [scikit-learn docs](http://scikit-learn.org/stable/developers/contributing.html#apis-of-scikit-learn-objects) for information on the transformer API.

enhancement
help wanted

Hi there, datacleaner seems quite interesting. Cleaning Data is always annoying and tools are missing. If I have seen it right, you impute NaNs. You could also consider to replace...

enhancement
help wanted