diffino
diffino copied to clipboard
Diffing tools for comparing datasets in CSV, XLSX and other formats
Hi, Using python3 I got: from diffino.models import Diffino File "local/lib/python3.6/site-packages/diffino/models.py", line 5, in from urlparse import urlparse ModuleNotFoundError: No module named 'urlparse' The same as in : https://github.com/heroku/kafka-helper/issues/6
### Problem Currently there is no good way to exclude specific fields from the match process. When you're comparing two versions of a report, one with a new column and...
Just wondering what made you stick to pandas 0.19.2? Seems this is only working on python 3.6? Thanks a lot!
Right now were are saving the output diff in three separate CSV files, we could add support for whenever the output file is an xls(x) then use spreadsheets instead: `--output...
Allow using strict `NaN` with `--strict-nan`, otherwise fill na with blank strings ` ` and use that for comparing
@pacofvf suggested exploring if we could use of this to some extend to benefit from sequence matchers: https://kite.com/python/docs/difflib.SequenceMatcher 
> If it looks like a duck, swims like a duck, and quacks like a duck, then it probably is a duck. For example: - 2019-01-31 (datetime) vs 2019/01/31 (string)...