sarbani
sarbani
we are now using postgresql approach -> Refer : https://github.com/dedupeio/dedupe-examples/tree/master/pgsql_big_dedupe_example Version used - 2.0.13 Total records 18K with 16 core, 64 GIG RAM its is taking 20 mins to run...
Today we observed some accuracy concerns..(we are using postgresql approach). Lot of duplicates ara tagged as unique.. we are using first_name, last_name, gender, marital_status, ssn_last_4, race, ethnicity, phone, date_of_birth, address...
We are seeing this issue consistently-> eg I want records with "same fname, lname, date-of-birth" or "same fname, lname, ssn" will be labeled as duplicate. Here I dont want to...
I am following this tutorial -> https://learn.microsoft.com/en-us/azure/machine-learning/v1/tutorial-pipeline-python-sdk Getting error in train.oy -> [azure-ml-error.txt](https://github.com/Azure/azure-cli/files/9591954/azure-ml-error.txt) Can you please help to resolve. Thanks