valentine issues

Code refactors. No changes in functionality whatsoever

1

Add Noise Functionality to DataFrame Columns in Utils Package

1

# Description: This PR introduces a new function, add_noise_to_df_column, designed to add noise to a specified column in a DataFrame. The function addresses issue #63, where there was a request...

ThanosTsiamis

enhancement

New feature: Get top n columns

2

Resolves #52 As stated in issue #52 , it would be useful to be able to get the top n similar columns when analyzing the data. Since the issue is...

michaelkonstantinou

Add embedding-based methods

Add methods that utilize column vector representations and cosine similarity among them to determine matches.

chrisk21

enhancement

nice to have

Add the data fabricator the Valentine package

I would like to be able to load a dataframe, and then add noise to specific columns of that dataset.

asteriosk

similarity_flooding case where e[1].long_name=None in __get_attribute_tuple

2

Hi Valentine authors! I am having trouble with a bug that seems to be coming from Valentine, but I am unsure: - in `similarity_flooding.py`, is it expected that `long_name` may...

cchristodoulaki

Feature Request: Top n matches

1

It would be incredibly useful to give for each column in df1, give top n column matches in df2.

thisisanameforsure

nice to have

Upgrade to nltk 3.9.1 to address CVE-2024-39705

1

The upgrade to nltk to version 3.9.1 is a BREAKING change. This change downloads `punkt_tab` instead of `punkt` which has a critical security vulnerability (CVE-2024-39705). See e.g.: - https://github.com/advisories/GHSA-cgvx-9447-vcch -...

aecio

Bump nltk from 3.8.1 to 3.9

Bumps [nltk](https://github.com/nltk/nltk) from 3.8.1 to 3.9. Changelog Sourced from nltk's changelog. Version 3.9.1 2024-08-19 Fixed bug that prevented wordnet from loading Version 3.9 2024-08-18 Avoid need for pickled models, resolves...

dependabot[bot]

dependencies

feat: Add Git hooks for pre-commit and pre-push testing

1

This PR introduces a standardized .githooks/ directory and sets Git’s core.hooksPath so that all contributors automatically run the test suite before committing or pushing code. This ensures higher code quality,...

ThanosTsiamis

valentine
valentine copied to clipboard

Metadata

Code refactors. No changes in functionality whatsoever

Add Noise Functionality to DataFrame Columns in Utils Package

New feature: Get top n columns

Add embedding-based methods

Add the data fabricator the Valentine package

similarity_flooding case where e[1].long_name=None in __get_attribute_tuple

Feature Request: Top n matches

Upgrade to nltk 3.9.1 to address CVE-2024-39705

Bump nltk from 3.8.1 to 3.9

feat: Add Git hooks for pre-commit and pre-push testing

← Metadata

Owner

Metadata

valentine valentine copied to clipboard

Metadata

← Metadata

Owner

Metadata

valentine
valentine copied to clipboard