Théo Jolivet

Results 18 comments of Théo Jolivet

I'm working on adding the `key` argument, and the `MultiAggJoiner` :)

I'd be interested to work on screening when I'm done with the multi joiners. I think it's a feature that might also be useful for other estimators than the Joiners,...

Closing because this has been addressed in the postprocessing step of the `TableVectorizer` in #902

Implemented in https://github.com/skrub-data/skrub/pull/926

I'm unsure about the default suffixes in `MultiAggJoiner`. If we add suffixes to the tables to be joined based on their position in the auxiliary table list, it will have...

I want to be a bit more consistent with parameter names. In the `MultiAggJoiner`, I think we want to keep all plurals i.e. `aux_tables`, `keys`, `main_keys`, `aux_keys`, `cols`, `operations` and...

I'm working on testing for polars inputs in : test_deduplicate.py test_fuzzy_join.py test_minhash_encoder.py test_gap_encoder.py test_similarity_encoder.py test_table_vectorizer.py test_datetime_encoder.py test_fast_hash.py test_joiner.py

All done, last item was completed in #945

Closing as it was done in #902 with `CheckInputDataFrame`

Closing because columns created/modified by the TableVectorizer are now in the same order as the input columns since https://github.com/skrub-data/skrub/pull/902