Théo Jolivet
I'm working on adding the `key` argument, and the `MultiAggJoiner` :)
I'd be interested in working on screening once I'm done with the multi joiners. I think it's a feature that might also be useful for estimators other than the Joiners,...
Closing because this has been addressed in the postprocessing step of the `TableVectorizer` in #902
Implemented in https://github.com/skrub-data/skrub/pull/926
I'm unsure about the default suffixes in `MultiAggJoiner`. If we add suffixes to the tables to be joined based on their position in the auxiliary table list, it will have...
I want to be a bit more consistent with parameter names. In the `MultiAggJoiner`, I think we want to keep all plurals i.e. `aux_tables`, `keys`, `main_keys`, `aux_keys`, `cols`, `operations` and...
I'm working on testing for polars inputs in:
- test_deduplicate.py
- test_fuzzy_join.py
- test_minhash_encoder.py
- test_gap_encoder.py
- test_similarity_encoder.py
- test_table_vectorizer.py
- test_datetime_encoder.py
- test_fast_hash.py
- test_joiner.py
All done, last item was completed in #945
Closing as it was done in #902 with `CheckInputDataFrame`
Closing because, since https://github.com/skrub-data/skrub/pull/902, columns created or modified by the `TableVectorizer` are in the same order as the input columns.