Reducing runtime when the data set is large
When the data set grows to millions of rows and hundreds of features, a run takes hours. Are there ways to shorten the computing time?
You might try our other package, https://github.com/almost-matching-exactly/variable_imp_matching, which may scale better. Alternatively, you could try our database option.
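Independent of which package you use, runtime on a matching problem of this size can often be cut by piloting on a row subsample and pruning near-constant covariates first. The sketch below is a generic preprocessing workaround, not part of the DAME-FLAME API; the array names, sizes, and variance threshold are all illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)

# Stand-in for a large covariate matrix (rows x features).
# In practice this would come from your own data set.
n_rows, n_features = 100_000, 50
X = rng.integers(0, 3, size=(n_rows, n_features))

# 1) Pilot on a row subsample before committing to a full run.
sample_idx = rng.choice(n_rows, size=10_000, replace=False)
X_sample = X[sample_idx]

# 2) Drop near-constant features: they add runtime to the matching
#    procedure while carrying little signal. The 0.05 cutoff is
#    illustrative, not a recommended default.
variances = X.var(axis=0)
keep = variances > 0.05
X_reduced = X_sample[:, keep]

print(X_sample.shape, X_reduced.shape)
```

If the subsampled run looks reasonable, you can then decide whether the full run (or the database version) is worth the wall-clock cost.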
On Nov 8, 2023, at 10:22 AM, wtc100 wrote: When data set size goes to millions of rows and hundreds of features, it takes hours to run. Could there be ways to shorten the computing time?
I want to add the link to the database version (the flame_db folder): https://github.com/almost-matching-exactly/DAME-FLAME-Python-Package/tree/2d941bcfa76d7bcd33d58cbf4657202e62cc5b0c
and its documentation: https://github.com/almost-matching-exactly/DAME-FLAME-Python-Package?tab=readme-ov-file#a-tutorial-to-flame-database-version