DAME-FLAME-Python-Package icon indicating copy to clipboard operation
DAME-FLAME-Python-Package copied to clipboard

Accelerate runtime when data set is large

Open wtc100 opened this issue 2 years ago • 2 comments

When data set size goes to millions of rows and hundreds of features, it takes hours to run. Could there be ways to shorten the computing time?

wtc100 avatar Nov 08 '23 15:11 wtc100

Perhaps you might try our other package https://github.com/almost-matching-exactly/variable_imp_matching ? This one might scale better. Or you could try our database option.

On Nov 8, 2023, at 10:22 AM, wtc100 @.***> wrote:

When data set size goes to millions of rows and hundreds of features, it takes hours to run. Could there be ways to shorten the computing time? — Reply to this email directly, view it on GitHub, or unsubscribe. You are receiving this because you are subscribed to this thread.Message ID: @.***>

cynrudin avatar Nov 08 '23 17:11 cynrudin

I want to add the link to the database version (the flame_db folder): https://github.com/almost-matching-exactly/DAME-FLAME-Python-Package/tree/2d941bcfa76d7bcd33d58cbf4657202e62cc5b0c

and its documentation: https://github.com/almost-matching-exactly/DAME-FLAME-Python-Package?tab=readme-ov-file#a-tutorial-to-flame-database-version

nehargupta avatar Dec 19 '23 16:12 nehargupta