Devin Petersohn
Hi @wqh17101 thanks for the report! Modin works on Windows. What type of object did you pass as `file_path`? It seems like that might be causing the issue.
@wqh17101 is this a test file you can share? I have tried to reproduce this, but I haven't been able to.
@wqh17101 can you share the full script? I wasn't able to reproduce the issue with your code and file. Are you doing `os.chdir` or something like that? That could be...
@gshimansky Great work! Will you create a PR that adds these failing tests?
@mvashishtha For the algebra, typically there are two things that qualify an operator:
* Unique handling of metadata
* Unique handling of data

We typically want each operator to have...
@mvashishtha For now, let's keep this fix local to the axis-wide apply. We should revisit it afterward.
@alejandro-ponder is this in a fresh environment? Your machine may not have enough memory to do both Modin and pandas in the same notebook/interpreter environment.
Thanks @danpetruk for the report! This is not expected, but I can reproduce it. When I run your code I get a similar difference. I know this is not expected...
As it currently stands, the minimum number of columns in any partition is 32, which limits parallelism. Datasets often have fewer than 32 columns, so we...
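To illustrate why a 32-column floor limits parallelism, here is a minimal sketch in plain Python. It is not Modin's actual partitioning code: the function name `column_partition_widths`, its parameters, and the splitting rule (even split across workers, then clamp each chunk to the minimum width) are assumptions made for illustration only.

```python
import math


def column_partition_widths(n_cols, n_workers, min_width=32):
    """Hypothetical helper: split n_cols columns into chunks.

    Assumed rule: aim for an even split across n_workers, but never
    let the target chunk width drop below min_width. The final chunk
    may still be narrower if the columns run out.
    """
    if n_cols <= 0:
        return []
    # Width each chunk would have if columns were spread evenly.
    ideal = math.ceil(n_cols / n_workers)
    # Enforce the floor on the target width.
    width = max(ideal, min_width)
    widths = []
    remaining = n_cols
    while remaining > 0:
        take = min(width, remaining)
        widths.append(take)
        remaining -= take
    return widths


# With a floor of 32, a 20-column frame ends up in a single column chunk,
# so only one worker can operate along that axis.
narrow_with_floor = column_partition_widths(20, n_workers=8)

# With the floor lowered to 1, the same frame spreads over several chunks.
narrow_without_floor = column_partition_widths(20, n_workers=8, min_width=1)
```

Under these assumptions, any frame with fewer than 32 columns collapses to one column chunk regardless of how many workers are available, which is the situation the comment describes.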
@mvashishtha A couple of questions:
- What kind of performance do you get as the number of rows increases?
- Does anything change if you start the benchmark from a...