Jan Gorecki

Results 84 issues of Jan Gorecki

It would be nice to include an example of converting from Arrow to Julia DataFrames somewhere in the README. I would expect this to be a very common use case. Also note that...

Are there any plans to provide interfaces to your binary format from other languages? With a Python or Julia interface, we could easily move data between different platforms. Something that feather...

question

As I don't have Twitter, I would like to leave feedback about the benchmark here. I strongly advise scaling up the benchmark so that a single time measurement is above 1s. Having timings...
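To illustrate the suggestion above, here is a minimal sketch of one way to scale a benchmark until a single measurement exceeds a chosen threshold. It is not taken from the benchmark in question: the function names, the doubling strategy, and the stand-in workload `sum_of_squares` are all assumptions made for illustration.

```python
import time


def scale_until_slow_enough(workload, start_size=1_000, min_seconds=1.0, max_size=10**9):
    """Double the input size until one run of workload(size) takes at least min_seconds.

    Returns (size, elapsed_seconds) for the first run that is slow enough,
    or for the last run attempted once size reaches max_size.
    """
    size = start_size
    while True:
        start = time.perf_counter()
        workload(size)
        elapsed = time.perf_counter() - start
        if elapsed >= min_seconds or size >= max_size:
            return size, elapsed
        size *= 2  # hypothetical scaling rule: double the problem size


def sum_of_squares(n):
    # Hypothetical stand-in for the real benchmark task.
    return sum(i * i for i in range(n))
```

Calling `scale_until_slow_enough(sum_of_squares)` returns the first problem size whose single run took at least one second, which could then be used as the benchmark size.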

This wish item is about an API for efficient pop and push methods that operate in place. Because R is a functional language, I understand it might not provide such methods directly....

It would be a little easier to contribute to base R if one could use a productive version control system like git. For now we only have a mirror: https://github.com/wch/r-source

Not sure whether this is a "feature" of the latest firmware A07, which has to deal with keystroke repetition, or whether it is something related to kernel 4.2. On my previous setup...

Just a simple flag so we can filter out our past messages that were left without a reply.

[Type] enhancement
Module:Messages

When doing
```
ans = x(select=mean(f.v1), groupby="id1")
```
we get aggregated results that are also sorted by `id1`. It is sometimes useful to retain the original order of values from...

improve
groupby
low priority
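The difference between sorted group output and output in order of first appearance can be sketched in plain Python. This is only an illustration of the requested semantics, not datatable's implementation or API: `group_mean` and its `sort` flag are hypothetical names, and the sample rows are made up.

```python
def group_mean(rows, key, value, sort=True):
    """Mean of `value` per distinct `key`.

    With sort=True the groups come back ordered by key; with sort=False
    they come back in the order each key first appears in `rows`.
    """
    sums = {}  # plain dicts preserve insertion order (Python 3.7+)
    for row in rows:
        k = row[key]
        total, count = sums.get(k, (0.0, 0))
        sums[k] = (total + row[value], count + 1)
    keys = sorted(sums) if sort else list(sums)
    return [(k, sums[k][0] / sums[k][1]) for k in keys]


# Made-up sample data: "b" appears before "a" in the input.
rows = [
    {"id1": "b", "v1": 1.0},
    {"id1": "a", "v1": 2.0},
    {"id1": "b", "v1": 3.0},
    {"id1": "a", "v1": 4.0},
]

sorted_result = group_mean(rows, "id1", "v1", sort=True)    # groups ordered by id1
ordered_result = group_mean(rows, "id1", "v1", sort=False)  # groups in first-appearance order
```

Here `sorted_result` lists group `a` first, while `ordered_result` lists group `b` first because `b` occurs first in the input.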

Introduced between https://github.com/h2oai/datatable/compare/a45cc503494571bfbf0feb00ece03ad0bab16dfc...31fbad7b471113e30f34de8cc2b321c76f926c32, reproduced using
```sh
cd db-benchmark
source ./pydatatable/py-pydatatable/bin/activate
pip install --upgrade git+https://github.com/h2oai/datatable.git@a45cc503494571bfbf0feb00ece03ad0bab16dfc
./_launcher/solution.R --solution=pydatatable --task=join --nrow=1e7 --quiet=true --out=dt-time.csv
sleep 5
./_launcher/solution.R --solution=pydatatable --task=join --nrow=1e8 --quiet=true --out=dt-time.csv
sleep 5
...
```

performance
join

A pydatatable join can be very fast, but for a big-to-big join the variance of the timings is very large. The numeric columns present the Unix epoch time of the benchmark...

question
join
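One way to quantify timing variance like that described above is to repeat a run several times and summarize the spread. The sketch below uses only the Python standard library; the helper names `time_repeated` and `summarize` are hypothetical, and nothing here reflects how the benchmark itself records timings.

```python
import statistics
import time


def time_repeated(func, repeats=5):
    """Run func() `repeats` times and return the wall-clock time of each run in seconds."""
    timings = []
    for _ in range(repeats):
        start = time.perf_counter()
        func()
        timings.append(time.perf_counter() - start)
    return timings


def summarize(timings):
    """Return min, max, mean, sample standard deviation, and coefficient of variation."""
    mean = statistics.mean(timings)
    stdev = statistics.stdev(timings) if len(timings) > 1 else 0.0
    return {
        "min": min(timings),
        "max": max(timings),
        "mean": mean,
        "stdev": stdev,
        # Coefficient of variation: spread relative to the mean, unitless.
        "cv": stdev / mean if mean > 0 else 0.0,
    }
```

A high coefficient of variation across repeated runs of the same task would indicate the kind of unstable timing the issue describes.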