Maxim Grechkin
Maxim Grechkin
This isn't actually completed, right? There are `Signal`s still left in the `Event.elm` for example.
Even non-unique items can lead to issues. I've run into the same issue before (on the same dataset :() in scikit-learn which lead to: https://github.com/scikit-learn/scikit-learn/pull/4436, https://github.com/scikit-learn/scikit-learn/pull/3725. I think scikit-learn tries...
@andreasnoack I'm processing microarray gene expression data and values for one probe happen to be all equal across sample. Standardizing the data (zero mean, variance one) is a standard pre-processing...
You should avoid storing `Any` using JLD, it takes significantly more space that way. If you will cast your arrays to some concrete types, it will be significantly more compact...
@dkdog an option would be to write custom serializers for types you are interested in. E.g. https://gist.github.com/maximsch2/4257a23911b7fe71e5ec519fc23082ff
JLD supports them just fine, but doesn't allow to mmap an array of Float16s, because it is an `HDF5Opaque` type. This was my original motivation for asking for support for...
I guess as long as Julia has consistent format for writing Float16s (which it seems it has), then doing for JLD (which is Julia-specific) shouldn't be a problem. I worked...
I have an OpenMP based version here: https://github.com/maximsch2/bhtsne. I don't like the binary file interface, so I'm also modifying it to build as a shared library and expose a simple...
I haven't benchmarked it on big datasets yet, but I think I get around 1.3-1.5x on two cores on a smallish dataset (takes a couple of seconds to build). My...
I don't have an easy access to a Windows machine, but I don't think there is anything unix-specific that I've added there. I think MinGW supports OpenMP, so you should...