Konstantin Slavnov
Konstantin Slavnov
There is a special window function in MYSQL to calculate complex aggregations. For example, if you have a column with values `[1, 1, 2, 2]` and want to get cumulative...
To reinstall python driver I need to remove it first and install the new one after. Good to have something like `reinstall` or `install -f` subcommand for it.
Dependent projects such as https://github.com/src-d/style-analyzer need only algorithms part of the sourced-ml: https://github.com/src-d/ml/tree/master/sourced/ml/algorithms Data collection part uses deprecated jgit-spark-connector which depends on old packages. This leads to unpleasant dependency conflicts:...
We want to be sure that only compatible Transformers will be linked together. So it is a good idea to add to `Transformer` class something like ```python INPUT_FORMAT = Rdd[Row["cname1",...
we have such updates for https://hub.docker.com/r/srcd/ml, but not for https://hub.docker.com/r/srcd/ml-core We should add it. @vmarkovtsev, as I understand only you in the team, have permissions for it.
We constantly confuse terms, so what to say about other developers. I do not want to make it full, but to have a start. Here is terms list to explain...
The biggest part of cmd arguments we use come from Transformers. I think we can include them to Transformers and assemble a list of cmd arguments from a pipeline. It...
I already do not remember what the difference and why we have both. I think we need describe it better. The same for classes in https://github.com/src-d/ml/blob/master/sourced/ml/transformers/content2ids.py Cannot get what are...
We can take some rule-based approach as a benchmark: email contains `bot` word or `no-reply`. However, there are emails like `[email protected]` that is hard to find. So some ML should...