pinot
Add functions for statistical analysis in SQL
As discussed with @siddharthteotia, consider adding some common statistical analysis methods to the SQL language.
A few examples:
- Pearson's coefficient
- Sampling (bernoulli/stratified)
- Histogram
- Entropy
- Linear regression
- Logistic regression
- SVM
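To make the first item concrete, here is a minimal sketch of Pearson's coefficient written as a mergeable aggregate: each partition keeps running sums, and partial states combine by addition. This is not Pinot's API; `PearsonState` and its methods are hypothetical names used only for illustration, and the assumption is that such a function would need to combine partial results computed on separate data partitions.

```python
import math
from dataclasses import dataclass


@dataclass
class PearsonState:
    """Hypothetical partial state: running sums over (x, y) pairs."""
    n: int = 0
    sx: float = 0.0
    sy: float = 0.0
    sxx: float = 0.0
    syy: float = 0.0
    sxy: float = 0.0

    def add(self, x: float, y: float) -> None:
        # Fold one row into the running sums.
        self.n += 1
        self.sx += x
        self.sy += y
        self.sxx += x * x
        self.syy += y * y
        self.sxy += x * y

    def merge(self, other: "PearsonState") -> "PearsonState":
        # Partial states combine by field-wise addition.
        return PearsonState(
            self.n + other.n,
            self.sx + other.sx,
            self.sy + other.sy,
            self.sxx + other.sxx,
            self.syy + other.syy,
            self.sxy + other.sxy,
        )

    def result(self) -> float:
        # r = (n*Sxy - Sx*Sy) / sqrt((n*Sxx - Sx^2) * (n*Syy - Sy^2))
        cov = self.n * self.sxy - self.sx * self.sy
        vx = self.n * self.sxx - self.sx ** 2
        vy = self.n * self.syy - self.sy ** 2
        if vx <= 0 or vy <= 0:
            return float("nan")  # undefined for constant or empty input
        return cov / math.sqrt(vx * vy)
```

The sum-based formula is easy to merge but can lose precision on large values with small variance; a production version might prefer a numerically stabler update.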
I am designing a one-request, multiple-(sequential)-queries model for the statistical functions. I plan to use mini-batch stochastic gradient descent for the regression algorithms 2. 3. 4.
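As a rough illustration of the mini-batch SGD idea for linear regression, the sketch below fits `y = w*x + b` on in-memory lists. It is not the proposed design: the function name, the hyperparameters, and the mapping of one epoch to one sequential pass over the data are all assumptions made for the example.

```python
import random


def minibatch_sgd_linear(xs, ys, lr=0.1, batch_size=4, epochs=200, seed=0):
    """Fit y = w*x + b by mini-batch SGD on squared error (illustrative only)."""
    rng = random.Random(seed)
    w, b = 0.0, 0.0
    idx = list(range(len(xs)))
    for _ in range(epochs):  # assumption: one epoch ~ one sequential pass
        rng.shuffle(idx)
        for start in range(0, len(idx), batch_size):
            batch = idx[start:start + batch_size]
            gw = gb = 0.0
            for i in batch:
                err = (w * xs[i] + b) - ys[i]
                gw += err * xs[i]  # gradient of 0.5*err^2 w.r.t. w
                gb += err          # gradient of 0.5*err^2 w.r.t. b
            # Step along the averaged mini-batch gradient.
            w -= lr * gw / len(batch)
            b -= lr * gb / len(batch)
    return w, b


# Usage: recover slope and intercept from noise-free data.
xs = [i / 10 for i in range(20)]
ys = [2.0 * x + 1.0 for x in xs]
w, b = minibatch_sgd_linear(xs, ys)
```

Only the current `w` and `b` carry over from one pass to the next, which is the property a multi-query model would presumably rely on.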
Supporting histogram- and entropy-like computations could also be useful.
Is anyone working on supporting sampling? Do we know how much effort it is going to be? Will it be a few days or weeks?
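For reference, a minimal sketch of what those two computations involve, assuming a fixed-width histogram over a known range and Shannon entropy (in bits) computed from bucket counts. The function names and parameters are hypothetical and not an existing API.

```python
import math


def histogram(values, lo, hi, bins):
    """Count values into `bins` equal-width buckets over [lo, hi]."""
    width = (hi - lo) / bins
    counts = [0] * bins
    for v in values:
        if lo <= v <= hi:
            # The right edge (v == hi) falls into the last bucket.
            i = min(int((v - lo) / width), bins - 1)
            counts[i] += 1
    return counts


def entropy(counts):
    """Shannon entropy in bits of the distribution given by `counts`."""
    total = sum(counts)
    if total == 0:
        return 0.0
    return -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
```

Both work from counts, so per-partition results can be combined by adding the count arrays before the final entropy is computed.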
@jasperjiaguo @siddharthteotia