Rory Mitchell comments

Results 48 comments of


                                            Rory Mitchell

Optimise histogram kernels

Here is the A100 benchmark. Everything looks good. | dataset | master | hist | |---------|-------------|-------------| | airline | 65.77564727 | 60.79124835 | | bosch | 13.05801762 | 13.36868745 |...

Optimise histogram kernels

Unfortunately using aligned byte sizes in the compressed iterator increased the memory usage of the large sizes test by 1gb and I think it barely no longer fits on the...

Optimise histogram kernels

I reverted the changes to compressed iterator. In the test for large sizes the bit packed version to is able to use 10 bits per symbol where the aligned version...

[WIP] Implement `min_child_samples` for `hist` and `approx`.

I think the criticism of this heuristic is quite valid, it makes the result unpredictable. The assumption that the hessian is uniformly distributed seems dubious. I think the only reliable...

[WIP] Implement `min_child_samples` for `hist` and `approx`.

After working on the row partitioning I realized that if we have count information it makes this step much faster. So if we counted rows in the histogram we lose...

Optimization/apply split/column matrix

In my opinion this PR should not be merged. It looks like unrelated pieces of code copy pasted from the original PR. It's definitely possible to implement the desired improvements...

[FEA] Allow gpu_treeshap to accept const input data

Does just adding an additional ConstFloatPointer type work for you?

Kmeans||

We can switch to doing some work on xgboost after you have dealt with the review issues until the continuous integration problem gets resolved.

Tracking deprecated features.

Its always 1 in code. According to @tqchen it was intended to be used for multiclass trees (he can maybe confirm this). There is also something in metainfo to do...

Run the central Rabit process on a worker

So for my xgboost integration (https://github.com/dmlc/xgboost/pull/4473) I will try the approach of running the tracker on worker zero and assume the performance load of the tracker is negligible.