Jared Knowles
I started work a while ago on a much less ambitious project than `wakefield` to attempt to generate random data sets on the fly with a known correlation structure. You can...
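One common way to get a known correlation structure is to multiply independent normal draws by the Cholesky factor of the target correlation matrix. The sketch below uses only base R; `gen_correlated` and its arguments are hypothetical names chosen for illustration and are not taken from `wakefield` or from the project mentioned above.

```r
# Hypothetical helper (not from any package): draw n rows of multivariate
# normal data whose population correlation matrix is `rho`.
gen_correlated <- function(n, rho) {
  stopifnot(is.matrix(rho), nrow(rho) == ncol(rho))
  k <- ncol(rho)
  z <- matrix(rnorm(n * k), nrow = n, ncol = k)  # independent standard normals
  u <- chol(rho)                                 # upper triangular, t(u) %*% u == rho
  z %*% u                                        # columns now have correlation rho
}

set.seed(42)
rho <- matrix(c(1, 0.6, 0.6, 1), nrow = 2)  # assumed target correlation
x <- gen_correlated(5000, rho)
round(cor(x), 2)  # sample correlation should be close to rho
```

The sample correlation only approximates `rho`; the match tightens as `n` grows.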
This is a four-part task:
- [ ] Create a more robust prediction interval for `predict.caretEnsemble`
- [ ] Apply this method to `predict.caretStack` to allow it to produce...
I'd like to see us add a function that prunes the train models stored in the `caretEnsemble` object. There is a lot of stuff stored in these individual train objects...
If you ensemble a series of models using external packages (e.g. `mda` or `glmnet`) and then clear your workspace and R session and load the `caretEnsemble` object and attempt to...
I'm not sure how commonly folks use `train` on large datasets, but my use case for `caretEnsemble` involves models with a minimum of 40k training observations. In this environment,...
See the errors here: https://www.r-project.org/nosvn/R.check/r-devel-linux-x86_64-debian-clang/eeptools-00check.html

From Kurt @ CRAN:

> The previous method can be requested using RNGkind() or RNGversion() if necessary for reproduction of old results....
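For reference, the two functions named in the quote can be used roughly as follows. This is a minimal sketch that assumes the quote refers to the change in the default `sample()` algorithm introduced in R 3.6.0; whether that is the exact cause of the `eeptools` check errors is not confirmed by the excerpt.

```r
# R >= 3.6.0 uses "Rejection" sampling in sample() by default.
set.seed(123)
new_draw <- sample(10, 3)

# Request the pre-3.6.0 behaviour to reproduce old results.
# RNGversion("3.5.0") warns about the non-uniform "Rounding" sampler,
# so the warning is suppressed here.
suppressWarnings(RNGversion("3.5.0"))
set.seed(123)
old_draw <- sample(10, 3)

# Switch back to the current default sampler.
RNGkind(sample.kind = "Rejection")
```

`RNGkind(sample.kind = "Rounding")` is the narrower alternative to `RNGversion("3.5.0")` when only the sampling algorithm needs to change.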
Report the largest and smallest N values for a vector and also their frequency. Make a method for factors that reports the most common and least common values. Apply method...
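One possible shape for this is an S3 generic with a default method for ordinary vectors and a separate method for factors. The name `extremes` and its arguments are hypothetical and only illustrate the idea; the actual function may be designed differently.

```r
# Hypothetical generic: report extreme values and how often they occur.
extremes <- function(x, n = 3, ...) UseMethod("extremes")

# Default method: the n smallest and n largest distinct values,
# each with its frequency. NAs are ignored.
extremes.default <- function(x, n = 3, ...) {
  vals <- sort(unique(x[!is.na(x)]))
  freq <- tabulate(match(x, vals), nbins = length(vals))
  k <- min(n, length(vals))
  lo <- seq_len(k)
  hi <- rev(seq_along(vals))[lo]
  list(smallest = data.frame(value = vals[lo], freq = freq[lo]),
       largest  = data.frame(value = vals[hi], freq = freq[hi]))
}

# Factor method: the n most common and n least common levels.
extremes.factor <- function(x, n = 3, ...) {
  counts <- sort(table(x), decreasing = TRUE)
  k <- min(n, length(counts))
  list(most_common  = counts[seq_len(k)],
       least_common = rev(counts)[seq_len(k)])
}

res_num <- extremes(c(1, 1, 2, 5, 9, 9, 9), n = 2)
res_fac <- extremes(factor(c("a", "b", "b", "c", "c", "c")), n = 1)
```

Dispatching on class keeps the numeric and factor logic separate, so further methods can be added without touching the existing ones.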
Implement graphics in the documentation of the datasets. Also include datasets with missing data to better demonstrate R's missing data handling to others and to test functions on.
The time has come to refactor `predictInterval`. It always should have been multiple functions since each piece of the prediction interval can be sampled independently. To help make future maintenance...