scikit-rebate
scikit-rebate copied to clipboard
A scikit-learn-compatible Python implementation of ReBATE, a suite of Relief-based feature selection algorithms for Machine Learning.
# Context TL:DR - I was able to implement some significant performance improvements for ReliefF on binary + discrete data. For a GAMETES generated binary class discrete data file with...
One of the major challenges of making the Relief-based algorithms of ReBATE flexible enough to handle different dataset types, i.e. (1) continuous, discrete, or mixed feature types, (2) binary, multiclass,...
We currently import several NumPy functions directly, e.g., [here](https://landscape.io/github/EpistasisLab/scikit-rebate/17/modules/skrebate/multisurf.py#L26). Normally this isn't an issue, but `min`, `max`, `mean`, etc. override the standard definitions of these functions in Python. We should...
File: scoring_utils.py Function: compute_score(attr, **mcmap**, NN, feature, inst, nan_entries, headers, class_type, X, y, labels_std, data_type, near=True) In compute_score, the parameter mcmap stores class frequencies, but it doesnot seem to have...
Resolved #53
Hello, I am new to python and machine learning but need to use the library for a project. I read the website and the sample code but am still confused...
Hi, I run VLSRelief in a small dataset (100 features) in order to check if it run without any problems. However, after I rerun it in a large dataset (>160...
Hello, I am currently trying to use TuRF to get my feature importance scores, and my code is almost the same as the example code in the docs: ``` from...
When the number of features is odd, TuRF often leaves out one feature (causing a value error at this line https://github.com/EpistasisLab/scikit-rebate/blob/master/skrebate/turf.py#L166) because segmenting of features into selected and non_selected is...
Currently the fit method fails if you pass a pandas dataframe object to the `fit()` and `predict()` adding using the sklearn util check_array (http://scikit-learn.org/stable/modules/generated/sklearn.utils.check_array.html#sklearn.utils.check_array) will by default convert the pandas...