Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Run "bootstrapped supervised binning"-model on unseen data #1

Open
claczny opened this issue May 22, 2017 · 0 comments
Open

Run "bootstrapped supervised binning"-model on unseen data #1

claczny opened this issue May 22, 2017 · 0 comments
Assignees

Comments

@claczny
Copy link
Owner

claczny commented May 22, 2017

Bootstrapped supervised binning builds internally a classification model to accelerate the binning process.
More specifically, only "cluster points", i.e., a subset of the input sequences, are used to train this model.
The trained model is then used to predict a "bin"-assignment for the remaining data.

Currently, all this is performed on the same input, i.e., all the data must be available during the entire computation.
However, it would be nice to simply apply the trained model on "unseen" data, i.e., data that was not provided as input (not even in the form of "non-cluster" points, e.g., border points or remaining points).

@claczny claczny self-assigned this May 22, 2017
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant