Evaluate the ranger library for batch training with Random Forest #19
Labels
enhancement
This issue is related to a new feature
epic
This issue is an epic and needs to be broken downs into smaller issues
Description
Use a lightweight C++ library for Random Forest batch training using the training data logged in the training.csv file.
The ranger Libraries
ranger is a fast implementation of random forests (Breiman 2001) or recursive partitioning, particularly suited for high dimensional data. Classification, regression, and survival forests are supported. Classification and regression forests are implemented as in the original Random Forest (Breiman 2001), survival forests as in Random Survival Forests (Ishwaran et al. 2008). Includes implementations of extremely randomized trees (Geurts et al. 2006) and quantile regression forests (Meinshausen 2006).
The GitHub repo: https://github.com/imbs-hl/ranger
Saving the trained models seems possible:
https://github.com/imbs-hl/ranger/blob/ce497711884c783e133fb36750b60de4c140773f/src/Forest.cpp#L403-L443
The training input
Use the training inputs collected during flight:
The training inputs collected during flight:
https://github.com/georgeslabreche/opssat-orbitai/blob/main/results/learning/mochi-2021-04-18_02-51-48/logs/training.csv
What to do
We can discuss how we approach step 7 once step 1 to 6 have been completed.
The text was updated successfully, but these errors were encountered: