Evaluate the ranger library for batch training with Random Forest #19

georgeslabreche · 2021-05-05T16:00:18Z

Description

Use a lightweight C++ library for Random Forest batch training using the training data logged in the training.csv file.

The ranger Libraries

ranger is a fast implementation of random forests (Breiman 2001) or recursive partitioning, particularly suited for high dimensional data. Classification, regression, and survival forests are supported. Classification and regression forests are implemented as in the original Random Forest (Breiman 2001), survival forests as in Random Survival Forests (Ishwaran et al. 2008). Includes implementations of extremely randomized trees (Geurts et al. 2006) and quantile regression forests (Meinshausen 2006).

The GitHub repo: https://github.com/imbs-hl/ranger

Saving the trained models seems possible:
https://github.com/imbs-hl/ranger/blob/ce497711884c783e133fb36750b60de4c140773f/src/Forest.cpp#L403-L443

The training input

Use the training inputs collected during flight:

PD values
Target label is in the 8th column.

The training inputs collected during flight:
https://github.com/georgeslabreche/opssat-orbitai/blob/main/results/learning/mochi-2021-04-18_02-51-48/logs/training.csv

What to do

Compile and run C++ implementation of ranger.
Check that trained model can indeed be saved and loaded.
Transform the training.csv data file into whatever format is expected by ranger to train with PD inputs and the target label.
Compile for ARM architecture and check that it can run on the flatsat.
Run ranger with the PD values and target label inputs.
Calculate the classification metrics of the trained model by using the same dataset used in the paper when evaluating the performance of the Mochi models.
Integrate ranger into the OrbitAI app.

We can discuss how we approach step 7 once step 1 to 6 have been completed.

georgeslabreche added the epic This issue is an epic and needs to be broken downs into smaller issues label May 5, 2021

georgeslabreche mentioned this issue May 5, 2021

Breakdown Epics into smaller issues #20

Closed

TanguySoto assigned TanguySoto, ghost and georgeslabreche May 6, 2021

TanguySoto added the enhancement This issue is related to a new feature label May 6, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Evaluate the ranger library for batch training with Random Forest #19

Evaluate the ranger library for batch training with Random Forest #19

georgeslabreche commented May 5, 2021

Evaluate the ranger library for batch training with Random Forest #19

Evaluate the ranger library for batch training with Random Forest #19

Comments

georgeslabreche commented May 5, 2021

Description

The ranger Libraries

The training input

What to do