Brulee neural networks with two hidden layers #1187
base: main
Conversation
This reverts commit bec0423.
Leaving a note for both of us that I'm holding off on review until the tests are in extratests and you've had a chance to go through the "Add a new parsnip model/engine" checklist of tidymodels/tidymodels#97 👍
Hello and welcome, new engine! 😍 I have two outstanding items from the checklist that I'd like to see included, and I am making the case for sticking with established code patterns, given the size of parsnip. Other than that, only a couple of minor things, down to nits for readability.
The one failing GHA is a speed test failure which we should look into, but not as part of this PR - I don't see how adding this engine would have caused it or how we would speed up the engine registration in this PR.
From the checklist:
- If the engine is added in parsnip, update `vignettes/articles/Examples.Rmd`. If the engine is added in an extension package: is there a corresponding place that needs updating (e.g., does the README list available models/engines)?
- Add a documentation entry in NEWS.md
@@ -0,0 +1,11 @@
#' Multilayer perceptron via brulee with two hidden layers
#'
#' [brulee::brulee_mlp_two_layer()] fits a neural network (with version 0.3.0.9000 or higher of brulee)
#' [brulee::brulee_mlp_two_layer()] fits a neural network (with version 0.3.0.9000 or higher of brulee)
#' [brulee::brulee_mlp_two_layer()] fits a neural network
This is going to look outdated very soon. Let's only specify the version requirement in code.
Shouldn't brulee be in Suggests in the DESCRIPTION? 🤔
set_model_engine("mlp", "classification", "brulee_two_layer") | ||
set_model_engine("mlp", "regression", "brulee_two_layer") | ||
set_dependency("mlp", "brulee_two_layer", "brulee") |
This is missing the specification of the mode.
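To make the ask concrete, here is a minimal sketch of what I have in mind; the `mode` argument of `set_dependency()` and the exact call shape are assumptions on my part, so please mirror whatever the existing single-layer brulee registration does:

```r
# Sketch only: register the brulee dependency once per mode, assuming
# set_dependency() accepts a `mode` argument (as in recent parsnip versions).
set_dependency("mlp", eng = "brulee_two_layer", pkg = "brulee", mode = "classification")
set_dependency("mlp", eng = "brulee_two_layer", pkg = "brulee", mode = "regression")
```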
set_model_arg(
  model = "mlp",
  eng = "brulee_two_layer",
  parsnip = "epochs",
  original = "epochs",
  func = list(pkg = "dials", fun = "epochs"),
  has_submodel = FALSE
)

set_model_arg(
  model = "mlp",
  eng = "brulee_two_layer",
  parsnip = "dropout",
  original = "dropout",
  func = list(pkg = "dials", fun = "dropout"),
  has_submodel = FALSE
)
Nit: can we list them in the same order as they are arguments to `mlp()`? That would mean first `dropout`, then `epochs`.
set_model_arg(
  model = "mlp",
  eng = "brulee_two_layer",
  parsnip = "learn_rate",
  original = "learn_rate",
  func = list(pkg = "dials", fun = "learn_rate", range = c(-2.5, -0.5)),
  has_submodel = FALSE
)

set_model_arg(
  model = "mlp",
  eng = "brulee_two_layer",
  parsnip = "activation",
  original = "activation",
  func = list(pkg = "dials", fun = "activation", values = c('relu', 'elu', 'tanh')),
  has_submodel = FALSE
)
Same nit here: first `activation`, then `learn_rate`.
Parsnip changes the default range for `learn_rate` to `c(-2.5, -0.5)`.

- `rate_schedule()`: A function to change the learning rate over epochs. See [brulee::schedule_decay_time()] for details.
Why do the argument names have parentheses after them? That makes these look like functions. If that's intentional, then let's explain why. Otherwise, let's remove the parentheses.
translate()
```

Note that parsnip automatically sets linear activation in the last layer. |
Note that parsnip automatically sets linear activation in the last layer.
Note that parsnip automatically sets the linear activation in the last layer.
@@ -246,34 +245,19 @@ tunable.linear_reg <- function(x, ...) {
    res$call_info[res$name == "mixture"] <-
      list(list(pkg = "dials", fun = "mixture", range = c(0.05, 1.00)))
  } else if (x$engine == "brulee") {
    res <- add_engine_parameters(res, brulee_linear_engine_args)
Can we stick with this established pattern of using `add_engine_parameters()`?
  }
  res
}

tunable.logistic_reg <- tunable.linear_reg
Can we keep the separate definitions of `tunable()` methods per model type? It is a bit of code duplication, but it allows us to add tunable engine arguments for different model types independently. I think that modularisation is worth the duplication.
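As a sketch of what I mean (hypothetical body; `brulee_logistic_engine_args` is assumed to exist alongside `brulee_linear_engine_args`):

```r
# Sketch only: keep a standalone method instead of aliasing tunable.linear_reg,
# so the engine arguments for logistic_reg can diverge later without touching linear_reg.
tunable.logistic_reg <- function(x, ...) {
  res <- NextMethod()
  if (x$engine == "brulee") {
    res <- add_engine_parameters(res, brulee_logistic_engine_args)
  }
  res
}
```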
list(list(pkg = "dials", fun = "learn_rate", range = c(-3, -1/2))) | ||
res$call_info[res$name == "epochs"] <- | ||
list(list(pkg = "dials", fun = "epochs", range = c(5L, 500L))) | ||
if (grepl("brulee", x$engine)) { |
Here again, I'd argue for sticking with the pattern, given how big parsnip is by now. If we think a different pattern is better, we should do that in a separate issue-PR pair for all models/engines.
} else if (any(names(x) == "values")) {
  cl <- rlang::call2(x$fun, .ns = x$pkg, values = x$values)
This is reaching the length where it might be nice to just splice the remaining components into the call, but I'm also okay with this staying as is.
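For reference, splicing could look something like the sketch below; the assumption is that the remaining components are everything in `x` other than `fun` and `pkg`:

```r
# Sketch only: splice any extra tuning components (e.g. range or values)
# into the call instead of branching on each one.
extras <- x[setdiff(names(x), c("fun", "pkg"))]
cl <- rlang::call2(x$fun, .ns = x$pkg, !!!extras)
```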
A complement to tidymodels/brulee#84 and the previous tidymodels/brulee#80
Tests will be in extratests