
Clarification on training inputs and outputs #12

Open
DocDriven opened this issue Oct 23, 2021 · 0 comments


The HLD is helpful, but what I think is still hard to understand is how you actually train the model, i.e., what the inputs and outputs of the model that are used to calculate the loss look like.

Following the diagram and assuming one input of each type of feature (binary, numeric, categorical), you have 3 inputs looking like this:

[ 1 
  44
  9 ]

These are the inputs of your model, which are initially transformed and concatenated into a vector with 8 features, like:

[ 1        // binary
  0.22     // rescaled
  0.2      // rest is random embedding of emb size 6
 -0.12
  0.65
  0.11
 -0.96
 -1.01 ]
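If I understand the diagram correctly, the transform above could be sketched roughly like this (the numeric range for rescaling, the number of categories, and the random embedding table are all assumptions on my part):

```python
import numpy as np

rng = np.random.default_rng(0)

# Raw inputs: one binary, one numeric, one categorical feature
x_binary = 1
x_numeric = 44
x_categorical = 9  # category index

# Hypothetical numeric range for min-max rescaling
num_min, num_max = 0, 200
x_rescaled = (x_numeric - num_min) / (num_max - num_min)  # 0.22

# Hypothetical embedding table: 20 categories, embedding size 6
embedding_table = rng.normal(size=(20, 6))
x_embedded = embedding_table[x_categorical]

# Concatenate into the 8-feature vector fed to the autoencoder
x_concat = np.concatenate([[x_binary], [x_rescaled], x_embedded])
print(x_concat.shape)  # (8,)
```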

As I understand it, you use an autoencoder to reconstruct the concatenated layer, i.e., you minimize the reconstruction error between the concatenated layer and the output layer, both with 8 features. But doesn't this completely ignore the training of the embedding?
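To make my question concrete, here is a minimal sketch of the reconstruction error I mean, with placeholder values for the decoder output (all numbers here are made up). My understanding is that if the embedding lookup is part of the computation graph, the gradient of this loss would also flow back into the embedding table, so the embedding would be trained jointly rather than ignored — is that what happens here?

```python
import numpy as np

# 8-feature concatenated input (from the example above) and the
# autoencoder's reconstruction of it (placeholder values)
x_concat = np.array([1, 0.22, 0.2, -0.12, 0.65, 0.11, -0.96, -1.01])
x_reconstructed = np.array([0.9, 0.25, 0.15, -0.1, 0.6, 0.1, -0.9, -1.0])

# Mean squared reconstruction error over all 8 features,
# including the 6 embedding dimensions
loss = np.mean((x_concat - x_reconstructed) ** 2)
print(loss)
```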

Would you be so kind as to give a simple example of which errors you minimize between which inputs and outputs (concrete values are also fine)?

Thanks a lot!
