gradOutput Ignored #43

parajain · 2017-12-22T06:48:18Z

Line 31 in 83a5f17

function ReinforceCategorical:updateGradInput(input, gradOutput)

Hi,
I am trying to understand the logic in reinforce implementation. I am new to this so please bear with my basic questions.
Why is gradOutput being ignored? If we multiply gradOutput with rewards, will it be wrong?
Also, what is happening here:self.gradInput:copy(self.output)? Output is a probability distribution right?

Thanks,
Parag

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

gradOutput Ignored #43

gradOutput Ignored #43

parajain commented Dec 22, 2017

gradOutput Ignored #43

gradOutput Ignored #43

Comments

parajain commented Dec 22, 2017