Train seq2seq RNNs to generate syntactically valid inputs to the gold code, then use the gold code as an oracle to get the output for each generated input (if the input has correct syntax). Inside a try/except, the seq2seq receives the gold code as input and has to generate an input for it that doesn't set off an exception. Reward the seq2seq +1 if there is no exception, -1 if there is, and add an entropy term (or minibatch discrimination?) to the loss so the generations stay varied. To help it learn which syntax error it made, have the seq2seq also predict which exception it set off, and lessen the negative reward if the prediction is correct. Maybe SL-pretrain on already provided test cases (if we have any) before the RL stage (the entropy term in the RL loss will keep the RL stage varied). A rough sketch of the reward/oracle part is below.
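A minimal sketch of the oracle-based reward described above. It assumes the gold code is a Python source string exposing a function named `solve` and that the generated input is a Python literal; the names `oracle_reward`, `solve`, and the -0.5 reduced penalty are hypothetical choices for illustration, not part of this proposal.

```python
import ast
from typing import Optional


def oracle_reward(gold_source: str, generated_input: str,
                  predicted_exception: Optional[str] = None) -> float:
    """+1 if the gold code runs on the generated input without raising,
    -1 if it raises; the penalty is lessened when the model correctly
    predicted the name of the exception class it triggered."""
    namespace = {}
    try:
        # Parse the generated input; a syntax error here is penalized too.
        parsed = ast.literal_eval(generated_input)
        # Load the gold code and query it as an oracle.
        exec(gold_source, namespace)
        namespace["solve"](parsed)
        return 1.0
    except Exception as exc:
        if predicted_exception == type(exc).__name__:
            return -0.5  # reduced penalty for correctly self-diagnosing the failure
        return -1.0


if __name__ == "__main__":
    gold = "def solve(x):\n    return 10 / x\n"
    print(oracle_reward(gold, "5"))                       # 1.0  (no exception)
    print(oracle_reward(gold, "0"))                       # -1.0 (ZeroDivisionError, unpredicted)
    print(oracle_reward(gold, "0", "ZeroDivisionError"))  # -0.5 (exception correctly predicted)
```

This scalar would then feed a REINFORCE-style policy-gradient update, with the entropy bonus added to the same loss to keep the sampled inputs diverse.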