
Baseline fix #1

Open

kudkudak opened this issue Mar 11, 2017 · 2 comments

kudkudak commented Mar 11, 2017

It turns out that it is important to calculate the inverse counts only from the tokens of the given passage, rather than to estimate them over all stories. This is needed to reproduce results similar to those in https://www.microsoft.com/en-us/research/wp-content/uploads/2016/11/MCTest_EMNLP2013.pdf. Performance changes as follows.

From:

                      correct
question_type subset
multiple      mc160  0.523438
              mc500  0.414634
one           mc160  0.508929
              mc500  0.477941

To:

                      correct
question_type subset
multiple      mc160  0.578125
              mc500  0.539634
one           mc160  0.678571
              mc500  0.544118

The fix is to calculate the inverse counts in SlidingWindow.predict_target as _icounts = compute_inverse_counts([tokens]).
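For context, a minimal sketch of what the corrected computation amounts to. Only the call compute_inverse_counts([tokens]) and the method name SlidingWindow.predict_target come from this issue; the function bodies below are assumptions following the sliding-window baseline of the MCTest paper, and the repo's actual signatures and tokenization may differ:

```python
import numpy as np
from collections import Counter

def compute_inverse_counts(passages):
    # IC(w) = log(1 + 1/C(w)), where C(w) counts occurrences of token w.
    # Per this fix, C(w) is computed only over the single passage being
    # scored (passages == [tokens]), not over all stories in the corpus.
    counts = Counter(tok for passage in passages for tok in passage)
    return {w: np.log(1.0 + 1.0 / c) for w, c in counts.items()}

def sliding_window_score(tokens, target):
    # Sliding-window score from Richardson et al. (2013): slide a window
    # of |target| tokens over the passage and take the best sum of the
    # inverse counts of window tokens that also appear in the target
    # (question words plus candidate-answer words).
    icounts = compute_inverse_counts([tokens])  # per-passage, as in the fix
    target_set = set(target)
    window = len(target)
    best = 0.0
    for j in range(len(tokens)):
        score = sum(icounts[tokens[j + w]]
                    for w in range(window)
                    if j + w < len(tokens) and tokens[j + w] in target_set)
        best = max(best, score)
    return best
```

With per-passage counts, a word that is rare in one story keeps a high IC weight there even if it is common across the corpus, which is what the paper's formulation prescribes.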

kudkudak changed the title from "Baseline" to "Baseline fix" on Mar 11, 2017
kudkudak (Author) commented:

Another fix: the baseline distance calculation should use float division, i.e. closest = np.abs(last_q - last_a) / (float(len(passage)) - 1). I understand that stopword removal is done during data preprocessing.
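A minimal sketch of the corrected line in context; only the one-liner above comes from the issue, while the enclosing function and the meaning of the variables are assumptions:

```python
import numpy as np

def normalized_distance(last_q, last_a, passage):
    # last_q, last_a: token indices in the passage of the question word
    # and the answer word being compared (hypothetical framing; only the
    # return expression is taken from the issue).
    # Under Python 2, omitting the float() cast makes / integer division,
    # which truncates every normalized distance to 0 and effectively
    # disables the distance term of the SW+D baseline.
    return np.abs(last_q - last_a) / (float(len(passage)) - 1)
```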


allanj commented Oct 11, 2017

I couldn't obtain the results you list there after applying the suggested modifications.
