
Is word-level bi-LSTM reflected in the code? #58

Open
unktkn opened this issue Dec 30, 2018 · 1 comment


unktkn commented Dec 30, 2018

Hi there,

Excuse the naive question; I am having a bit of trouble understanding how the word-level bi-directional LSTM (before the CRF layer) in the paper is reflected in the code. To be specific, see the arrows in the image below.

[image: model architecture from the paper]

Why is there only one word_lstm in lm_lstm_crf.py?

I see that in train_wc.py, if co_train is enabled, a bi-directional word-level LSTM is applied, but isn't this after the CRF layer? Shouldn't it come before, with the concatenated output passed to the CRF, as shown in the model architecture in the paper?
Maybe I am missing something really big & obvious :/

Thanks,

@nooralahzadeh

By selecting co_train you just add the loss coming from the LM to the training objective; the bi-directional word-level LSTM has already been applied in the NER model in lm_lstm_crf.py, starting from the following line in the code:

word_input = torch.cat((d_word_emb, d_char_out), dim = 2)
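A minimal sketch of the surrounding flow, with illustrative dimensions and names that are assumptions rather than the repository's actual code: the concatenated word_input feeds a single nn.LSTM created with bidirectional=True, which covers both the forward and backward arrows in the figure (hence only one word_lstm), and its output is what the CRF layer scores.

```python
import torch
import torch.nn as nn

# Illustrative sizes only; not the repository's actual hyper-parameters.
word_dim, char_out_dim, hidden_dim, num_tags = 100, 50, 300, 17
batch, seq_len = 4, 20

d_word_emb = torch.randn(batch, seq_len, word_dim)      # word embeddings (after dropout)
d_char_out = torch.randn(batch, seq_len, char_out_dim)  # char-level LSTM features (after dropout)

# The line quoted above: concatenate word- and character-level features.
word_input = torch.cat((d_word_emb, d_char_out), dim=2)

# A single nn.LSTM with bidirectional=True is both the forward and backward
# word-level LSTM from the figure, which is why the code has one word_lstm.
word_lstm = nn.LSTM(word_dim + char_out_dim, hidden_dim // 2,
                    bidirectional=True, batch_first=True)
lstm_out, _ = word_lstm(word_input)                      # (batch, seq_len, hidden_dim)

# The bi-LSTM output is projected to per-token tag scores; this Linear layer
# stands in for the emission scores that the CRF layer consumes.
emissions = nn.Linear(hidden_dim, num_tags)(lstm_out)
```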
