[Research / Analysis] Fine-tune embedding model on tweet dataset #2

Open
AbrahamSanders opened this issue Apr 11, 2020 · 1 comment
Labels
research needs investigation or trial of one or more approaches

Comments

@AbrahamSanders
Collaborator

Currently we are using the pre-trained Universal Sentence Encoder (large) from TensorFlow Hub.

Open area for investigation:
The model parameters are marked trainable, so it should be possible to fine-tune on our own COVID tweet dataset.
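A minimal sketch of what that fine-tuning setup could look like, assuming a pairwise similarity objective over labeled tweet pairs (the similarity head, hub URL version, and hyperparameters below are placeholders, not anything already decided):

```python
import tensorflow as tf
import tensorflow_hub as hub

# Load USE (large) with trainable=True so its weights update during training.
use_layer = hub.KerasLayer(
    "https://tfhub.dev/google/universal-sentence-encoder-large/5",
    trainable=True,
)

# Two raw-string tweet inputs; USE does its own tokenization internally.
tweet_a = tf.keras.Input(shape=(), dtype=tf.string)
tweet_b = tf.keras.Input(shape=(), dtype=tf.string)

emb_a = use_layer(tweet_a)
emb_b = use_layer(tweet_b)

# Cosine similarity of the two embeddings as the output (placeholder head).
similarity = tf.keras.layers.Dot(axes=1, normalize=True)([emb_a, emb_b])

model = tf.keras.Model(inputs=[tweet_a, tweet_b], outputs=similarity)
model.compile(optimizer=tf.keras.optimizers.Adam(2e-5), loss="mse")
# model.fit([pairs_a, pairs_b], gold_scores, epochs=1, batch_size=16)
```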

Alternatively, explore fine-tuning other models such as BERT on a semantic similarity task, as done in Sentence-BERT.
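A rough sketch of that route with the sentence-transformers library (the base checkpoint, example pairs, and hyperparameters are all placeholders):

```python
from torch.utils.data import DataLoader
from sentence_transformers import SentenceTransformer, InputExample, losses

# Any BERT checkpoint can serve as the base encoder; mean pooling is added on top.
model = SentenceTransformer("bert-base-uncased")

# Placeholder tweet pairs with similarity labels in [0, 1].
train_examples = [
    InputExample(texts=["tweet one", "tweet two"], label=0.8),
    InputExample(texts=["tweet three", "tweet four"], label=0.1),
]
train_dataloader = DataLoader(train_examples, shuffle=True, batch_size=16)
train_loss = losses.CosineSimilarityLoss(model)

model.fit(train_objectives=[(train_dataloader, train_loss)],
          epochs=1, warmup_steps=100)
```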

Comparison of the base pre-trained vs. fine-tuned Universal Sentence Encoder (USE) can be done quantitatively or qualitatively; see #1. The same goes for comparing USE vs. BERT or any other model.
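One possible quantitative protocol (an illustration, not something specified in this issue): embed a held-out set of labeled tweet pairs with each model and report the Spearman correlation between cosine similarities and the gold scores.

```python
import numpy as np
from scipy.stats import spearmanr

def cosine(a, b):
    # Row-wise cosine similarity between two [n, d] embedding matrices.
    return np.sum(a * b, axis=1) / (
        np.linalg.norm(a, axis=1) * np.linalg.norm(b, axis=1)
    )

def similarity_correlation(embed_fn, pairs, gold_scores):
    # embed_fn: maps a list of strings to an [n, d] array (hypothetical helper).
    emb_a = embed_fn([a for a, _ in pairs])
    emb_b = embed_fn([b for _, b in pairs])
    return spearmanr(cosine(emb_a, emb_b), gold_scores).correlation

# e.g. compare base vs. fine-tuned USE on the same held-out pairs:
# base_corr  = similarity_correlation(base_use_embed, eval_pairs, eval_scores)
# tuned_corr = similarity_correlation(tuned_use_embed, eval_pairs, eval_scores)
```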

@AbrahamSanders AbrahamSanders added the research needs investigation or trial of one or more approaches label Apr 12, 2020
@AbrahamSanders
Collaborator Author

A good candidate for a pre-trained BERT model is covid-twitter-bert.
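If we go that way, it could plug into the Sentence-BERT setup above as the base encoder. A sketch, assuming the Hugging Face model id digitalepidemiologylab/covid-twitter-bert:

```python
from sentence_transformers import SentenceTransformer, models

# Wrap covid-twitter-bert as the word-embedding module, then mean-pool to get
# one fixed-size vector per tweet.
word_embedding = models.Transformer("digitalepidemiologylab/covid-twitter-bert")
pooling = models.Pooling(word_embedding.get_word_embedding_dimension())
model = SentenceTransformer(modules=[word_embedding, pooling])

# Fine-tune as in the Sentence-BERT sketch above (CosineSimilarityLoss, etc.).
```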
