Benchmarks for LaBSE #10

loretoparisi · 2020-07-14T19:25:03Z

It could be worth to add benchmarks for the new Language-agnostic BERT Sentence Embedding (LaBSE)

https://arxiv.org/pdf/2007.01852.pdf

The model is available already on tensorflow hub

MastafaF · 2020-08-04T13:17:32Z

I have been off on vacation for a while. Will look into it as soon as I have a bit of time. Thanks for the suggestion! 😄

loretoparisi · 2020-08-19T08:37:55Z

Thanks a lot, that would be very interesting also because recently a official blog post came out, pointing out a comparison among cross-lingual models (tatoeba dataset)

Model 14 Langs 36 Langs 82 Langs All Langs
m~USE* 93.9 — — —
LASER 95.3 84.4 75.9 65.5
LaBSE 95.3 95.0 87.3 83.7

The interesting part is about the support to "unsupported" languages

"...For one third of these languages the LaBSE accuracy is higher than 75% and only 8 have accuracy lower than 25%, indicating very strong transfer performance to languages without training data"

and - my opinion - minor/low resources languages.

If I can help, let me know.

Thank you!

MastafaF · 2020-10-29T15:44:08Z

Hi @loretoparisi ,

Sorry for the late answer, it would be great to see the results on LaBSE. Unfortunately I do not have much capacity lately. Did you take a close look at it yourself? It should be fairly simple to replicate the propose work in this repo if LaBSE has been integrated to pytorch. Happy to connect on that. 😃

loretoparisi · 2020-10-29T16:36:49Z

@MastafaF thank you anyways, I will have a look.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Benchmarks for LaBSE #10

Benchmarks for LaBSE #10

loretoparisi commented Jul 14, 2020

MastafaF commented Aug 4, 2020

loretoparisi commented Aug 19, 2020 •

edited

Loading

MastafaF commented Oct 29, 2020

loretoparisi commented Oct 29, 2020

Benchmarks for LaBSE #10

Benchmarks for LaBSE #10

Comments

loretoparisi commented Jul 14, 2020

MastafaF commented Aug 4, 2020

loretoparisi commented Aug 19, 2020 • edited Loading

MastafaF commented Oct 29, 2020

loretoparisi commented Oct 29, 2020

loretoparisi commented Aug 19, 2020 •

edited

Loading