
Model.encode() error #7

Open

mertozlutiras opened this issue Apr 29, 2022 · 1 comment

@mertozlutiras
Problem:

When I run the sample code provided for evaluation, I run into the following error in the file supert.py:

[screenshot of the error traceback omitted]

Checking the SBERT documentation, I see that model.encode() only returns embeddings, not the tokens themselves.

I tried to use the AutoTokenizer from Hugging Face to get the tokens; however, I ran into an assertion error because the length of the tokens was not equal to the length of the embeddings.
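For reference, a minimal sketch of the mismatch I see, assuming a stock sentence-transformers model (the model name is only an example, and output_value="token_embeddings" is per recent sentence-transformers versions, not necessarily what supert.py expects):

```python
from sentence_transformers import SentenceTransformer
from transformers import AutoTokenizer

# example model, not necessarily the one supert.py loads
model = SentenceTransformer("bert-base-nli-stsb-mean-tokens")
tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")

sentence = "The cat sat on the mat."

# Per-token embeddings, trimmed to the non-padding length,
# which still includes the [CLS] and [SEP] special tokens.
token_embeddings = model.encode(
    [sentence], output_value="token_embeddings", convert_to_numpy=False
)[0]

# Tokenizing separately, without special tokens, gives a shorter list,
# so an assert on equal lengths fails.
tokens = tokenizer.tokenize(sentence)
print(len(tokens), token_embeddings.shape[0])  # e.g. 7 vs 9 ([CLS]/[SEP])
```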

Before doing more reverse engineering, I wanted to ask what you intended to achieve here and, if this is a deprecated usage, how we can fix it.

Thank you for your help!

@mertozlutiras (Author)

I couldn't really understand what is meant by all_tokens.

Why should the size of the token tensors be equal to the size of the embedding tensors?
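My current understanding, for what it's worth: if each token is supposed to map to exactly one embedding vector, the assertion would only hold when the tokens are rebuilt with the model's own tokenizer, so that special tokens and truncation match what the encoder actually saw. A sketch of what I would expect to line up, assuming model.tokenize and output_value="token_embeddings" from recent sentence-transformers (model name again only an example):

```python
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("bert-base-nli-stsb-mean-tokens")
sentences = ["The cat sat on the mat.", "A short one."]

# One tensor per sentence, with padding already trimmed off.
token_embeddings = model.encode(
    sentences, output_value="token_embeddings", convert_to_numpy=False
)

# Rebuild the tokens with the model's own tokenizer and strip the
# padding ids, so each token lines up with one embedding vector.
features = model.tokenize(sentences)
pad_id = model.tokenizer.pad_token_id
all_tokens = [
    model.tokenizer.convert_ids_to_tokens(
        [i for i in ids.tolist() if i != pad_id]
    )
    for ids in features["input_ids"]
]

for tokens, vecs in zip(all_tokens, token_embeddings):
    assert len(tokens) == vecs.shape[0]  # one embedding vector per token
```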
