Skip to content

Commit

Permalink
fix
Browse files Browse the repository at this point in the history
  • Loading branch information
lena-voita committed Aug 26, 2020
1 parent 56765e2 commit acf0b9a
Showing 1 changed file with 21 additions and 0 deletions.
21 changes: 21 additions & 0 deletions nlp_course/word_embeddings.html
Original file line number Diff line number Diff line change
Expand Up @@ -1031,6 +1031,27 @@ <h3><u>Relation to PMI Matrix Factorization</u></h3>
</div>
</div>

<h3><u>The Effect of Window Size</u></h3>

<p>The size of the sliding window has a strong effect on the resulting
vector similarities.
For example, <a href="https://arxiv.org/pdf/1510.00726.pdf">this paper</a> notes that
larger windows tend to produce more topical similarities
(i.e. <font class="data_text"><strong>dog</strong></font>,
<font class="data_text"><strong>bark</strong></font> and
<font class="data_text"><strong>leash</strong></font> will be grouped together,
as well as
<font class="data_text"><strong>walked</strong></font>,
<font class="data_text"><strong>run</strong></font> and
<font class="data_text"><strong>walking</strong></font>),
while smaller windows tend to produce more functional and syntactic similarities
(i.e. <font class="data_text"><strong>Poodle</strong></font>,
<font class="data_text"><strong>Pitbull</strong></font>,
<font class="data_text"><strong>Rottweiler</strong></font>, or
<font class="data_text"><strong>walking</strong></font>,
<font class="data_text"><strong>running</strong></font>,
<font class="data_text"><strong>approaching</strong></font>).</p>

<h3><u>(Somewhat) Standard Hyperparameters</u></h3>
As always, the choice of hyperparameters usually depends on the task at hand;
you can look at the original papers for more details.
Expand Down

0 comments on commit acf0b9a

Please sign in to comment.