Add an unsupervised warm-up for the models #140
base: master
Conversation
This is super cool Gabi! Thanks so much. Just reviewing now.
This is amazing work dude!! Just a few qs - thanks so much for implementing this.
```python
neighbour_indices: List[int] = []
distant_indices: List[int] = []

outer_distance = tuple(multiplier * val for val in distance)
```
What's the role of the multiplier?
It's basically to enforce a minimum distance between the neighbouring instance and the distant instance. The neighbour will be within neighbouring_distance of the anchor; the distant instance will be further than multiplier * neighbouring_distance from the anchor.
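Roughly like this (an illustrative sketch, not the PR's exact code: it collapses the per-dimension distance tuple into a single radius, and the function name is made up):

```python
import numpy as np

def split_indices(anchor: np.ndarray,
                  latlons: np.ndarray,
                  neighbouring_distance: float,
                  multiplier: float = 2.0):
    # distance of every instance from the anchor (single-radius simplification)
    dists = np.linalg.norm(latlons - anchor, axis=1)
    # neighbours: within neighbouring_distance of the anchor
    neighbour_indices = np.where(dists <= neighbouring_distance)[0].tolist()
    # distant: further than multiplier * neighbouring_distance from the anchor
    distant_indices = np.where(dists > multiplier * neighbouring_distance)[0].tolist()
    return neighbour_indices, distant_indices
```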
Gotcha! So basically enforcing how large an area our spatial differences should be over?
yup!
```python
@@ -288,11 +289,14 @@ def forward(
    x = self.rnn_dropout(hidden_state[:, -1, :])

    if return_embedding:
```
Is this for interpreting the static embedding?
No - the loss in tile2vec compares the embedding, not the final value. This is to return that embedding for the loss, before it gets put through the final linear layer.
Yes, makes sense! Could this be used for interpreting the embedding layer too, though?
Yea, 100%. Although here the "embedding" is the final output of the model before the linear regression layer.
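Schematically, the forward pass looks like this (a sketch; names not in the diff above, like self.head, are illustrative):

```python
def forward(self, x, return_embedding: bool = False):
    hidden_state, _ = self.rnn(x)                 # (batch, seq_len, hidden_size)
    x = self.rnn_dropout(hidden_state[:, -1, :])  # last timestep: the "embedding"
    if return_embedding:
        # tile2vec's loss compares embeddings, so skip the final linear layer
        return x
    return self.head(x)                           # final linear regression layer
```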
```python
# initialize the model
if self.model is None:
    x_ref, _, _ = next(iter(train_dataloader))
    model = self._initialize_model(self._input_to_tuple(x_ref))
```
Does this train the LSTM model? Don't we need to initialise with a CNN, as they use in Tile2Vec?
The principles of tile2vec can be used with any model that takes a raw input and outputs an embedding. So yea, in this case it can also train the (EA)LSTM model.
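The objective itself is tile2vec's triplet loss over (anchor, neighbour, distant) embeddings. A minimal sketch (the function name and signature are illustrative, not this PR's API):

```python
import torch

def triplet_loss(anchor: torch.Tensor,
                 neighbour: torch.Tensor,
                 distant: torch.Tensor,
                 margin: float = 1.0) -> torch.Tensor:
    positive = torch.norm(anchor - neighbour, dim=-1)  # pull the neighbour close
    negative = torch.norm(anchor - distant, dim=-1)    # push the distant instance away
    return torch.relu(positive - negative + margin).mean()
```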
Okay, gotcha.
So have I interpreted this correctly:
"We use the unsupervised learning algorithm described in Tile2Vec to pretrain (initialise) the weights of the EALSTM. This allows us to produce weights in the network that produce sensible spatial patterns. Mainly that pixels close together are more similar than pixels that are far apart."
Yea, that's exactly right.
Inspired by tile2vec, this pretrains the models by training them to make embeddings of instances that are far apart more different than embeddings of instances that are close together.
It's a less rigid way of communicating the latlon information to the models.
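Roughly, the warm-up looks like this (a sketch: model and warmup_dataloader are placeholders, and the dataloader is assumed to yield (anchor, neighbour, distant) batches):

```python
import torch

# model and warmup_dataloader stand in for whatever the PR wires up
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
margin = 1.0
for anchor, neighbour, distant in warmup_dataloader:
    a = model(anchor, return_embedding=True)
    n = model(neighbour, return_embedding=True)
    d = model(distant, return_embedding=True)
    # embeddings of nearby instances should end up more similar than distant ones
    loss = torch.relu(
        torch.norm(a - n, dim=-1) - torch.norm(a - d, dim=-1) + margin
    ).mean()
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
```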