From ad82bb2c48564a6f23e005e01c89e4c12318510c Mon Sep 17 00:00:00 2001
From: peng_windows <2686728826@qq.com>
Date: Tue, 26 Mar 2024 12:26:17 +0800
Subject: [PATCH 1/6] llm-introduction-attention-transformer
---
.../_config.yml | 1 +
open-machine-learning-jupyter-book/_toc.yml | 9 +
.../llm/basic/transformer-architecture.ipynb | 730 +
.../llm/basic/attention.ipynb | 367 +
.../llm/basic/basic.ipynb | 64 +
.../llm/basic/transformer.ipynb | 20020 ++++++++++++++++
.../llm/image/attention_example.svg | 9628 ++++++++
.../llm/image/cifar100_example_anomaly.png | Bin 0 -> 155392 bytes
.../llm/image/comparison_conv_rnn.svg | 1809 ++
.../llm/image/implicit-order.png | Bin 0 -> 34807 bytes
.../llm/image/llm.png | Bin 0 -> 166613 bytes
.../llm/image/multihead_attention.svg | 288 +
.../llm/image/scaled_dot_product_attn.svg | 351 +
.../llm/image/scaling-laws.png | Bin 0 -> 86847 bytes
.../llm/image/transformer_architecture.svg | 118 +
.../llm/image/warmup_loss_plot.svg | 1579 ++
.../llm/introduction.ipynb | 155 +
17 files changed, 35119 insertions(+)
create mode 100644 open-machine-learning-jupyter-book/assignments/llm/basic/transformer-architecture.ipynb
create mode 100644 open-machine-learning-jupyter-book/llm/basic/attention.ipynb
create mode 100644 open-machine-learning-jupyter-book/llm/basic/basic.ipynb
create mode 100644 open-machine-learning-jupyter-book/llm/basic/transformer.ipynb
create mode 100644 open-machine-learning-jupyter-book/llm/image/attention_example.svg
create mode 100644 open-machine-learning-jupyter-book/llm/image/cifar100_example_anomaly.png
create mode 100644 open-machine-learning-jupyter-book/llm/image/comparison_conv_rnn.svg
create mode 100644 open-machine-learning-jupyter-book/llm/image/implicit-order.png
create mode 100644 open-machine-learning-jupyter-book/llm/image/llm.png
create mode 100644 open-machine-learning-jupyter-book/llm/image/multihead_attention.svg
create mode 100644 open-machine-learning-jupyter-book/llm/image/scaled_dot_product_attn.svg
create mode 100644 open-machine-learning-jupyter-book/llm/image/scaling-laws.png
create mode 100644 open-machine-learning-jupyter-book/llm/image/transformer_architecture.svg
create mode 100644 open-machine-learning-jupyter-book/llm/image/warmup_loss_plot.svg
create mode 100644 open-machine-learning-jupyter-book/llm/introduction.ipynb
diff --git a/open-machine-learning-jupyter-book/_config.yml b/open-machine-learning-jupyter-book/_config.yml
index e7464e4cd..a0f9b925d 100644
--- a/open-machine-learning-jupyter-book/_config.yml
+++ b/open-machine-learning-jupyter-book/_config.yml
@@ -23,6 +23,7 @@ execute:
- 'ml-advanced/unsupervised-learning-pca-and-clustering.ipynb'
- 'ml-advanced/unsupervised-learning.ipynb'
- 'data-science/data-science-in-the-cloud/the-azure-ml-sdk-way.ipynb'
+ - 'llm/basic/transformer.ipynb'
parse:
myst_enable_extensions:
diff --git a/open-machine-learning-jupyter-book/_toc.yml b/open-machine-learning-jupyter-book/_toc.yml
index a09d32cd1..5c9b8c367 100644
--- a/open-machine-learning-jupyter-book/_toc.yml
+++ b/open-machine-learning-jupyter-book/_toc.yml
@@ -122,6 +122,14 @@ parts:
- file: machine-learning-productionization/data-engineering
- file: machine-learning-productionization/model-training-and-evaluation
- file: machine-learning-productionization/model-deployment
+- caption: Large Language Models
+ numbered: True
+ chapters:
+ - file: llm/introduction
+ - file: llm/basic/basic
+ sections:
+ - file: llm/basic/attention
+ - file: llm/basic/transformer
- caption: OTHERS
numbered: True
maxdepth: 1
@@ -237,6 +245,7 @@ parts:
- file: assignments/deep-learning/nlp/getting-start-nlp-with-classification-task
- file: assignments/deep-learning/nlp/beginner-guide-to-text-preprocessing
- file: assignments/deep-learning/nlp/news-topic-classification-tasks
+ - file: assignments/llm/basic/transformer-architecture
- file: slides/introduction
sections:
- file: slides/python-programming/python-programming-introduction
diff --git a/open-machine-learning-jupyter-book/assignments/llm/basic/transformer-architecture.ipynb b/open-machine-learning-jupyter-book/assignments/llm/basic/transformer-architecture.ipynb
new file mode 100644
index 000000000..274f60583
--- /dev/null
+++ b/open-machine-learning-jupyter-book/assignments/llm/basic/transformer-architecture.ipynb
@@ -0,0 +1,730 @@
+{
+ "cells": [
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "# Complete the transformer architecture"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "# set up the env\n",
+ "\n",
+ "import pytest\n",
+ "import ipytest\n",
+ "import unittest\n",
+ "\n",
+ "ipytest.autoconfig()"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "## Transformer Model\n",
+ "\n",
+ "The encoder-decoder architecture based on the Transformer structure is illustrated in the figure below. The left and right sides correspond to the encoder and decoder, respectively. Each consists of a basic Transformer block (represented by the gray boxes in the figure) stacked N times.\n",
+ "\n",
+ "Here's an overview of the key components and processes involved in the semantic abstraction process from input to output:\n",
+ "\n",
+ "**Encoder:**\n",
+ "\n",
+ "- The encoder takes an input sequence $\\{x_i\\}_{i=1}^{t}$, where each $x_i$ is the representation of a word in the text sequence.\n",
+ "- It consists of stacked Transformer blocks. Each block includes:\n",
+ "  - **Attention Layer**: Uses multi-head attention to capture dependencies between words in the input sequence, which allows long-range dependencies to be modeled without recurrent structures.\n",
+ "  - **Position-wise Feedforward Layer**: Applies a non-linear transformation to the representation of each word independently.\n",
+ "  - **Residual Connections**: Add the input of the attention and feedforward layers to their output, which aids information flow and optimization.\n",
+ "  - **Layer Normalization**: Normalizes the representations around the attention and feedforward layers, which stabilizes optimization.\n",
+ "\n",
+ "**Decoder:**\n",
+ "- The decoder generates an output sequence $\\{y_i\\}_{i=1}^{t}$ based on the representations learned by the encoder.\n",
+ "- Like the encoder, it consists of stacked Transformer blocks with the components described above, plus an additional attention mechanism that attends to the encoder's output to incorporate context information during sequence generation.\n",
+ "\n",
+ "Overall, the Transformer encoder-decoder architecture combines attention mechanisms, position-wise feedforward layers, residual connections, and layer normalization. This lets the model capture complex dependencies between words in the input sequence and generate outputs for a variety of sequence-to-sequence tasks.\n",
+ "\n",
+ ":::{figure} https://media.geeksforgeeks.org/wp-content/uploads/20230531140926/Transformer-python-(1).png\n",
+ "---\n",
+ "\n",
+ "width: 90%\n",
+ "---\n",
+ "Transformer-based encoder and decoder Architecture\n",
+ ":::\n",
+ "\n",
+ "Next, we'll discuss the specific functionalities and implementation methods of each module in detail."
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "## Embedding Layer\n",
+ "\n",
+ "The Embedding Layer in the Transformer model converts discrete token indices into continuous vector representations. Each token index is mapped to a high-dimensional vector that is learned during training, and these embeddings capture semantic and syntactic information about the tokens. Because attention by itself does not see token order, a positional encoding is added to the scaled embeddings; the module below implements this step.\n",
+ "\n",
+ "Implementation in PyTorch:"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "import torch\n",
+ "import torch.nn as nn\n",
+ "import math\n",
+ "\n",
+ "class PositionalEncoder(nn.Module):\n",
+ "    def __init__(self, d_model, max_seq_len=80):\n",
+ "        super().__init__()\n",
+ "        self.d_model = d_model\n",
+ "        ## Create a constant PE matrix; i is the even index 2k, so sin and cos both use exponent i / d_model\n",
+ "        pe = torch.zeros(max_seq_len, d_model)\n",
+ "        for pos in range(max_seq_len):\n",
+ "            for i in range(0, d_model, 2):\n",
+ "                pe[pos, i] = math.sin(pos / (10000 ** (i / d_model)))\n",
+ "                pe[pos, i + 1] = math.cos(pos / (10000 ** (i / d_model)))\n",
+ "        pe = pe.unsqueeze(0)\n",
+ "        self.register_buffer('pe', pe)\n",
+ "\n",
+ "    def forward(self, x):\n",
+ "        ## Scale word embedding representations\n",
+ "        x = x * math.sqrt(self.d_model)\n",
+ "        ## Add positional constants to word embedding representations\n",
+ "        seq_len = x.size(1)\n",
+ "        pe = self.pe[:, :seq_len].to(x.device)  # buffer: needs no gradient, follows the input's device\n",
+ "        x = x + pe\n",
+ "        return x"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "Check result by executing below... 📝 "
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "metadata": {
+ "jupyter": {
+ "source_hidden": true
+ },
+ "tags": [
+ "hide-input"
+ ]
+ },
+ "outputs": [],
+ "source": [
+ "%%ipytest -qq\n",
+ "\n",
+ "class TestPositionalEncoder(unittest.TestCase):\n",
+ "    def setUp(self):\n",
+ "        self.d_model = 512\n",
+ "        self.max_seq_len = 10  # Maximum sequence length for testing\n",
+ "        self.positional_encoder = PositionalEncoder(self.d_model, self.max_seq_len)\n",
+ "\n",
+ "    def test_forward(self):\n",
+ "        # Create a sample input tensor representing word embeddings\n",
+ "        batch_size = 2\n",
+ "        seq_length = 5\n",
+ "        word_embeddings = torch.randn(batch_size, seq_length, self.d_model)\n",
+ "\n",
+ "        # Forward pass through the PositionalEncoder module\n",
+ "        output = self.positional_encoder(word_embeddings)\n",
+ "\n",
+ "        # Check if the output shape matches the input shape\n",
+ "        assert output.shape == (batch_size, seq_length, self.d_model)\n",
+ "\n",
+ "        # Check if positional encoding is correctly applied\n",
+ "        # Example: at position 0, dimension 0 the encoding is sin(0) = 0, so the output equals the scaled embedding\n",
+ "        expected_first_element = word_embeddings[0, 0, 0] * math.sqrt(self.d_model) + math.sin(0.0)\n",
+ "        assert math.isclose(output[0, 0, 0].item(), expected_first_element.item(), rel_tol=1e-5)\n"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "In this code:\n",
+ "\n",
+ "- We define a `PositionalEncoder` class that inherits from `nn.Module`.\n",
+ "- The constructor builds the positional encoding matrix (`pe`) from the given `d_model` (dimension of the model) and `max_seq_len` (maximum sequence length).\n",
+ "- The `forward` method scales the input embeddings (`x`) by the square root of the model dimension and adds the positional encoding matrix (`pe`) to them.\n",
+ "- The positional encoding is a fixed constant rather than a trainable parameter: it is registered as a buffer and requires no gradient.\n",
+ "- The `PositionalEncoder` class can be used within a larger PyTorch model to incorporate positional information into word embeddings."
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "## Attention Layer\n",
+ "The Attention Layer in the Transformer model enables the model to focus on different parts of the input sequence when processing each token. It computes attention scores between each pair of tokens in the input sequence and generates a context vector for each token based on the importance of other tokens. This mechanism allows the model to capture long-range dependencies in the input sequence effectively.\n",
+ "\n",
+ "Implementation in PyTorch:"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "import torch\n",
+ "import torch.nn as nn\n",
+ "import torch.nn.functional as F\n",
+ "import math\n",
+ "\n",
+ "class MultiHeadAttention(nn.Module):\n",
+ "    def __init__(self, heads, d_model, dropout=0.1):\n",
+ "        super().__init__()\n",
+ "        self.d_model = d_model\n",
+ "        self.d_k = d_model // heads\n",
+ "        self.h = heads\n",
+ "        self.q_linear = nn.Linear(d_model, d_model)\n",
+ "        self.v_linear = nn.Linear(d_model, d_model)\n",
+ "        self.k_linear = nn.Linear(d_model, d_model)\n",
+ "        self.dropout = nn.Dropout(dropout)\n",
+ "        self.out = nn.Linear(d_model, d_model)\n",
+ "\n",
+ "    def attention(self, q, k, v, d_k, mask=None, dropout=None):\n",
+ "        scores = torch.matmul(q, k.transpose(-2, -1)) / math.sqrt(d_k)\n",
+ "        if mask is not None:\n",
+ "            mask = mask.unsqueeze(1)\n",
+ "            scores = scores.masked_fill(mask == 0, -1e9)\n",
+ "        scores = F.softmax(scores, dim=-1)\n",
+ "        if dropout is not None:\n",
+ "            scores = dropout(scores)\n",
+ "        output = torch.matmul(scores, v)\n",
+ "        return output\n",
+ "\n",
+ "    def forward(self, q, k, v, mask=None):\n",
+ "        bs = q.size(0)\n",
+ "        k = self.k_linear(k).view(bs, -1, self.h, self.d_k)\n",
+ "        q = self.q_linear(q).view(bs, -1, self.h, self.d_k)\n",
+ "        v = self.v_linear(v).view(bs, -1, self.h, self.d_k)\n",
+ "        k = k.transpose(1, 2)\n",
+ "        q = q.transpose(1, 2)\n",
+ "        v = v.transpose(1, 2)\n",
+ "        scores = self.attention(q, k, v, self.d_k, mask, self.dropout)\n",
+ "        concat = scores.transpose(1, 2).contiguous().view(bs, -1, self.d_model)\n",
+ "        output = self.out(concat)\n",
+ "        return output"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "Check result by executing below... 📝 "
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "metadata": {
+ "jupyter": {
+ "source_hidden": true
+ },
+ "tags": [
+ "hide-input"
+ ]
+ },
+ "outputs": [],
+ "source": [
+ "%%ipytest -qq\n",
+ "\n",
+ "class TestMultiHeadAttention(unittest.TestCase):\n",
+ "    def test_forward(self):\n",
+ "        # Instantiate MultiHeadAttention module\n",
+ "        heads = 4\n",
+ "        d_model = 64\n",
+ "        dropout = 0.1\n",
+ "        multihead_attn = MultiHeadAttention(heads, d_model, dropout)\n",
+ "\n",
+ "        # Create sample input tensors\n",
+ "        batch_size = 2\n",
+ "        seq_length = 5\n",
+ "        q = torch.randn(batch_size, seq_length, d_model)\n",
+ "        k = torch.randn(batch_size, seq_length, d_model)\n",
+ "        v = torch.randn(batch_size, seq_length, d_model)\n",
+ "        mask = torch.randint(0, 2, (batch_size, 1, seq_length))  # Example mask tensor\n",
+ "\n",
+ "        # Forward pass through the MultiHeadAttention module\n",
+ "        output = multihead_attn(q, k, v, mask)\n",
+ "\n",
+ "        # Check output shape\n",
+ "        self.assertEqual(output.shape, (batch_size, seq_length, d_model))\n"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "In this implementation:\n",
+ "\n",
+ "- The `MultiHeadAttention` class defines a multi-head attention layer.\n",
+ "- The `forward` method applies linear projections, splits the results into multiple heads, computes attention scores, and aggregates the outputs of the heads."
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "## Feedforward Layer\n",
+ "\n",
+ "The Position-wise Feedforward Layer in the Transformer model applies a simple feedforward neural network independently to each position in the sequence. It consists of two linear transformations with a non-linear activation function (commonly ReLU) applied in between. This layer helps capture complex interactions between different dimensions of the input embeddings.\n",
+ "\n",
+ "Implementation in PyTorch:"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "import torch\n",
+ "import torch.nn as nn\n",
+ "import torch.nn.functional as F\n",
+ "\n",
+ "class FeedForward(nn.Module):\n",
+ "    def __init__(self, d_model, d_ff=2048, dropout=0.1):\n",
+ "        super().__init__()\n",
+ "        ## Set d_ff default to 2048\n",
+ "        self.linear_1 = nn.Linear(d_model, d_ff)\n",
+ "        self.dropout = nn.Dropout(dropout)\n",
+ "        self.linear_2 = nn.Linear(d_ff, d_model)\n",
+ "\n",
+ "    def forward(self, x):\n",
+ "        x = self.dropout(F.relu(self.linear_1(x)))\n",
+ "        x = self.linear_2(x)\n",
+ "        return x"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "Check result by executing below... 📝 "
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "metadata": {
+ "jupyter": {
+ "source_hidden": true
+ },
+ "tags": [
+ "hide-input"
+ ]
+ },
+ "outputs": [],
+ "source": [
+ "%%ipytest -qq\n",
+ "\n",
+ "class TestFeedForward(unittest.TestCase):\n",
+ "    def test_forward(self):\n",
+ "        # Instantiate FeedForward module\n",
+ "        d_model = 512\n",
+ "        d_ff = 2048\n",
+ "        dropout = 0.1\n",
+ "        feed_forward = FeedForward(d_model, d_ff, dropout)\n",
+ "\n",
+ "        # Create sample input tensor\n",
+ "        batch_size = 2\n",
+ "        seq_length = 5\n",
+ "        input_tensor = torch.randn(batch_size, seq_length, d_model)\n",
+ "\n",
+ "        # Forward pass through the FeedForward module\n",
+ "        output = feed_forward(input_tensor)\n",
+ "\n",
+ "        # Check output shape\n",
+ "        self.assertEqual(output.shape, (batch_size, seq_length, d_model))"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "In this implementation:\n",
+ "\n",
+ "- The `FeedForward` class defines a feedforward layer.\n",
+ "- The `forward` method applies ReLU activation to the output of the first linear transformation, followed by dropout, and then performs the second linear transformation to produce the final output."
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "## Residual Connection and Layer Normalization\n",
+ "\n",
+ "**Residual Connection:**\n",
+ "The Residual Connection, also known as a skip connection, is a technique used in deep neural networks to mitigate the vanishing gradient problem and facilitate the flow of information through the network. In the Transformer model, a residual connection is added around each sub-layer (such as the attention and feedforward layers): the input of the sub-layer is added to its output. This allows the model to learn residual representations and thus eases the optimization process.\n",
+ "\n",
+ "**Layer Normalization:**\n",
+ "Layer Normalization is a technique used to stabilize the training of deep neural networks by normalizing the activations of each layer along the feature dimension. In the original Transformer, it is applied to the sum of a sub-layer's input and output, i.e. `LayerNorm(x + Sublayer(x))`. The encoder and decoder layers implemented later in this notebook instead normalize the input of each sub-layer before the residual addition (the pre-norm variant). In both cases, normalization helps the model learn more robust representations and accelerates convergence during training.\n",
+ "\n",
+ "Implementation in PyTorch:"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "import torch\n",
+ "import torch.nn as nn\n",
+ "\n",
+ "class NormLayer(nn.Module):\n",
+ "    def __init__(self, d_model, eps=1e-6):\n",
+ "        super().__init__()\n",
+ "        self.size = d_model\n",
+ "        ## Layer normalization includes two learnable parameters\n",
+ "        self.alpha = nn.Parameter(torch.ones(self.size))\n",
+ "        self.bias = nn.Parameter(torch.zeros(self.size))\n",
+ "        self.eps = eps\n",
+ "\n",
+ "    def forward(self, x):\n",
+ "        norm = self.alpha * (x - x.mean(dim=-1, keepdim=True)) / (x.std(dim=-1, keepdim=True) + self.eps) + self.bias\n",
+ "        return norm"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "Check result by executing below... 📝 "
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "metadata": {
+ "jupyter": {
+ "source_hidden": true
+ },
+ "tags": [
+ "hide-input"
+ ]
+ },
+ "outputs": [],
+ "source": [
+ "%%ipytest -qq\n",
+ "\n",
+ "class TestNormLayer(unittest.TestCase):\n",
+ "    def test_forward(self):\n",
+ "        # Instantiate NormLayer module\n",
+ "        d_model = 512\n",
+ "        eps = 1e-6\n",
+ "        norm_layer = NormLayer(d_model, eps)\n",
+ "\n",
+ "        # Create sample input tensor\n",
+ "        batch_size = 2\n",
+ "        seq_length = 5\n",
+ "        input_tensor = torch.randn(batch_size, seq_length, d_model)\n",
+ "\n",
+ "        # Forward pass through the NormLayer module\n",
+ "        output = norm_layer(input_tensor)\n",
+ "\n",
+ "        # Check output shape\n",
+ "        self.assertEqual(output.shape, (batch_size, seq_length, d_model))"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "In this implementation:\n",
+ "\n",
+ "- The `NormLayer` class defines a layer normalization layer with a learnable scale (`alpha`) and shift (`bias`).\n",
+ "- The `forward` method normalizes the input tensor `x` along its last (feature) dimension."
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "## Encoder and Decoder Structure\n",
+ "**Encoder Structure:**\n",
+ "The Encoder in the Transformer model consists of multiple stacked Encoder layers. Each Encoder layer contains a Multi-Head Attention sub-layer followed by a FeedForward sub-layer, each wrapped in a Residual Connection with Layer Normalization.\n",
+ "\n",
+ "**Decoder Structure:**\n",
+ "Similarly, the Decoder in the Transformer model consists of multiple stacked Decoder layers. Each Decoder layer contains three sub-layers, each wrapped in a Residual Connection with Layer Normalization:\n",
+ "\n",
+ "1. A Masked Multi-Head Attention sub-layer that attends to previous tokens in the output sequence.\n",
+ "2. A Multi-Head Attention sub-layer that attends to the encoder's output.\n",
+ "3. A FeedForward sub-layer.\n",
+ "\n",
+ "Below are the PyTorch implementations of the Encoder and Decoder structures:"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "class EncoderLayer(nn.Module):\n",
+ "    def __init__(self, d_model, heads, dropout=0.1):\n",
+ "        super().__init__()\n",
+ "        self.norm_1 = NormLayer(d_model)\n",
+ "        self.norm_2 = NormLayer(d_model)\n",
+ "        self.attn = MultiHeadAttention(heads, d_model, dropout=dropout)\n",
+ "        self.ff = FeedForward(d_model, dropout=dropout)\n",
+ "        self.dropout_1 = nn.Dropout(dropout)\n",
+ "        self.dropout_2 = nn.Dropout(dropout)\n",
+ "\n",
+ "    def forward(self, x, mask):\n",
+ "        x2 = self.norm_1(x)\n",
+ "        x = x + self.dropout_1(self.attn(x2, x2, x2, mask))\n",
+ "        x2 = self.norm_2(x)\n",
+ "        x = x + self.dropout_2(self.ff(x2))\n",
+ "        return x\n",
+ "\n",
+ "class Encoder(nn.Module):\n",
+ "    def __init__(self, vocab_size, d_model, N, heads, dropout=0.1):\n",
+ "        super().__init__()\n",
+ "        self.N = N\n",
+ "        self.embed = nn.Embedding(vocab_size, d_model)  # stands in for the Embedder helper, which this notebook never defines\n",
+ "        self.pe = PositionalEncoder(d_model)\n",
+ "        self.layers = nn.ModuleList([EncoderLayer(d_model, heads, dropout) for _ in range(N)])  # replaces the undefined get_clones helper\n",
+ "        self.norm = NormLayer(d_model)\n",
+ "\n",
+ "    def forward(self, src, mask):\n",
+ "        x = self.embed(src)\n",
+ "        x = self.pe(x)\n",
+ "        for i in range(self.N):\n",
+ "            x = self.layers[i](x, mask)\n",
+ "        return self.norm(x)"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "class DecoderLayer(nn.Module):\n",
+ "    def __init__(self, d_model, heads, dropout=0.1):\n",
+ "        super().__init__()\n",
+ "        self.norm_1 = NormLayer(d_model)\n",
+ "        self.norm_2 = NormLayer(d_model)\n",
+ "        self.norm_3 = NormLayer(d_model)\n",
+ "        self.dropout_1 = nn.Dropout(dropout)\n",
+ "        self.dropout_2 = nn.Dropout(dropout)\n",
+ "        self.dropout_3 = nn.Dropout(dropout)\n",
+ "        self.attn_1 = MultiHeadAttention(heads, d_model, dropout=dropout)\n",
+ "        self.attn_2 = MultiHeadAttention(heads, d_model, dropout=dropout)\n",
+ "        self.ff = FeedForward(d_model, dropout=dropout)\n",
+ "\n",
+ "    def forward(self, x, e_outputs, src_mask, trg_mask):\n",
+ "        x2 = self.norm_1(x)\n",
+ "        x = x + self.dropout_1(self.attn_1(x2, x2, x2, trg_mask))\n",
+ "        x2 = self.norm_2(x)\n",
+ "        x = x + self.dropout_2(self.attn_2(x2, e_outputs, e_outputs, src_mask))\n",
+ "        x2 = self.norm_3(x)\n",
+ "        x = x + self.dropout_3(self.ff(x2))\n",
+ "        return x\n",
+ "\n",
+ "class Decoder(nn.Module):\n",
+ "    def __init__(self, vocab_size, d_model, N, heads, dropout=0.1):\n",
+ "        super().__init__()\n",
+ "        self.N = N\n",
+ "        self.embed = nn.Embedding(vocab_size, d_model)  # stands in for the Embedder helper, which this notebook never defines\n",
+ "        self.pe = PositionalEncoder(d_model)  # PositionalEncoder takes no dropout argument\n",
+ "        self.layers = nn.ModuleList([DecoderLayer(d_model, heads, dropout) for _ in range(N)])  # replaces the undefined get_clones helper\n",
+ "        self.norm = NormLayer(d_model)\n",
+ "\n",
+ "    def forward(self, trg, e_outputs, src_mask, trg_mask):\n",
+ "        x = self.embed(trg)\n",
+ "        x = self.pe(x)\n",
+ "        for i in range(self.N):\n",
+ "            x = self.layers[i](x, e_outputs, src_mask, trg_mask)\n",
+ "        return self.norm(x)"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "In these implementations:\n",
+ "\n",
+ "- The `EncoderLayer` and `DecoderLayer` classes define a single encoder layer and a single decoder layer, respectively.\n",
+ "- The `Encoder` and `Decoder` classes define the encoder and decoder modules, each composed of a stack of such layers.\n",
+ "- These classes follow the architecture described in the text, including the use of multi-head attention, feedforward layers, residual connections, and layer normalization."
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "The overall implementation of the Transformer encoder and decoder structure:"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "import torch\n",
+ "import torch.nn as nn\n",
+ "import torch.nn.functional as F\n",
+ "import time\n",
+ "import numpy as np\n",
+ "\n",
+ "class Transformer(nn.Module):\n",
+ "    def __init__(self, src_vocab, trg_vocab, d_model, N, heads, dropout=0.1):\n",
+ "        super().__init__()\n",
+ "        self.encoder = Encoder(src_vocab, d_model, N, heads, dropout)\n",
+ "        self.decoder = Decoder(trg_vocab, d_model, N, heads, dropout)\n",
+ "        self.out = nn.Linear(d_model, trg_vocab)\n",
+ "\n",
+ "    def forward(self, src, trg, src_mask, trg_mask):\n",
+ "        e_outputs = self.encoder(src, src_mask)\n",
+ "        d_output = self.decoder(trg, e_outputs, src_mask, trg_mask)\n",
+ "        output = self.out(d_output)\n",
+ "        return output\n"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "The training process for the Transformer model:"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "## Model parameters (EN_TEXT, FR_TEXT, train_iter, create_masks and target_pad are assumed to come from the data pipeline, which is not shown here)\n",
+ "d_model = 512\n",
+ "heads = 8\n",
+ "N = 6\n",
+ "src_vocab = len(EN_TEXT.vocab)\n",
+ "trg_vocab = len(FR_TEXT.vocab)\n",
+ "\n",
+ "## Initialize the model\n",
+ "model = Transformer(src_vocab, trg_vocab, d_model, N, heads, dropout=0.1)\n",
+ "\n",
+ "## Initialize optimizer\n",
+ "optim = torch.optim.Adam(model.parameters(), lr=0.0001, betas=(0.9, 0.98), eps=1e-9)\n",
+ "\n",
+ "## Training function\n",
+ "def train_model(epochs, print_every=100):\n",
+ "    model.train()\n",
+ "    start = time.time()\n",
+ "    temp = start\n",
+ "    total_loss = 0\n",
+ "\n",
+ "    for epoch in range(epochs):\n",
+ "        for i, batch in enumerate(train_iter):\n",
+ "            src = batch.English.transpose(0, 1)\n",
+ "            trg = batch.French.transpose(0, 1)\n",
+ "            trg_input = trg[:, :-1]\n",
+ "            targets = trg[:, 1:].contiguous().view(-1)\n",
+ "            src_mask, trg_mask = create_masks(src, trg_input)\n",
+ "\n",
+ "            preds = model(src, trg_input, src_mask, trg_mask)\n",
+ "            optim.zero_grad()\n",
+ "            loss = F.cross_entropy(preds.view(-1, preds.size(-1)), targets, ignore_index=target_pad)\n",
+ "            loss.backward()\n",
+ "            optim.step()\n",
+ "            total_loss += loss.item()\n",
+ "\n",
+ "            if (i + 1) % print_every == 0:\n",
+ "                loss_avg = total_loss / print_every\n",
+ "                print(\"time = %dm, epoch %d, iter = %d, loss = %.3f, %ds per %d iters\" % (\n",
+ "                    (time.time() - start) // 60, epoch + 1, i + 1, loss_avg, time.time() - temp, print_every))\n",
+ "                total_loss = 0\n",
+ "                temp = time.time()\n",
+ "\n",
+ "## Train the model\n",
+ "train_model(epochs=10)"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "Test the trained model:"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "def translate(model, src, max_len=80, custom_string=False):\n",
+ "    model.eval()\n",
+ "    if custom_string:\n",
+ "        src = tokenize_en(src)\n",
+ "        src = torch.LongTensor([[EN_TEXT.vocab.stoi[tok] for tok in src]]).to(next(model.parameters()).device)\n",
+ "    src_mask = (src != input_pad).unsqueeze(-2)  # input_pad: padding index, assumed to be defined with the vocabulary\n",
+ "    e_outputs = model.encoder(src, src_mask)\n",
+ "    outputs = torch.zeros(max_len).type_as(src.data)\n",
+ "    outputs[0] = FR_TEXT.vocab.stoi['<sos>']  # start token; '<sos>' and '<eos>' are assumed names, match them to your vocabulary\n",
+ "\n",
+ "    for i in range(1, max_len):\n",
+ "        trg_mask = np.triu(np.ones((1, i, i)), k=1).astype('uint8')\n",
+ "        trg_mask = (torch.from_numpy(trg_mask) == 0).to(src.device)\n",
+ "        out = model.out(model.decoder(outputs[:i].unsqueeze(0), e_outputs, src_mask, trg_mask))\n",
+ "\n",
+ "        out = F.softmax(out, dim=-1)\n",
+ "        val, ix = out[:, -1].data.topk(1)\n",
+ "        outputs[i] = ix[0][0]\n",
+ "\n",
+ "        if ix[0][0] == FR_TEXT.vocab.stoi['<eos>']:\n",
+ "            break\n",
+ "\n",
+ "    return ' '.join([FR_TEXT.vocab.itos[ix] for ix in outputs[:i]])\n"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "## Acknowledgments\n",
+ "\n",
+ "Thanks to the awesome open source project for Transformer learning, which inspired this chapter.\n",
+ "\n",
+ "- [chatgpt](https://openai.com/product/chatgpt)"
+ ]
+ }
+ ],
+ "metadata": {
+ "kernelspec": {
+ "display_name": "open-machine-learning-jupyter-book",
+ "language": "python",
+ "name": "python3"
+ },
+ "language_info": {
+ "codemirror_mode": {
+ "name": "ipython",
+ "version": 3
+ },
+ "file_extension": ".py",
+ "mimetype": "text/x-python",
+ "name": "python",
+ "nbconvert_exporter": "python",
+ "pygments_lexer": "ipython3",
+ "version": "3.9.18"
+ }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 2
+}
diff --git a/open-machine-learning-jupyter-book/llm/basic/attention.ipynb b/open-machine-learning-jupyter-book/llm/basic/attention.ipynb
new file mode 100644
index 000000000..2c4b9c098
--- /dev/null
+++ b/open-machine-learning-jupyter-book/llm/basic/attention.ipynb
@@ -0,0 +1,367 @@
+{
+ "cells": [
+ {
+ "cell_type": "markdown",
+ "metadata": {
+ "tags": [
+ "remove-cell"
+ ]
+ },
+ "source": [
+ "---\n",
+ "license:\n",
+ " code: MIT\n",
+ " content: CC-BY-4.0\n",
+ "github: https://github.com/ocademy-ai/machine-learning\n",
+ "venue: By Ocademy\n",
+ "open_access: true\n",
+ "bibliography:\n",
+ " - https://raw.githubusercontent.com/ocademy-ai/machine-learning/main/open-machine-learning-jupyter-book/references.bib\n",
+ "---"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "# Attention\n",
+ "## What is Attention?\n",
+ "\n",
+ "The attention mechanism describes a relatively recent group of layers in neural networks that has attracted a lot of interest in the past few years, especially in sequence tasks. There are many possible definitions of \"attention\" in the literature, but the one we will use here is the following: _the attention mechanism describes a weighted average of (sequence) elements with the weights dynamically computed based on an input query and elements' keys_. So what exactly does this mean? The goal is to take an average over the features of multiple elements. However, instead of weighting each element equally, we want to weight them depending on their actual values. In other words, we want to dynamically decide which inputs we want to \"attend\" to more than others. In particular, an attention mechanism usually has four parts we need to specify:\n",
+ "\n",
+ "* **Query**: The query is a feature vector that describes what we are looking for in the sequence, i.e. what we might want to pay attention to.\n",
+ "* **Keys**: For each input element, we have a key which is again a feature vector. This feature vector roughly describes what the element is \"offering\", or when it might be important. The keys should be designed such that we can identify the elements we want to pay attention to based on the query.\n",
+ "* **Values**: For each input element, we also have a value vector. This feature vector is the one we want to average over.\n",
+ "* **Score function**: To rate which elements we want to pay attention to, we need to specify a score function $f_{attn}$. The score function takes the query and a key as input, and outputs the score/attention weight of the query-key pair. It is usually implemented by simple similarity metrics like a dot product, or a small MLP.\n",
+ "\n",
+ "\n",
+ "The weights of the average are calculated by a softmax over all score function outputs. Hence, we assign those value vectors a higher weight whose corresponding key is most similar to the query. If we try to describe it with pseudo-math, we can write: \n",
+ "\n",
+ "$$\n",
+ "\\alpha_i = \\frac{\\exp\\left(f_{attn}\\left(\\text{key}_i, \\text{query}\\right)\\right)}{\\sum_j \\exp\\left(f_{attn}\\left(\\text{key}_j, \\text{query}\\right)\\right)}, \\hspace{5mm} \\text{out} = \\sum_i \\alpha_i \\cdot \\text{value}_i\n",
+ "$$\n",
+ "\n",
+ "Visually, we can show the attention over a sequence of words as follows:\n",
+ "\n",
+ ":::{figure} ../image/attention_example.svg\n",
+ ":::\n",
+ "\n",
+ "For every word, we have one key and one value vector. The query is compared to all keys with a score function (in this case the dot product) to determine the weights. The softmax is not visualized for simplicity. Finally, the value vectors of all words are averaged using the attention weights.\n",
+ "\n",
+ "Most attention mechanisms differ in terms of what queries they use, how the key and value vectors are defined, and what score function is used. The attention applied inside the Transformer architecture is called **self-attention**. In self-attention, each sequence element provides a key, value, and query. For each element, we perform an attention layer in which, based on its query, we check the similarity to all sequence elements' keys and return a different, averaged value vector for each element. We will now go into a bit more detail by first looking at the specific implementation of the attention mechanism, which in the Transformer case is the scaled dot product attention."
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "## Scaled Dot Product Attention\n",
+ "\n",
+ "The core concept behind self-attention is the scaled dot product attention. Our goal is to have an attention mechanism with which any element in a sequence can attend to any other while still being efficient to compute. The dot product attention takes as input a set of queries $Q\\in\\mathbb{R}^{T\\times d_k}$, keys $K\\in\\mathbb{R}^{T\\times d_k}$ and values $V\\in\\mathbb{R}^{T\\times d_v}$ where $T$ is the sequence length, and $d_k$ and $d_v$ are the hidden dimensionality for queries/keys and values respectively. For simplicity, we neglect the batch dimension for now. The attention value from element $i$ to $j$ is based on the similarity of the query $Q_i$ and the key $K_j$, using the dot product as the similarity metric. In math, we calculate the dot product attention as follows:\n",
+ "\n",
+ "$$\\text{Attention}(Q,K,V)=\\text{softmax}\\left(\\frac{QK^T}{\\sqrt{d_k}}\\right)V$$\n",
+ "\n",
+ "The matrix multiplication $QK^T$ performs the dot product for every possible pair of queries and keys, resulting in a matrix of the shape $T\\times T$. Each row represents the attention logits for a specific element $i$ to all elements in the sequence. On these, we apply a softmax over each row and multiply with the value matrix $V$ to obtain a weighted mean of the value vectors (the weights being determined by the attention). The computation graph visualized below offers another perspective on this attention mechanism (figure credit - [Vaswani et al., 2017](https://arxiv.org/abs/1706.03762)).\n",
+ "\n",
+ ":::{figure} ../image/scaled_dot_product_attn.svg\n",
+ ":::\n",
+ "\n",
+ "One aspect we haven't discussed yet is the scaling factor of $1/\\sqrt{d_k}$. This scaling factor is crucial to maintain an appropriate variance of attention values after initialization. Remember that we initialize our layers with the intention of having equal variance throughout the model, and hence, $Q$ and $K$ might also have a variance close to $1$. However, performing a dot product over two vectors whose independent, zero-mean entries have variance $\\sigma^2$ results in a scalar whose variance grows linearly with $d_k$: \n",
+ "\n",
+ "$$q_i \\sim \\mathcal{N}(0,\\sigma^2), k_i \\sim \\mathcal{N}(0,\\sigma^2) \\to \\text{Var}\\left(\\sum_{i=1}^{d_k} q_i\\cdot k_i\\right) = \\sigma^4\\cdot d_k$$\n",
+ "\n",
+ "\n",
+ "If we do not scale the variance back down to $\\sim\\sigma^2$, the softmax over the logits will already saturate to $1$ for one random element and $0$ for all others. The gradients through the softmax will then be close to zero, so that we can't learn the parameters appropriately. Note that the extra factor of $\\sigma^2$, i.e., having $\\sigma^4$ instead of $\\sigma^2$, is usually not an issue, since we keep the original variance $\\sigma^2$ close to $1$ anyway.\n",
+ "\n",
+ "The block `Mask (opt.)` in the diagram above represents the optional masking of specific entries in the attention matrix. This is for instance used if we stack multiple sequences with different lengths into a batch. To still benefit from parallelization in PyTorch, we pad the sentences to the same length and mask out the padding tokens during the calculation of the attention values. This is usually done by setting the respective attention logits to a very low value. \n",
+ "\n",
+ "After we have discussed the details of the scaled dot product attention block, we can write a function below which computes the output features given the triple of queries, keys, and values:"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "Below, we import the standard libraries."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 16,
+ "metadata": {},
+ "outputs": [
+ {
+ "name": "stdout",
+ "output_type": "stream",
+ "text": [
+ "Device: cpu\n"
+ ]
+ }
+ ],
+ "source": [
+ "## Standard libraries\n",
+ "import os\n",
+ "import numpy as np\n",
+ "import random\n",
+ "import math\n",
+ "import json\n",
+ "from functools import partial\n",
+ "\n",
+ "## Imports for plotting\n",
+ "import matplotlib.pyplot as plt\n",
+ "plt.set_cmap('cividis')\n",
+ "%matplotlib inline\n",
+ "from matplotlib.colors import to_rgb\n",
+ "import matplotlib\n",
+ "matplotlib.rcParams['lines.linewidth'] = 2.0\n",
+ "import seaborn as sns\n",
+ "sns.reset_orig()\n",
+ "\n",
+ "## tqdm for loading bars\n",
+ "from tqdm.notebook import tqdm\n",
+ "\n",
+ "## PyTorch\n",
+ "import torch\n",
+ "import torch.nn as nn\n",
+ "import torch.nn.functional as F\n",
+ "import torch.utils.data as data\n",
+ "import torch.optim as optim\n",
+ "\n",
+ "## Torchvision\n",
+ "import torchvision\n",
+ "from torchvision.datasets import CIFAR100\n",
+ "from torchvision import transforms\n",
+ "\n",
+ "# PyTorch Lightning\n",
+ "try:\n",
+ "    import pytorch_lightning as pl\n",
+ "except ModuleNotFoundError: # Google Colab does not have PyTorch Lightning installed by default. Hence, we do it here if necessary\n",
+ "    !pip install --quiet \"pytorch-lightning>=1.4\"\n",
+ "    import pytorch_lightning as pl\n",
+ "from pytorch_lightning.callbacks import LearningRateMonitor, ModelCheckpoint\n",
+ "\n",
+ "# Ensure that all operations are deterministic on GPU (if used) for reproducibility\n",
+ "torch.backends.cudnn.deterministic = True\n",
+ "torch.backends.cudnn.benchmark = False\n",
+ "\n",
+ "device = torch.device(\"cuda:0\") if torch.cuda.is_available() else torch.device(\"cpu\")\n",
+ "print(\"Device:\", device)"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 17,
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "def scaled_dot_product(q, k, v, mask=None):\n",
+ "    d_k = q.size()[-1]\n",
+ "    attn_logits = torch.matmul(q, k.transpose(-2, -1))\n",
+ "    attn_logits = attn_logits / math.sqrt(d_k)\n",
+ "    if mask is not None:\n",
+ "        attn_logits = attn_logits.masked_fill(mask == 0, -9e15)\n",
+ "    attention = F.softmax(attn_logits, dim=-1)\n",
+ "    values = torch.matmul(attention, v)\n",
+ "    return values, attention"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "Note that our code above supports any additional dimensionality in front of the sequence length so that we can also use it for batches. However, for a better understanding, let's generate a few random queries, keys, and value vectors, and calculate the attention outputs:"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 18,
+ "metadata": {},
+ "outputs": [
+ {
+ "name": "stderr",
+ "output_type": "stream",
+ "text": [
+ "Seed set to 42\n"
+ ]
+ },
+ {
+ "name": "stdout",
+ "output_type": "stream",
+ "text": [
+ "Q\n",
+ " tensor([[ 0.3367, 0.1288],\n",
+ " [ 0.2345, 0.2303],\n",
+ " [-1.1229, -0.1863]])\n",
+ "K\n",
+ " tensor([[ 2.2082, -0.6380],\n",
+ " [ 0.4617, 0.2674],\n",
+ " [ 0.5349, 0.8094]])\n",
+ "V\n",
+ " tensor([[ 1.1103, -1.6898],\n",
+ " [-0.9890, 0.9580],\n",
+ " [ 1.3221, 0.8172]])\n",
+ "Values\n",
+ " tensor([[ 0.5698, -0.1520],\n",
+ " [ 0.5379, -0.0265],\n",
+ " [ 0.2246, 0.5556]])\n",
+ "Attention\n",
+ " tensor([[0.4028, 0.2886, 0.3086],\n",
+ " [0.3538, 0.3069, 0.3393],\n",
+ " [0.1303, 0.4630, 0.4067]])\n"
+ ]
+ }
+ ],
+ "source": [
+ "seq_len, d_k = 3, 2\n",
+ "pl.seed_everything(42)\n",
+ "q = torch.randn(seq_len, d_k)\n",
+ "k = torch.randn(seq_len, d_k)\n",
+ "v = torch.randn(seq_len, d_k)\n",
+ "values, attention = scaled_dot_product(q, k, v)\n",
+ "print(\"Q\\n\", q)\n",
+ "print(\"K\\n\", k)\n",
+ "print(\"V\\n\", v)\n",
+ "print(\"Values\\n\", values)\n",
+ "print(\"Attention\\n\", attention)"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "Before continuing, make sure you can follow the calculation of the specific values here, and also check it by hand. It is important to fully understand how the scaled dot product attention is calculated."
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "## Multi-Head Attention\n",
+ "\n",
+ "The scaled dot product attention allows a network to attend over a sequence. However, often there are multiple different aspects a sequence element wants to attend to, and a single weighted average is not a good option for it. This is why we extend the attention mechanisms to multiple heads, i.e. multiple different query-key-value triplets on the same features. Specifically, given a query, key, and value matrix, we transform those into $h$ sub-queries, sub-keys, and sub-values, which we pass through the scaled dot product attention independently. Afterward, we concatenate the heads and combine them with a final weight matrix. Mathematically, we can express this operation as:\n",
+ "\n",
+ "$$\n",
+ "\\begin{split}\n",
+ " \\text{Multihead}(Q,K,V) & = \\text{Concat}(\\text{head}_1,...,\\text{head}_h)W^{O}\\\\\n",
+ " \\text{where } \\text{head}_i & = \\text{Attention}(QW_i^Q,KW_i^K, VW_i^V)\n",
+ "\\end{split}\n",
+ "$$\n",
+ "\n",
+ "We refer to this as the Multi-Head Attention layer with the learnable parameters $W_{1...h}^{Q}\\in\\mathbb{R}^{D\\times d_k}$, $W_{1...h}^{K}\\in\\mathbb{R}^{D\\times d_k}$, $W_{1...h}^{V}\\in\\mathbb{R}^{D\\times d_v}$, and $W^{O}\\in\\mathbb{R}^{h\\cdot d_v\\times d_{out}}$ ($D$ being the input dimensionality). Expressed in a computational graph, we can visualize it as below (figure credit - [Vaswani et al., 2017](https://arxiv.org/abs/1706.03762)).\n",
+ "\n",
+ ":::{figure} ../image/multihead_attention.svg\n",
+ ":::\n",
+ "\n",
+ "How do we apply a Multi-Head Attention layer in a neural network, where we don't have an arbitrary query, key, and value vector as input? Looking at the computation graph above, a simple but effective implementation is to set the current feature map in a NN, $X\\in\\mathbb{R}^{B\\times T\\times d_{\\text{model}}}$, as $Q$, $K$ and $V$ ($B$ being the batch size, $T$ the sequence length, $d_{\\text{model}}$ the hidden dimensionality of $X$). The subsequent weight matrices $W^{Q}$, $W^{K}$, and $W^{V}$ can transform $X$ to the corresponding feature vectors that represent the queries, keys, and values of the input. Using this approach, we can implement the Multi-Head Attention module below."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 19,
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "# Helper function to support different mask shapes.\n",
+ "# Output shape supports (batch_size, number of heads, seq length, seq length)\n",
+ "# If 2D: broadcasted over batch size and number of heads\n",
+ "# If 3D: broadcasted over number of heads\n",
+ "# If 4D: leave as is\n",
+ "def expand_mask(mask):\n",
+ "    assert mask.ndim >= 2, \"Mask must be at least 2-dimensional with seq_length x seq_length\"\n",
+ "    if mask.ndim == 3:\n",
+ "        mask = mask.unsqueeze(1)\n",
+ "    while mask.ndim < 4:\n",
+ "        mask = mask.unsqueeze(0)\n",
+ "    return mask"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 20,
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "class MultiheadAttention(nn.Module):\n",
+ "\n",
+ "    def __init__(self, input_dim, embed_dim, num_heads):\n",
+ "        super().__init__()\n",
+ "        assert embed_dim % num_heads == 0, \"Embedding dimension must be 0 modulo number of heads.\"\n",
+ "\n",
+ "        self.embed_dim = embed_dim\n",
+ "        self.num_heads = num_heads\n",
+ "        self.head_dim = embed_dim // num_heads\n",
+ "\n",
+ "        # Stack all weight matrices 1...h together for efficiency\n",
+ "        # Note that in many implementations you see \"bias=False\" which is optional\n",
+ "        self.qkv_proj = nn.Linear(input_dim, 3*embed_dim)\n",
+ "        self.o_proj = nn.Linear(embed_dim, embed_dim)\n",
+ "\n",
+ "        self._reset_parameters()\n",
+ "\n",
+ "    def _reset_parameters(self):\n",
+ "        # Original Transformer initialization, see PyTorch documentation\n",
+ "        nn.init.xavier_uniform_(self.qkv_proj.weight)\n",
+ "        self.qkv_proj.bias.data.fill_(0)\n",
+ "        nn.init.xavier_uniform_(self.o_proj.weight)\n",
+ "        self.o_proj.bias.data.fill_(0)\n",
+ "\n",
+ "    def forward(self, x, mask=None, return_attention=False):\n",
+ "        batch_size, seq_length, _ = x.size()\n",
+ "        if mask is not None:\n",
+ "            mask = expand_mask(mask)\n",
+ "        qkv = self.qkv_proj(x)\n",
+ "\n",
+ "        # Separate Q, K, V from linear output\n",
+ "        qkv = qkv.reshape(batch_size, seq_length, self.num_heads, 3*self.head_dim)\n",
+ "        qkv = qkv.permute(0, 2, 1, 3) # [Batch, Head, SeqLen, Dims]\n",
+ "        q, k, v = qkv.chunk(3, dim=-1)\n",
+ "\n",
+ "        # Determine value outputs\n",
+ "        values, attention = scaled_dot_product(q, k, v, mask=mask)\n",
+ "        values = values.permute(0, 2, 1, 3) # [Batch, SeqLen, Head, Dims]\n",
+ "        values = values.reshape(batch_size, seq_length, self.embed_dim)\n",
+ "        o = self.o_proj(values)\n",
+ "\n",
+ "        if return_attention:\n",
+ "            return o, attention\n",
+ "        else:\n",
+ "            return o"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "One crucial characteristic of the multi-head attention is that it is permutation-equivariant with respect to its inputs. This means that if we switch two input elements in the sequence, e.g. $X_1\\leftrightarrow X_2$ (neglecting the batch dimension for now), the output is exactly the same apart from elements 1 and 2 being switched. Hence, the multi-head attention is actually looking at the input not as a sequence, but as a set of elements. This property makes the multi-head attention block and the Transformer architecture so powerful and widely applicable! But what if the order of the input is actually important for solving the task, like language modeling? The answer is to encode the position in the input features, which we will take a closer look at later (see _Positional encoding_ in the Transformer notebook).\n",
+ "\n",
+ "Before moving on to creating the Transformer architecture, we can compare the self-attention operation with its common competitors for sequence data: convolutions and recurrent neural networks. Below you can find a table by [Vaswani et al. (2017)](https://arxiv.org/abs/1706.03762) on the complexity per layer, the number of sequential operations, and maximum path length. The complexity is measured by the upper bound of the number of operations to perform, while the maximum path length represents the maximum number of steps a forward or backward signal has to traverse to reach any other position. The lower this length, the better gradient signals can backpropagate for long-range dependencies. Let's take a look at the table below:\n",
+ "\n",
+ ":::{figure} ../image/comparison_conv_rnn.svg\n",
+ ":::\n",
+ "\n",
+ "$n$ is the sequence length, $d$ is the representation dimension and $k$ is the kernel size of convolutions. In contrast to recurrent networks, the self-attention layer can parallelize all its operations, making it much faster to execute for smaller sequence lengths. However, when the sequence length exceeds the hidden dimensionality, self-attention becomes more expensive than RNNs. One way of reducing the computational cost for long sequences is to restrict the self-attention to a neighborhood of inputs to attend over, whose size is denoted by $r$. Nevertheless, there has recently been a lot of work on more efficient Transformer architectures that still allow long dependencies, of which you can find an overview in the paper by [Tay et al. (2020)](https://arxiv.org/abs/2009.06732) if interested."
+ ]
+ }
+ ],
+ "metadata": {
+ "kernelspec": {
+ "display_name": "Python 3 (ipykernel)",
+ "language": "python",
+ "name": "python3"
+ },
+ "language_info": {
+ "codemirror_mode": {
+ "name": "ipython",
+ "version": 3
+ },
+ "file_extension": ".py",
+ "mimetype": "text/x-python",
+ "name": "python",
+ "nbconvert_exporter": "python",
+ "pygments_lexer": "ipython3",
+ "version": "3.10.4"
+ }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 4
+}
diff --git a/open-machine-learning-jupyter-book/llm/basic/basic.ipynb b/open-machine-learning-jupyter-book/llm/basic/basic.ipynb
new file mode 100644
index 000000000..ef8e960b7
--- /dev/null
+++ b/open-machine-learning-jupyter-book/llm/basic/basic.ipynb
@@ -0,0 +1,64 @@
+{
+ "cells": [
+ {
+ "cell_type": "markdown",
+ "metadata": {
+ "tags": [
+ "remove-cell"
+ ]
+ },
+ "source": [
+ "---\n",
+ "license:\n",
+ " code: MIT\n",
+ " content: CC-BY-4.0\n",
+ "github: https://github.com/ocademy-ai/machine-learning\n",
+ "venue: By Ocademy\n",
+ "open_access: true\n",
+ "bibliography:\n",
+ " - https://raw.githubusercontent.com/ocademy-ai/machine-learning/main/open-machine-learning-jupyter-book/references.bib\n",
+ "---"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "# Large Language Models Basic\n",
+ "In these sections, we will explore the attention mechanism, which allows models to focus on specific parts of the input during processing. We will study the Transformer model architecture, which serves as the cornerstone for many state-of-the-art language models, and how it has fundamentally transformed the field of Natural Language Processing (NLP). Additionally, we will introduce generative pre-trained language models like GPT, delve into the network structures of large language models, optimization techniques for attention mechanisms, and practical applications stemming from these foundations.\n",
+ "\n",
+ ":::{figure} ../image/llm.png\n",
+ ":::"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ ":::{tableofcontents}\n",
+ ":::"
+ ]
+ }
+ ],
+ "metadata": {
+ "kernelspec": {
+ "display_name": "open-machine-learning-jupyter-book",
+ "language": "python",
+ "name": "python3"
+ },
+ "language_info": {
+ "codemirror_mode": {
+ "name": "ipython",
+ "version": 3
+ },
+ "file_extension": ".py",
+ "mimetype": "text/x-python",
+ "name": "python",
+ "nbconvert_exporter": "python",
+ "pygments_lexer": "ipython3",
+ "version": "3.9.18"
+ }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 2
+}
diff --git a/open-machine-learning-jupyter-book/llm/basic/transformer.ipynb b/open-machine-learning-jupyter-book/llm/basic/transformer.ipynb
new file mode 100644
index 000000000..4143ec705
--- /dev/null
+++ b/open-machine-learning-jupyter-book/llm/basic/transformer.ipynb
@@ -0,0 +1,20020 @@
+{
+ "cells": [
+ {
+ "cell_type": "markdown",
+ "metadata": {
+ "tags": [
+ "remove-cell"
+ ]
+ },
+ "source": [
+ "---\n",
+ "license:\n",
+ " code: MIT\n",
+ " content: CC-BY-4.0\n",
+ "github: https://github.com/ocademy-ai/machine-learning\n",
+ "venue: By Ocademy\n",
+ "open_access: true\n",
+ "bibliography:\n",
+ " - https://raw.githubusercontent.com/ocademy-ai/machine-learning/main/open-machine-learning-jupyter-book/references.bib\n",
+ "---"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "# Transformer\n",
+ "In this tutorial, we will discuss one of the most impactful architectures of recent years: the Transformer model. Since the paper [Attention Is All You Need](https://arxiv.org/abs/1706.03762) by Vaswani et al. was published in 2017, the Transformer architecture has continued to beat benchmarks in many domains, most importantly in Natural Language Processing. Transformers with an enormous number of parameters can generate long, convincing [essays](https://www.theguardian.com/commentisfree/2020/sep/08/robot-wrote-this-article-gpt-3), and have opened up new application fields of AI. As the hype around the Transformer architecture does not seem likely to end in the coming years, it is important to understand how it works and to have implemented it yourself, which we will do in this notebook. We focus here on what makes the Transformer and self-attention so powerful in general.\n",
+ "\n",
+ "Below, we import the standard libraries."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 1,
+ "metadata": {},
+ "outputs": [
+ {
+ "name": "stdout",
+ "output_type": "stream",
+ "text": [
+ "Device: cuda:0\n"
+ ]
+ }
+ ],
+ "source": [
+ "## Standard libraries\n",
+ "import os\n",
+ "import numpy as np \n",
+ "import random\n",
+ "import math\n",
+ "import json\n",
+ "from functools import partial\n",
+ "\n",
+ "## Imports for plotting\n",
+ "import matplotlib.pyplot as plt\n",
+ "plt.set_cmap('cividis')\n",
+ "%matplotlib inline \n",
+ "from matplotlib.colors import to_rgb\n",
+ "import matplotlib\n",
+ "matplotlib.rcParams['lines.linewidth'] = 2.0\n",
+ "import seaborn as sns\n",
+ "sns.reset_orig()\n",
+ "\n",
+ "## tqdm for loading bars\n",
+ "from tqdm.notebook import tqdm\n",
+ "\n",
+ "## PyTorch\n",
+ "import torch\n",
+ "import torch.nn as nn\n",
+ "import torch.nn.functional as F\n",
+ "import torch.utils.data as data\n",
+ "import torch.optim as optim\n",
+ "\n",
+ "## Torchvision\n",
+ "import torchvision\n",
+ "from torchvision.datasets import CIFAR100\n",
+ "from torchvision import transforms\n",
+ "\n",
+ "# PyTorch Lightning\n",
+ "try:\n",
+ "    import pytorch_lightning as pl\n",
+ "except ModuleNotFoundError: # Google Colab does not have PyTorch Lightning installed by default. Hence, we do it here if necessary\n",
+ "    !pip install --quiet \"pytorch-lightning>=1.4\"\n",
+ "    import pytorch_lightning as pl\n",
+ "from pytorch_lightning.callbacks import LearningRateMonitor, ModelCheckpoint\n",
+ "\n",
+ "# Path to the folder where the datasets are/should be downloaded (e.g. CIFAR10)\n",
+ "DATASET_PATH = \"./data\"\n",
+ "# Path to the folder where the pretrained models are saved\n",
+ "CHECKPOINT_PATH = \"./saved_models\"\n",
+ "\n",
+ "# Setting the seed\n",
+ "pl.seed_everything(42)\n",
+ "\n",
+ "# Ensure that all operations are deterministic on GPU (if used) for reproducibility\n",
+ "torch.backends.cudnn.deterministic = True\n",
+ "torch.backends.cudnn.benchmark = False\n",
+ "\n",
+ "device = torch.device(\"cuda:0\") if torch.cuda.is_available() else torch.device(\"cpu\")\n",
+ "print(\"Device:\", device)"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "Two pre-trained models are downloaded below. Make sure to have adjusted your `CHECKPOINT_PATH` before running this code if not already done."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 2,
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "import urllib.request\n",
+ "from urllib.error import HTTPError\n",
+ "# Github URL where saved models are stored for this tutorial\n",
+ "base_url = \"https://raw.githubusercontent.com/phlippe/saved_models/main/tutorial6/\"\n",
+ "# Files to download\n",
+ "pretrained_files = [\"ReverseTask.ckpt\", \"SetAnomalyTask.ckpt\"]\n",
+ "\n",
+ "# Create checkpoint path if it doesn't exist yet\n",
+ "os.makedirs(CHECKPOINT_PATH, exist_ok=True)\n",
+ "\n",
+ "# For each file, check whether it already exists. If not, try downloading it.\n",
+ "for file_name in pretrained_files:\n",
+ "    file_path = os.path.join(CHECKPOINT_PATH, file_name)\n",
+ "    if \"/\" in file_name:\n",
+ "        os.makedirs(file_path.rsplit(\"/\",1)[0], exist_ok=True)\n",
+ "    if not os.path.isfile(file_path):\n",
+ "        file_url = base_url + file_name\n",
+ "        print(f\"Downloading {file_url}...\")\n",
+ "        try:\n",
+ "            urllib.request.urlretrieve(file_url, file_path)\n",
+ "        except HTTPError as e:\n",
+ "            print(\"Something went wrong. Please try to download the file from the GDrive folder, or contact the author with the full output including the following error:\\n\", e)"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "## The Transformer architecture\n",
+ "\n",
+ "In the first part of this notebook, we will implement the Transformer architecture by hand. As the architecture is so popular, there already exists a PyTorch module `nn.Transformer` ([documentation](https://pytorch.org/docs/stable/generated/torch.nn.Transformer.html)) and a [tutorial](https://pytorch.org/tutorials/beginner/transformer_tutorial.html) on how to use it for next token prediction. However, we will implement it here ourselves, to understand it down to the smallest details."
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "### Transformer Encoder\n",
+ "\n",
+ "Next, we will look at how to apply the multi-head attention block inside the Transformer architecture. Originally, the Transformer model was designed for machine translation. Hence, it has an encoder-decoder structure where the encoder takes as input the sentence in the original language and generates an attention-based representation. The decoder, in turn, attends over the encoded information and generates the translated sentence in an autoregressive manner, as in a standard RNN. While this structure is extremely useful for Sequence-to-Sequence tasks that require autoregressive decoding, we will focus here on the encoder part. Many advances in NLP and beyond have been made using pure encoder-based Transformer models (if interested, models include the [BERT](https://arxiv.org/abs/1810.04805)-family, the [Vision Transformer](https://arxiv.org/abs/2010.11929), and more). Once you have understood the encoder architecture, the decoder is a very small step to implement as well. The full Transformer architecture looks as follows (figure credit - [Vaswani et al., 2017](https://arxiv.org/abs/1706.03762)):\n",
+ "\n",
+ ":::{figure} ../image/transformer_architecture.svg\n",
+ ":::\n",
+ "\n",
+ "The encoder consists of $N$ identical blocks that are applied in sequence. The input $x$ is first passed through a Multi-Head Attention block, as implemented in the attention notebook. The output is added to the original input using a residual connection, and we then apply Layer Normalization to the sum. Overall, it calculates $\\text{LayerNorm}(x+\\text{Multihead}(x,x,x))$ ($x$ being $Q$, $K$ and $V$ input to the attention layer). The residual connection is crucial in the Transformer architecture for two reasons: \n",
+ "\n",
+ "1. Similar to ResNets, Transformers are designed to be very deep. Some models contain more than 24 blocks in the encoder. Hence, the residual connections are crucial for enabling a smooth gradient flow through the model.\n",
+ "2. Without the residual connection, the information about the original sequence is lost. Remember that the Multi-Head Attention layer ignores the position of elements in a sequence, and can only learn it based on the input features. Removing the residual connections would mean that this information is lost after the first attention layer (after initialization), and with randomly initialized query and key vectors, the output vector for position $i$ has no relation to its original input. All outputs of the attention are likely to represent similar/same information, and there is no chance for the model to distinguish which information came from which input element. An alternative to residual connections would be to fix at least one head to focus on its original input, but this is very inefficient and does not have the benefit of the improved gradient flow.\n",
+ "\n",
+ "The Layer Normalization also plays an important role in the Transformer architecture as it enables faster training and provides a small amount of regularization. Additionally, it ensures that the features have a similar magnitude among the elements in the sequence. We are not using Batch Normalization because it depends on the batch size, which is often small with Transformers (they require a lot of GPU memory), and BatchNorm has been shown to perform particularly badly in language as the features of words tend to have a much higher variance (there are many, very rare words which need to be considered for a good distribution estimate).\n",
+ "\n",
+ "In addition to the Multi-Head Attention, a small fully connected feed-forward network is added to the model, which is applied to each position separately and identically. Specifically, the model uses a Linear$\\to$ReLU$\\to$Linear MLP. The full transformation including the residual connection can be expressed as: \n",
+ "\n",
+ "$$\n",
+ "\\begin{split}\n",
+ " \\text{FFN}(x) & = \\max(0, xW_1+b_1)W_2 + b_2\\\\\n",
+ " x & = \\text{LayerNorm}(x + \\text{FFN}(x))\n",
+ "\\end{split}\n",
+ "$$\n",
+ "\n",
+ "This MLP adds extra complexity to the model and allows transformations on each sequence element separately. You can think of this as allowing the model to \"post-process\" the new information added by the previous Multi-Head Attention, and prepare it for the next attention block. Usually, the inner dimensionality of the MLP is 2-8$\\times$ larger than $d_{\\text{model}}$, i.e. the dimensionality of the original input $x$. The general advantage of a wider layer instead of a narrow, multi-layer MLP is the faster, parallelizable execution.\n",
+ "\n",
+ "Finally, after looking at all parts of the encoder architecture, we can start implementing it below. We first start by implementing a single encoder block. In addition to the layers described above, we will add dropout layers in the MLP and on the output of the MLP and Multi-Head Attention for regularization."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 6,
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "class EncoderBlock(nn.Module):\n",
+ "\n",
+ "    def __init__(self, input_dim, num_heads, dim_feedforward, dropout=0.0):\n",
+ "        \"\"\"\n",
+ "        Inputs:\n",
+ "            input_dim - Dimensionality of the input\n",
+ "            num_heads - Number of heads to use in the attention block\n",
+ "            dim_feedforward - Dimensionality of the hidden layer in the MLP\n",
+ "            dropout - Dropout probability to use in the dropout layers\n",
+ "        \"\"\"\n",
+ "        super().__init__()\n",
+ "\n",
+ "        # Attention layer\n",
+ "        self.self_attn = MultiheadAttention(input_dim, input_dim, num_heads)\n",
+ "\n",
+ "        # Two-layer MLP\n",
+ "        self.linear_net = nn.Sequential(\n",
+ "            nn.Linear(input_dim, dim_feedforward),\n",
+ "            nn.Dropout(dropout),\n",
+ "            nn.ReLU(inplace=True),\n",
+ "            nn.Linear(dim_feedforward, input_dim)\n",
+ "        )\n",
+ "\n",
+ "        # Layers to apply in between the main layers\n",
+ "        self.norm1 = nn.LayerNorm(input_dim)\n",
+ "        self.norm2 = nn.LayerNorm(input_dim)\n",
+ "        self.dropout = nn.Dropout(dropout)\n",
+ "\n",
+ "    def forward(self, x, mask=None):\n",
+ "        # Attention part\n",
+ "        attn_out = self.self_attn(x, mask=mask)\n",
+ "        x = x + self.dropout(attn_out)\n",
+ "        x = self.norm1(x)\n",
+ "\n",
+ "        # MLP part\n",
+ "        linear_out = self.linear_net(x)\n",
+ "        x = x + self.dropout(linear_out)\n",
+ "        x = self.norm2(x)\n",
+ "\n",
+ "        return x"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "Based on this block, we can implement a module for the full Transformer encoder. In addition to a forward function that iterates through the sequence of encoder blocks, we also provide a function called `get_attention_maps`. It returns the attention probabilities of all Multi-Head Attention blocks in the encoder, which helps us understand and, to some extent, explain the model. However, the attention probabilities should be taken with a grain of salt, as they do not necessarily reflect what the model actually relies on (there is a series of papers about this, including [Attention is not Explanation](https://arxiv.org/abs/1902.10186) and [Attention is not not Explanation](https://arxiv.org/abs/1908.04626))."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 7,
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "class TransformerEncoder(nn.Module):\n",
+ "\n",
+ "    def __init__(self, num_layers, **block_args):\n",
+ "        super().__init__()\n",
+ "        self.layers = nn.ModuleList([EncoderBlock(**block_args) for _ in range(num_layers)])\n",
+ "\n",
+ "    def forward(self, x, mask=None):\n",
+ "        for l in self.layers:\n",
+ "            x = l(x, mask=mask)\n",
+ "        return x\n",
+ "\n",
+ "    def get_attention_maps(self, x, mask=None):\n",
+ "        attention_maps = []\n",
+ "        for l in self.layers:\n",
+ "            _, attn_map = l.self_attn(x, mask=mask, return_attention=True)\n",
+ "            attention_maps.append(attn_map)\n",
+ "            # Pass the mask here as well, so later layers see the same masked input as in forward()\n",
+ "            x = l(x, mask=mask)\n",
+ "        return attention_maps"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "### Positional encoding\n",
+ "\n",
+ "We have discussed before that the Multi-Head Attention block is permutation-equivariant, and cannot distinguish whether an input comes before another one in the sequence or not. In tasks like language understanding, however, the position is important for interpreting the input words. The position information can therefore be added via the input features. We could learn an embedding for every possible position, but this would not generalize to varying input sequence lengths. Hence, the better option is to use feature patterns that the network can identify from the features and potentially generalize to longer sequences. The specific pattern chosen by Vaswani et al. consists of sine and cosine functions of different frequencies, as follows:\n",
+ "\n",
+ "$$\n",
+ "PE_{(pos,i)} = \\begin{cases}\n",
+ " \\sin\\left(\\frac{pos}{10000^{i/d_{\\text{model}}}}\\right) & \\text{if}\\hspace{3mm} i \\text{ mod } 2=0\\\\\n",
+ " \\cos\\left(\\frac{pos}{10000^{(i-1)/d_{\\text{model}}}}\\right) & \\text{otherwise}\\\\\n",
+ "\\end{cases}\n",
+ "$$\n",
+ "\n",
+ "$PE_{(pos,i)}$ represents the position encoding at position $pos$ in the sequence, and hidden dimensionality $i$. These values, concatenated for all hidden dimensions, are added to the original input features (in the Transformer visualization above, see \"Positional encoding\"), and constitute the position information. We distinguish between even ($i \\text{ mod } 2=0$) and odd ($i \\text{ mod } 2=1$) hidden dimensions, applying a sine to the former and a cosine to the latter. The intuition behind this encoding is that you can represent $PE_{(pos+k,:)}$ as a linear function of $PE_{(pos,:)}$, which might allow the model to easily attend to relative positions. The wavelengths in different dimensions range from $2\\pi$ to $10000\\cdot 2\\pi$.\n",
+ "\n",
+ "The positional encoding is implemented below. The code is taken from the [PyTorch tutorial](https://pytorch.org/tutorials/beginner/transformer_tutorial.html#define-the-model) about Transformers on NLP and adjusted for our purposes."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 8,
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "class PositionalEncoding(nn.Module):\n",
+ "\n",
+ "    def __init__(self, d_model, max_len=5000):\n",
+ "        \"\"\"\n",
+ "        Inputs\n",
+ "            d_model - Hidden dimensionality of the input.\n",
+ "            max_len - Maximum length of a sequence to expect.\n",
+ "        \"\"\"\n",
+ "        super().__init__()\n",
+ "\n",
+ "        # Create matrix of [SeqLen, HiddenDim] representing the positional encoding for max_len inputs\n",
+ "        pe = torch.zeros(max_len, d_model)\n",
+ "        position = torch.arange(0, max_len, dtype=torch.float).unsqueeze(1)\n",
+ "        div_term = torch.exp(torch.arange(0, d_model, 2).float() * (-math.log(10000.0) / d_model))\n",
+ "        pe[:, 0::2] = torch.sin(position * div_term)\n",
+ "        pe[:, 1::2] = torch.cos(position * div_term)\n",
+ "        pe = pe.unsqueeze(0)\n",
+ "\n",
+ "        # register_buffer => Tensor which is not a parameter, but should be part of the module's state.\n",
+ "        # Used for tensors that need to be on the same device as the module.\n",
+ "        # persistent=False tells PyTorch to not add the buffer to the state dict (e.g. when we save the model)\n",
+ "        self.register_buffer('pe', pe, persistent=False)\n",
+ "\n",
+ "    def forward(self, x):\n",
+ "        x = x + self.pe[:, :x.size(1)]\n",
+ "        return x"
+ ]
+ },
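+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "The claim that $PE_{(pos+k,:)}$ is a linear function of $PE_{(pos,:)}$ can be checked numerically. The following is a minimal sketch, not part of the original tutorial: it rebuilds the encoding with the same formula as `PositionalEncoding` above and applies one $2\\times 2$ rotation per sine/cosine pair. The offset `k`, the matrix name `shift` and the tolerance `atol` are illustrative assumptions."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "import math\n",
+ "import torch\n",
+ "\n",
+ "d_model, max_len, k = 48, 96, 5  # illustrative values; k is the position offset\n",
+ "\n",
+ "# Same construction as in PositionalEncoding, without the batch dimension\n",
+ "position = torch.arange(max_len, dtype=torch.float).unsqueeze(1)\n",
+ "div_term = torch.exp(torch.arange(0, d_model, 2).float() * (-math.log(10000.0) / d_model))\n",
+ "pe = torch.zeros(max_len, d_model)\n",
+ "pe[:, 0::2] = torch.sin(position * div_term)\n",
+ "pe[:, 1::2] = torch.cos(position * div_term)\n",
+ "\n",
+ "# Block-diagonal matrix with one 2x2 rotation per (sin, cos) pair, using\n",
+ "#   sin(w(p+k)) =  sin(wp) cos(wk) + cos(wp) sin(wk)\n",
+ "#   cos(w(p+k)) = -sin(wp) sin(wk) + cos(wp) cos(wk)\n",
+ "shift = torch.zeros(d_model, d_model)\n",
+ "for j, w in enumerate(div_term.tolist()):\n",
+ "    c, s = math.cos(k * w), math.sin(k * w)\n",
+ "    shift[2 * j, 2 * j], shift[2 * j, 2 * j + 1] = c, s\n",
+ "    shift[2 * j + 1, 2 * j], shift[2 * j + 1, 2 * j + 1] = -s, c\n",
+ "\n",
+ "# The matrix depends only on k, not on the position it is applied to\n",
+ "shifted = pe[:-k] @ shift.T\n",
+ "print(torch.allclose(shifted, pe[k:], atol=1e-4))"
+ ]
+ },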
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "To understand the positional encoding, we can visualize it below. We will generate an image of the positional encoding over hidden dimensionality and position in a sequence. Each pixel therefore represents the value added to an input feature to encode a specific position."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 9,
+ "metadata": {},
+ "outputs": [
+ {
+ "data": {
+ "application/pdf": "\n",
+ "image/svg+xml": [
+ "<!-- SVG output stripped (Matplotlib v3.3.2, 2020-11-09): heatmap \"Positional encoding over hidden dimensions\"; x-axis: position in sequence, y-axis: hidden dimension -->\n"
+ ],
+ "text/plain": [
+ ""
+ ]
+ },
+ "metadata": {
+ "needs_background": "light"
+ },
+ "output_type": "display_data"
+ }
+ ],
+ "source": [
+ "encod_block = PositionalEncoding(d_model=48, max_len=96)\n",
+ "pe = encod_block.pe.squeeze().T.cpu().numpy()\n",
+ "\n",
+ "fig, ax = plt.subplots(nrows=1, ncols=1, figsize=(8,3))\n",
+ "pos = ax.imshow(pe, cmap=\"RdGy\", extent=(1,pe.shape[1]+1,pe.shape[0]+1,1))\n",
+ "fig.colorbar(pos, ax=ax)\n",
+ "ax.set_xlabel(\"Position in sequence\")\n",
+ "ax.set_ylabel(\"Hidden dimension\")\n",
+ "ax.set_title(\"Positional encoding over hidden dimensions\")\n",
+ "ax.set_xticks([1]+[i*10 for i in range(1,1+pe.shape[1]//10)])\n",
+ "ax.set_yticks([1]+[i*10 for i in range(1,1+pe.shape[0]//10)])\n",
+ "plt.show()"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "You can clearly see the sine and cosine waves with different wavelengths that encode the position in the hidden dimensions. Specifically, we can look at the sine/cosine wave for each hidden dimension separately, to get a better intuition of the pattern. Below we visualize the positional encoding for the hidden dimensions $1$, $2$, $3$ and $4$."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 10,
+ "metadata": {},
+ "outputs": [
+ {
+ "data": {
+ "application/pdf": "\n",
+ "image/svg+xml": [
+ "<!-- SVG output stripped (Matplotlib v3.3.2, 2020-11-09): 2x2 line plots \"Encoding in hidden dimension 1\" to \"Encoding in hidden dimension 4\"; x-axis: position in sequence (1-16), y-axis: positional encoding -->\n"
+ ],
+ "text/plain": [
+ ""
+ ]
+ },
+ "metadata": {},
+ "output_type": "display_data"
+ }
+ ],
+ "source": [
+ "sns.set_theme()\n",
+ "fig, ax = plt.subplots(2, 2, figsize=(12,4))\n",
+ "ax = [a for a_list in ax for a in a_list]\n",
+ "for i in range(len(ax)):\n",
+ "    ax[i].plot(np.arange(1,17), pe[i,:16], color=f'C{i}', marker=\"o\", markersize=6, markeredgecolor=\"black\")\n",
+ "    ax[i].set_title(f\"Encoding in hidden dimension {i+1}\")\n",
+ "    ax[i].set_xlabel(\"Position in sequence\", fontsize=10)\n",
+ "    ax[i].set_ylabel(\"Positional encoding\", fontsize=10)\n",
+ "    ax[i].set_xticks(np.arange(1,17))\n",
+ "    ax[i].tick_params(axis='both', which='major', labelsize=10)\n",
+ "    ax[i].tick_params(axis='both', which='minor', labelsize=8)\n",
+ "    ax[i].set_ylim(-1.2, 1.2)\n",
+ "fig.subplots_adjust(hspace=0.8)\n",
+ "sns.reset_orig()\n",
+ "plt.show()"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "As we can see, the patterns in hidden dimensions $1$ and $2$ differ only in the starting angle. Their wavelength is $2\\pi$, hence the repetition after about $6$ positions. Hidden dimensions $3$ and $4$ share a longer wavelength, about $1.5$ times as long for $d_{\\text{model}}=48$ (a factor of $10000^{2/48}\\approx 1.47$). "
+ ]
+ },
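+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "The wavelengths can also be computed directly from the formula. This is a small sketch, assuming the same `d_model=48` as in the plots above; the variable names are illustrative. Each sine/cosine pair starting at the even zero-based index $i$ has the wavelength $2\\pi\\cdot 10000^{i/d_{\\text{model}}}$, measured in positions."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "import math\n",
+ "\n",
+ "d_model = 48  # assumed to match the visualization above\n",
+ "\n",
+ "# i is the zero-based even index; the plots above label dimensions starting at 1\n",
+ "for i in range(0, 8, 2):\n",
+ "    wavelength = 2 * math.pi * 10000 ** (i / d_model)\n",
+ "    print(f\"hidden dimensions {i+1} and {i+2}: wavelength = {wavelength:.2f} positions\")"
+ ]
+ },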
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "### Learning rate warm-up\n",
+ "\n",
+ "One commonly used technique for training a Transformer is learning rate warm-up. This means that we gradually increase the learning rate from 0 to our originally specified learning rate over the first few iterations. Thus, we start learning slowly instead of taking very large steps from the beginning. In fact, training a deep Transformer without learning rate warm-up can make the model diverge and achieve much worse performance on training and testing. Take for instance the following plot by [Liu et al. (2019)](https://arxiv.org/pdf/1908.03265.pdf) comparing Adam-vanilla (i.e. Adam without warm-up) vs Adam with a warm-up:\n",
+ "\n",
+ ":::{figure} ../image/warmup_loss_plot.svg\n",
+ ":::\n",
+ "\n",
+ "Clearly, the warm-up is a crucial hyperparameter when training the Transformer architecture. Why is it so important? There are currently two common explanations. Firstly, Adam uses bias correction factors, which can however lead to a higher variance in the adaptive learning rate during the first iterations. Improved optimizers like [RAdam](https://arxiv.org/abs/1908.03265) have been shown to overcome this issue, not requiring warm-up for training Transformers. Secondly, the iteratively applied Layer Normalization across layers can lead to very high gradients during the first iterations, which can be solved by using [Pre-Layer Normalization](https://proceedings.icml.cc/static/paper_files/icml/2020/328-Paper.pdf) (similar to Pre-Activation ResNet), or by replacing Layer Normalization with other techniques ([Adaptive Normalization](https://proceedings.icml.cc/static/paper_files/icml/2020/328-Paper.pdf), [Power Normalization](https://arxiv.org/abs/2003.07845)). \n",
+ "\n",
+ "Nevertheless, many applications and papers still use the original Transformer architecture with Adam, because warm-up is a simple yet effective way of solving the gradient problem in the first iterations. There are many different schedulers we could use. For instance, the original Transformer paper used a linear warm-up followed by a decay proportional to the inverse square root of the step number. However, the currently most popular scheduler is the cosine warm-up scheduler, which combines warm-up with a cosine-shaped learning rate decay. We implement it below and visualize the learning rate factor over iterations. "
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 11,
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "class CosineWarmupScheduler(optim.lr_scheduler._LRScheduler):\n",
+ "\n",
+ "    def __init__(self, optimizer, warmup, max_iters):\n",
+ "        self.warmup = warmup\n",
+ "        self.max_num_iters = max_iters\n",
+ "        super().__init__(optimizer)\n",
+ "\n",
+ "    def get_lr(self):\n",
+ "        lr_factor = self.get_lr_factor(epoch=self.last_epoch)\n",
+ "        return [base_lr * lr_factor for base_lr in self.base_lrs]\n",
+ "\n",
+ "    def get_lr_factor(self, epoch):\n",
+ "        # Cosine decay from 1 to 0 over max_num_iters, scaled linearly during warm-up\n",
+ "        lr_factor = 0.5 * (1 + np.cos(np.pi * epoch / self.max_num_iters))\n",
+ "        if epoch <= self.warmup:\n",
+ "            lr_factor *= epoch * 1.0 / self.warmup\n",
+ "        return lr_factor"
+ ]
+ },
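+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "For comparison with the cosine warm-up scheduler above, the learning rate formula given in the original Transformer paper (Vaswani et al., 2017), $lrate = d_{\\text{model}}^{-0.5}\\cdot\\min(step^{-0.5},\\ step\\cdot warmup\\_steps^{-1.5})$, can be sketched as a plain function. The function name `inverse_sqrt_lr` is hypothetical; the defaults `d_model=512` and `warmup_steps=4000` are the values of the paper's base model. This is only an illustration and is not used in the rest of the notebook."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "def inverse_sqrt_lr(step, d_model=512, warmup_steps=4000):\n",
+ "    # Linear warm-up for the first warmup_steps steps, then decay proportional to 1/sqrt(step)\n",
+ "    step = max(step, 1)  # guard against 0 ** -0.5 at step 0\n",
+ "    return d_model ** -0.5 * min(step ** -0.5, step * warmup_steps ** -1.5)\n",
+ "\n",
+ "for step in [1, 1000, 4000, 16000, 64000]:\n",
+ "    print(f\"step {step:>6}: lr = {inverse_sqrt_lr(step):.2e}\")"
+ ]
+ },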
+ {
+ "cell_type": "code",
+ "execution_count": 12,
+ "metadata": {},
+ "outputs": [
+ {
+ "data": {
+ "application/pdf": "\n",
+ "image/svg+xml": [
+ "<!-- SVG output stripped (Matplotlib v3.3.2, 2020-11-09): plot of the learning rate factor of the cosine warm-up scheduler -->\n"
+ ],
+ "text/plain": [
+ ""
+ ]
+ },
+ "metadata": {},
+ "output_type": "display_data"
+ }
+ ],
+ "source": [
+ "# Needed for initializing the lr scheduler\n",
+ "p = nn.Parameter(torch.empty(4,4))\n",
+ "optimizer = optim.Adam([p], lr=1e-3)\n",
+ "lr_scheduler = CosineWarmupScheduler(optimizer=optimizer, warmup=100, max_iters=2000)\n",
+ "\n",
+ "# Plotting\n",
+ "epochs = list(range(2000))\n",
+ "sns.set()\n",
+ "plt.figure(figsize=(8,3))\n",
+ "plt.plot(epochs, [lr_scheduler.get_lr_factor(e) for e in epochs])\n",
+ "plt.ylabel(\"Learning rate factor\")\n",
+ "plt.xlabel(\"Iterations (in batches)\")\n",
+ "plt.title(\"Cosine Warm-up Learning Rate Scheduler\")\n",
+ "plt.show()\n",
+ "sns.reset_orig()"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "In the first 100 iterations, we increase the learning rate factor from 0 to 1, whereas for all later iterations, we decay it following a cosine curve. Ready-made implementations of this scheduler are available in the popular NLP Transformer library [Hugging Face](https://huggingface.co/transformers/main_classes/optimizer_schedules.html?highlight=cosine#transformers.get_cosine_schedule_with_warmup)."
+ ]
+ },
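+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "To make the shape of this curve concrete, the sketch below recomputes the learning rate factor in plain Python. It assumes that the factor is a half cosine over `max_iters` iterations, multiplied by a linear ramp `it / warmup` during the warm-up phase. The function name `cosine_warmup_factor` is hypothetical, and the formula is meant to illustrate the schedule, not to replace `CosineWarmupScheduler`.\n",
+ "\n",
+ "```python\n",
+ "import math\n",
+ "\n",
+ "\n",
+ "def cosine_warmup_factor(it, warmup=100, max_iters=2000):\n",
+ "    # Assumed schedule: half-cosine decay from 1 (it = 0) to 0 (it = max_iters)\n",
+ "    factor = 0.5 * (1.0 + math.cos(math.pi * it / max_iters))\n",
+ "    # Linear ramp from 0 to 1 during the warm-up phase\n",
+ "    if it <= warmup:\n",
+ "        factor *= it / warmup\n",
+ "    return factor\n",
+ "\n",
+ "\n",
+ "for it in (0, 50, 100, 1000, 2000):\n",
+ "    print(it, round(cosine_warmup_factor(it), 4))\n",
+ "```\n",
+ "\n",
+ "Under this assumed formula, the factor at the end of the warm-up is slightly below 1, because the cosine term has already started to decay by then."
+ ]
+ },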
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "### PyTorch Lightning Module\n",
+ "\n",
+ "Finally, we can embed the Transformer architecture into a PyTorch Lightning module. From Tutorial 5, you know that PyTorch Lightning simplifies our training and test code and structures it nicely in separate functions. We will implement a template for a classifier based on the Transformer encoder, which gives us one prediction per sequence element. If we needed a classifier over the whole sequence, the common approach would be to add an additional `[CLS]` token to the sequence, representing the classifier token. However, here we focus on tasks where we have an output per element.\n",
+ "\n",
+ "In addition to the Transformer architecture, we add a small input network (maps input dimensions to model dimensions), the positional encoding, and an output network (transforms output encodings to predictions). We also add the learning rate scheduler, which takes a step each iteration instead of once per epoch. This is needed for the warmup and the smooth cosine decay. The training, validation, and test steps are left unimplemented for now and will be filled in by our task-specific models."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 13,
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "class TransformerPredictor(pl.LightningModule):\n",
+ "\n",
+ "    def __init__(self, input_dim, model_dim, num_classes, num_heads, num_layers, lr, warmup, max_iters, dropout=0.0, input_dropout=0.0):\n",
+ "        \"\"\"\n",
+ "        Inputs:\n",
+ "            input_dim - Hidden dimensionality of the input\n",
+ "            model_dim - Hidden dimensionality to use inside the Transformer\n",
+ "            num_classes - Number of classes to predict per sequence element\n",
+ "            num_heads - Number of heads to use in the Multi-Head Attention blocks\n",
+ "            num_layers - Number of encoder blocks to use.\n",
+ "            lr - Learning rate in the optimizer\n",
+ "            warmup - Number of warmup steps. Usually between 50 and 500\n",
+ "            max_iters - Number of maximum iterations the model is trained for. This is needed for the CosineWarmup scheduler\n",
+ "            dropout - Dropout to apply inside the model\n",
+ "            input_dropout - Dropout to apply on the input features\n",
+ "        \"\"\"\n",
+ "        super().__init__()\n",
+ "        self.save_hyperparameters()\n",
+ "        self._create_model()\n",
+ "\n",
+ "    def _create_model(self):\n",
+ "        # Input dim -> Model dim\n",
+ "        self.input_net = nn.Sequential(\n",
+ "            nn.Dropout(self.hparams.input_dropout),\n",
+ "            nn.Linear(self.hparams.input_dim, self.hparams.model_dim)\n",
+ "        )\n",
+ "        # Positional encoding for sequences\n",
+ "        self.positional_encoding = PositionalEncoding(d_model=self.hparams.model_dim)\n",
+ "        # Transformer\n",
+ "        self.transformer = TransformerEncoder(num_layers=self.hparams.num_layers,\n",
+ "                                              input_dim=self.hparams.model_dim,\n",
+ "                                              dim_feedforward=2*self.hparams.model_dim,\n",
+ "                                              num_heads=self.hparams.num_heads,\n",
+ "                                              dropout=self.hparams.dropout)\n",
+ "        # Output classifier per sequence element\n",
+ "        self.output_net = nn.Sequential(\n",
+ "            nn.Linear(self.hparams.model_dim, self.hparams.model_dim),\n",
+ "            nn.LayerNorm(self.hparams.model_dim),\n",
+ "            nn.ReLU(inplace=True),\n",
+ "            nn.Dropout(self.hparams.dropout),\n",
+ "            nn.Linear(self.hparams.model_dim, self.hparams.num_classes)\n",
+ "        )\n",
+ "\n",
+ "    def forward(self, x, mask=None, add_positional_encoding=True):\n",
+ "        \"\"\"\n",
+ "        Inputs:\n",
+ "            x - Input features of shape [Batch, SeqLen, input_dim]\n",
+ "            mask - Mask to apply on the attention outputs (optional)\n",
+ "            add_positional_encoding - If True, we add the positional encoding to the input.\n",
+ "                                      Might not be desired for some tasks.\n",
+ "        \"\"\"\n",
+ "        x = self.input_net(x)\n",
+ "        if add_positional_encoding:\n",
+ "            x = self.positional_encoding(x)\n",
+ "        x = self.transformer(x, mask=mask)\n",
+ "        x = self.output_net(x)\n",
+ "        return x\n",
+ "\n",
+ "    @torch.no_grad()\n",
+ "    def get_attention_maps(self, x, mask=None, add_positional_encoding=True):\n",
+ "        \"\"\"\n",
+ "        Function for extracting the attention matrices of the whole Transformer for a single batch.\n",
+ "        Input arguments same as the forward pass.\n",
+ "        \"\"\"\n",
+ "        x = self.input_net(x)\n",
+ "        if add_positional_encoding:\n",
+ "            x = self.positional_encoding(x)\n",
+ "        attention_maps = self.transformer.get_attention_maps(x, mask=mask)\n",
+ "        return attention_maps\n",
+ "\n",
+ "    def configure_optimizers(self):\n",
+ "        optimizer = optim.Adam(self.parameters(), lr=self.hparams.lr)\n",
+ "\n",
+ "        # Apply lr scheduler per step\n",
+ "        lr_scheduler = CosineWarmupScheduler(optimizer,\n",
+ "                                             warmup=self.hparams.warmup,\n",
+ "                                             max_iters=self.hparams.max_iters)\n",
+ "        return [optimizer], [{'scheduler': lr_scheduler, 'interval': 'step'}]\n",
+ "\n",
+ "    def training_step(self, batch, batch_idx):\n",
+ "        raise NotImplementedError\n",
+ "\n",
+ "    def validation_step(self, batch, batch_idx):\n",
+ "        raise NotImplementedError\n",
+ "\n",
+ "    def test_step(self, batch, batch_idx):\n",
+ "        raise NotImplementedError"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "## Experiments\n",
+ "\n",
+ "After having finished the implementation of the Transformer architecture, we can start experimenting and apply it to various tasks. In this notebook, we will focus on two tasks: parallel Sequence-to-Sequence, and set anomaly detection. The two tasks focus on different properties of the Transformer architecture, and we go through them below.\n",
+ "\n",
+ "### Sequence to Sequence\n",
+ "\n",
+ "A Sequence-to-Sequence task is a task where both the input _and_ the output are sequences, not necessarily of the same length. Popular tasks in this domain include machine translation and summarization. For this, we usually have a Transformer encoder for interpreting the input sequence, and a decoder for generating the output in an autoregressive manner. Here, however, we will go back to a much simpler example task and use only the encoder. Given a sequence of $N$ numbers between $0$ and $M$, the task is to reverse the input sequence. In NumPy notation, if our input is `x`, the output should be `x[::-1]`. Although this task sounds very simple, RNNs can struggle with it because it requires long-term dependencies. Transformers are built to support such dependencies, and hence, we expect the model to perform very well.\n",
+ "\n",
+ "First, let's create a dataset class below."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 14,
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "class ReverseDataset(data.Dataset):\n",
+ "\n",
+ "    def __init__(self, num_categories, seq_len, size):\n",
+ "        super().__init__()\n",
+ "        self.num_categories = num_categories\n",
+ "        self.seq_len = seq_len\n",
+ "        self.size = size\n",
+ "\n",
+ "        self.data = torch.randint(self.num_categories, size=(self.size, self.seq_len))\n",
+ "\n",
+ "    def __len__(self):\n",
+ "        return self.size\n",
+ "\n",
+ "    def __getitem__(self, idx):\n",
+ "        inp_data = self.data[idx]\n",
+ "        labels = torch.flip(inp_data, dims=(0,))\n",
+ "        return inp_data, labels"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "We create an arbitrary number of random sequences of numbers between 0 and `num_categories-1`. The label is simply the tensor flipped over the sequence dimension. We can create the corresponding data loaders below. "
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 15,
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "dataset = partial(ReverseDataset, 10, 16)\n",
+ "train_loader = data.DataLoader(dataset(50000), batch_size=128, shuffle=True, drop_last=True, pin_memory=True)\n",
+ "val_loader = data.DataLoader(dataset(1000), batch_size=128)\n",
+ "test_loader = data.DataLoader(dataset(10000), batch_size=128)"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "Let's look at an arbitrary sample of the dataset:"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 16,
+ "metadata": {},
+ "outputs": [
+ {
+ "name": "stdout",
+ "output_type": "stream",
+ "text": [
+ "Input data: tensor([9, 6, 2, 0, 6, 2, 7, 9, 7, 3, 3, 4, 3, 7, 0, 9])\n",
+ "Labels: tensor([9, 0, 7, 3, 4, 3, 3, 7, 9, 7, 2, 6, 0, 2, 6, 9])\n"
+ ]
+ }
+ ],
+ "source": [
+ "inp_data, labels = train_loader.dataset[0]\n",
+ "print(\"Input data:\", inp_data)\n",
+ "print(\"Labels: \", labels)"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "During training, we pass the input sequence through the Transformer encoder and predict the output for each input token. We use the standard Cross-Entropy loss to perform this. Every number is represented as a one-hot vector. Remember that representing the categories as single scalars would severely limit the expressiveness of the model, since $0$ and $1$ are not more closely related than $0$ and $9$ in our example. An alternative to a one-hot vector is a learned embedding vector, as provided by the PyTorch module `nn.Embedding`. However, using a one-hot vector with an additional linear layer as in our case has the same effect as an embedding layer (`self.input_net` maps the one-hot vector to a dense vector, where each row of the weight matrix represents the embedding for a specific category).\n",
+ "\n",
+ "To implement the training dynamic, we create a new class inheriting from `TransformerPredictor` and overriding the training, validation, and test step functions."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 17,
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "class ReversePredictor(TransformerPredictor):\n",
+ "\n",
+ "    def _calculate_loss(self, batch, mode=\"train\"):\n",
+ "        # Fetch data and transform categories to one-hot vectors\n",
+ "        inp_data, labels = batch\n",
+ "        inp_data = F.one_hot(inp_data, num_classes=self.hparams.num_classes).float()\n",
+ "\n",
+ "        # Perform prediction and calculate loss and accuracy\n",
+ "        preds = self.forward(inp_data, add_positional_encoding=True)\n",
+ "        loss = F.cross_entropy(preds.view(-1,preds.size(-1)), labels.view(-1))\n",
+ "        acc = (preds.argmax(dim=-1) == labels).float().mean()\n",
+ "\n",
+ "        # Logging\n",
+ "        self.log(f\"{mode}_loss\", loss)\n",
+ "        self.log(f\"{mode}_acc\", acc)\n",
+ "        return loss, acc\n",
+ "\n",
+ "    def training_step(self, batch, batch_idx):\n",
+ "        loss, _ = self._calculate_loss(batch, mode=\"train\")\n",
+ "        return loss\n",
+ "\n",
+ "    def validation_step(self, batch, batch_idx):\n",
+ "        _ = self._calculate_loss(batch, mode=\"val\")\n",
+ "\n",
+ "    def test_step(self, batch, batch_idx):\n",
+ "        _ = self._calculate_loss(batch, mode=\"test\")"
+ ]
+ },
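+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "The claim that a one-hot vector followed by a linear layer acts like an embedding lookup can be checked with a small plain-Python sketch. It uses a hypothetical 3-category, 2-dimensional weight table and ignores the bias term that `nn.Linear` adds by default; the helper names `one_hot` and `linear_no_bias` are made up for this illustration.\n",
+ "\n",
+ "```python\n",
+ "def one_hot(index, num_categories):\n",
+ "    # Vector with a single 1.0 at position `index`\n",
+ "    return [1.0 if i == index else 0.0 for i in range(num_categories)]\n",
+ "\n",
+ "\n",
+ "def linear_no_bias(x, weight):\n",
+ "    # Computes x @ weight for a weight table of shape [num_categories, model_dim]\n",
+ "    model_dim = len(weight[0])\n",
+ "    return [sum(x[i] * weight[i][d] for i in range(len(x))) for d in range(model_dim)]\n",
+ "\n",
+ "\n",
+ "# Hypothetical embedding table: one row per category\n",
+ "weight = [[0.1, 0.2],\n",
+ "          [0.3, 0.4],\n",
+ "          [0.5, 0.6]]\n",
+ "\n",
+ "for category in range(3):\n",
+ "    dense = linear_no_bias(one_hot(category, 3), weight)\n",
+ "    # Multiplying by a one-hot vector selects exactly one row of the table\n",
+ "    assert dense == weight[category]\n",
+ "    print(category, dense)\n",
+ "```\n",
+ "\n",
+ "With a bias term, every category's vector would be shifted by the same offset, so the correspondence to an embedding table still holds up to that shift."
+ ]
+ },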
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "Finally, we can create a training function similar to the one we have seen in Tutorial 5 for PyTorch Lightning. We create a `pl.Trainer` object, running for $N$ epochs, logging in TensorBoard, and saving our best model based on the validation accuracy. Afterward, we test our model on the test set. An additional parameter we pass to the trainer here is `gradient_clip_val`. This clips the norm of the gradients for all parameters before taking an optimizer step and prevents the model from diverging if we obtain very high gradients at, for instance, sharp loss surfaces (see one of the many good blog posts on gradient clipping, like the [DeepAI glossary](https://deepai.org/machine-learning-glossary-and-terms/gradient-clipping)). For Transformers, gradient clipping can help to further stabilize the training during the first few iterations, and also afterward. In plain PyTorch, you can apply gradient clipping via `torch.nn.utils.clip_grad_norm_(...)` (see [documentation](https://pytorch.org/docs/stable/generated/torch.nn.utils.clip_grad_norm_.html#torch.nn.utils.clip_grad_norm_)). The clip value is usually between 0.5 and 10, depending on how harshly you want to clip large gradients. With this in place, let's implement the training function:"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 18,
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "def train_reverse(**kwargs):\n",
+ "    # Create a PyTorch Lightning trainer with a checkpoint callback\n",
+ "    root_dir = os.path.join(CHECKPOINT_PATH, \"ReverseTask\")\n",
+ "    os.makedirs(root_dir, exist_ok=True)\n",
+ "    trainer = pl.Trainer(default_root_dir=root_dir,\n",
+ "                         callbacks=[ModelCheckpoint(save_weights_only=True, mode=\"max\", monitor=\"val_acc\")],\n",
+ "                         accelerator=\"gpu\" if str(device).startswith(\"cuda\") else \"cpu\",\n",
+ "                         devices=1,\n",
+ "                         max_epochs=10,\n",
+ "                         gradient_clip_val=5)\n",
+ "    trainer.logger._default_hp_metric = None # Optional logging argument that we don't need\n",
+ "\n",
+ "    # Check whether pretrained model exists. If yes, load it and skip training\n",
+ "    pretrained_filename = os.path.join(CHECKPOINT_PATH, \"ReverseTask.ckpt\")\n",
+ "    if os.path.isfile(pretrained_filename):\n",
+ "        print(\"Found pretrained model, loading...\")\n",
+ "        model = ReversePredictor.load_from_checkpoint(pretrained_filename)\n",
+ "    else:\n",
+ "        model = ReversePredictor(max_iters=trainer.max_epochs*len(train_loader), **kwargs)\n",
+ "        trainer.fit(model, train_loader, val_loader)\n",
+ "\n",
+ "    # Test best model on validation and test set\n",
+ "    val_result = trainer.test(model, val_loader, verbose=False)\n",
+ "    test_result = trainer.test(model, test_loader, verbose=False)\n",
+ "    result = {\"test_acc\": test_result[0][\"test_acc\"], \"val_acc\": val_result[0][\"test_acc\"]}\n",
+ "\n",
+ "    model = model.to(device)\n",
+ "    return model, result"
+ ]
+ },
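+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "To illustrate what clipping by gradient norm does, here is a minimal plain-Python sketch. It assumes the gradients are given as nested lists of floats and rescales them so that their global L2 norm does not exceed `max_norm`. The function name `clip_by_global_norm` and the `eps` constant are choices made for this illustration; the sketch mirrors the idea behind `gradient_clip_val` and `torch.nn.utils.clip_grad_norm_`, not their actual implementation.\n",
+ "\n",
+ "```python\n",
+ "import math\n",
+ "\n",
+ "\n",
+ "def clip_by_global_norm(grads, max_norm, eps=1e-6):\n",
+ "    # Global L2 norm over all gradient entries of all parameters\n",
+ "    total_norm = math.sqrt(sum(g * g for grad in grads for g in grad))\n",
+ "    # The scale factor is capped at 1 so that small gradients are left untouched\n",
+ "    scale = min(1.0, max_norm / (total_norm + eps))\n",
+ "    return [[g * scale for g in grad] for grad in grads], total_norm\n",
+ "\n",
+ "\n",
+ "# Two hypothetical parameter gradients with global norm sqrt(9 + 16 + 144) = 13\n",
+ "grads = [[3.0, 4.0], [12.0]]\n",
+ "clipped, norm_before = clip_by_global_norm(grads, max_norm=5.0)\n",
+ "norm_after = math.sqrt(sum(g * g for grad in clipped for g in grad))\n",
+ "print(norm_before, norm_after)\n",
+ "```\n",
+ "\n",
+ "Because all gradients are scaled by the same factor, the direction of the update is preserved and only its length is limited."
+ ]
+ },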
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "Finally, we can train the model. In this setup, we will use a single encoder block and a single head in the Multi-Head Attention. This is chosen because of the simplicity of the task, and in this case, the attention can actually be interpreted as an \"explanation\" of the predictions (in contrast to the papers mentioned above, which deal with deep Transformers)."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "def scaled_dot_product(q, k, v, mask=None):\n",
+ "    d_k = q.size()[-1]\n",
+ "    attn_logits = torch.matmul(q, k.transpose(-2, -1))\n",
+ "    attn_logits = attn_logits / math.sqrt(d_k)\n",
+ "    if mask is not None:\n",
+ "        attn_logits = attn_logits.masked_fill(mask == 0, -9e15)\n",
+ "    attention = F.softmax(attn_logits, dim=-1)\n",
+ "    values = torch.matmul(attention, v)\n",
+ "    return values, attention\n",
+ "\n",
+ "class MultiheadAttention(nn.Module):\n",
+ "\n",
+ "    def __init__(self, input_dim, embed_dim, num_heads):\n",
+ "        super().__init__()\n",
+ "        assert embed_dim % num_heads == 0, \"Embedding dimension must be 0 modulo number of heads.\"\n",
+ "\n",
+ "        self.embed_dim = embed_dim\n",
+ "        self.num_heads = num_heads\n",
+ "        self.head_dim = embed_dim // num_heads\n",
+ "\n",
+ "        # Stack all weight matrices 1...h together for efficiency\n",
+ "        # Note that in many implementations you see \"bias=False\" which is optional\n",
+ "        self.qkv_proj = nn.Linear(input_dim, 3*embed_dim)\n",
+ "        self.o_proj = nn.Linear(embed_dim, embed_dim)\n",
+ "\n",
+ "        self._reset_parameters()\n",
+ "\n",
+ "    def _reset_parameters(self):\n",
+ "        # Original Transformer initialization, see PyTorch documentation\n",
+ "        nn.init.xavier_uniform_(self.qkv_proj.weight)\n",
+ "        self.qkv_proj.bias.data.fill_(0)\n",
+ "        nn.init.xavier_uniform_(self.o_proj.weight)\n",
+ "        self.o_proj.bias.data.fill_(0)\n",
+ "\n",
+ "    def forward(self, x, mask=None, return_attention=False):\n",
+ "        batch_size, seq_length, _ = x.size()\n",
+ "        if mask is not None:\n",
+ "            mask = expand_mask(mask)\n",
+ "        qkv = self.qkv_proj(x)\n",
+ "\n",
+ "        # Separate Q, K, V from linear output\n",
+ "        qkv = qkv.reshape(batch_size, seq_length, self.num_heads, 3*self.head_dim)\n",
+ "        qkv = qkv.permute(0, 2, 1, 3) # [Batch, Head, SeqLen, Dims]\n",
+ "        q, k, v = qkv.chunk(3, dim=-1)\n",
+ "\n",
+ "        # Determine value outputs\n",
+ "        values, attention = scaled_dot_product(q, k, v, mask=mask)\n",
+ "        values = values.permute(0, 2, 1, 3) # [Batch, SeqLen, Head, Dims]\n",
+ "        values = values.reshape(batch_size, seq_length, self.embed_dim)\n",
+ "        o = self.o_proj(values)\n",
+ "\n",
+ "        if return_attention:\n",
+ "            return o, attention\n",
+ "        else:\n",
+ "            return o"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 19,
+ "metadata": {},
+ "outputs": [
+ {
+ "name": "stderr",
+ "output_type": "stream",
+ "text": [
+ "GPU available: True, used: True\n",
+ "TPU available: False, using: 0 TPU cores\n",
+ "LOCAL_RANK: 0 - CUDA_VISIBLE_DEVICES: [0]\n"
+ ]
+ },
+ {
+ "name": "stdout",
+ "output_type": "stream",
+ "text": [
+ "Found pretrained model, loading...\n"
+ ]
+ }
+ ],
+ "source": [
+ "reverse_model, reverse_result = train_reverse(input_dim=train_loader.dataset.num_categories,\n",
+ " model_dim=32,\n",
+ " num_heads=1,\n",
+ " num_classes=train_loader.dataset.num_categories,\n",
+ " num_layers=1,\n",
+ " dropout=0.0,\n",
+ " lr=5e-4,\n",
+ " warmup=50)"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "The warning of PyTorch Lightning regarding the number of workers can be ignored for now. As the dataset is so simple and `__getitem__` finishes in a negligible amount of time, we don't need subprocesses to provide us with the data (in fact, more workers can slow down the training as we have communication overhead among processes/threads). First, let's print the results:"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 20,
+ "metadata": {},
+ "outputs": [
+ {
+ "name": "stdout",
+ "output_type": "stream",
+ "text": [
+ "Val accuracy: 100.00%\n",
+ "Test accuracy: 100.00%\n"
+ ]
+ }
+ ],
+ "source": [
+ "print(f\"Val accuracy: {(100.0 * reverse_result['val_acc']):4.2f}%\")\n",
+ "print(f\"Test accuracy: {(100.0 * reverse_result['test_acc']):4.2f}%\")"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "As we would have expected, the Transformer can correctly solve the task. However, what does the attention in the Multi-Head Attention block look like for an arbitrary input? Let's try to visualize it below."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 21,
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "data_input, labels = next(iter(val_loader))\n",
+ "inp_data = F.one_hot(data_input, num_classes=reverse_model.hparams.num_classes).float()\n",
+ "inp_data = inp_data.to(device)\n",
+ "attention_maps = reverse_model.get_attention_maps(inp_data)"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "The object `attention_maps` is a list of length $N$ where $N$ is the number of layers. Each element is a tensor of shape [Batch, Heads, SeqLen, SeqLen], which we can verify below."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 22,
+ "metadata": {},
+ "outputs": [
+ {
+ "data": {
+ "text/plain": [
+ "torch.Size([128, 1, 16, 16])"
+ ]
+ },
+ "execution_count": 22,
+ "metadata": {},
+ "output_type": "execute_result"
+ }
+ ],
+ "source": [
+ "attention_maps[0].shape"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "Next, we will write a plotting function that takes as input the sequences, attention maps, and an index indicating for which batch element we want to visualize the attention map. We will create a plot where over rows, we have different layers, while over columns, we show the different heads. Remember that the softmax has been applied for each row separately."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 23,
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "def plot_attention_maps(input_data, attn_maps, idx=0):\n",
+ "    if input_data is not None:\n",
+ "        input_data = input_data[idx].detach().cpu().numpy()\n",
+ "    else:\n",
+ "        input_data = np.arange(attn_maps[0][idx].shape[-1])\n",
+ "    attn_maps = [m[idx].detach().cpu().numpy() for m in attn_maps]\n",
+ "\n",
+ "    num_heads = attn_maps[0].shape[0]\n",
+ "    num_layers = len(attn_maps)\n",
+ "    seq_len = input_data.shape[0]\n",
+ "    fig_size = 4 if num_heads == 1 else 3\n",
+ "    fig, ax = plt.subplots(num_layers, num_heads, figsize=(num_heads*fig_size, num_layers*fig_size))\n",
+ "    if num_layers == 1:\n",
+ "        ax = [ax]\n",
+ "    if num_heads == 1:\n",
+ "        ax = [[a] for a in ax]\n",
+ "    for row in range(num_layers):\n",
+ "        for column in range(num_heads):\n",
+ "            ax[row][column].imshow(attn_maps[row][column], origin='lower', vmin=0)\n",
+ "            ax[row][column].set_xticks(list(range(seq_len)))\n",
+ "            ax[row][column].set_xticklabels(input_data.tolist())\n",
+ "            ax[row][column].set_yticks(list(range(seq_len)))\n",
+ "            ax[row][column].set_yticklabels(input_data.tolist())\n",
+ "            ax[row][column].set_title(f\"Layer {row+1}, Head {column+1}\")\n",
+ "    fig.subplots_adjust(hspace=0.5)\n",
+ "    plt.show()"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "Finally, we can plot the attention map of our trained Transformer on the reverse task:"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 24,
+ "metadata": {},
+ "outputs": [
+ {
+ "data": {
+ "application/pdf": "\n",
+ "image/svg+xml": [
+ "\n",
+ "<!-- Plot output omitted: attention map of the reverse task, panel \"Layer 1, Head 1\"; Matplotlib v3.3.2, 2020-11-09T10:43:26.716937 -->\n"
+ ],
+ "text/plain": [
+ ""
+ ]
+ },
+ "metadata": {
+ "needs_background": "light"
+ },
+ "output_type": "display_data"
+ }
+ ],
+ "source": [
+ "plot_attention_maps(data_input, attention_maps, idx=0)"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "The model has learned to attend to the token at the flipped index of each position. Hence, it actually does what we intended it to do. However, we see that it also pays some attention to values close to the flipped index. This is because the model doesn't need perfect, hard attention to solve this problem, but is fine with this approximate, noisy attention map. The attention on nearby indices is caused by the similarity of their positional encodings, which is a property we intended the positional encoding to have."
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "### Set Anomaly Detection\n",
+ "\n",
+ "Besides sequences, sets are another data structure that is relevant for many applications. In contrast to sequences, elements are unordered in a set. RNNs can only be applied on sets by assuming an order in the data, which however biases the model towards a non-existing order in the data. [Vinyals et al. (2015)](https://arxiv.org/abs/1511.06391) and other papers have shown that the assumed order can have a significant impact on the model's performance, and hence, we should try to not use RNNs on sets. Ideally, our model should be permutation-equivariant/invariant such that the output is the same no matter how we sort the elements in a set. \n",
+ "\n",
+ "Transformers offer the perfect architecture for this as the Multi-Head Attention is permutation-equivariant, and thus, outputs the same values no matter in what order we enter the inputs (inputs and outputs are permuted equally). The task we are looking at for sets is _Set Anomaly Detection_, which means that we try to find the element(s) in a set that do not fit the others. In the research community, the common application of anomaly detection is performed on a set of images, where $N-1$ images belong to the same category/have the same high-level features while one belongs to another category. Note that category does not necessarily have to relate to a class in a standard classification problem, but could be the combination of multiple features. For instance, on a face dataset, this could be people with glasses, male, beard, etc. An example of distinguishing different animals can be seen below. The first four images show foxes, while the last represents a different animal. We want to recognize that the last image shows a different animal, but it is not relevant which class of animal it is.\n",
+ "\n",
+ " \n",
+ "\n",
+ ":::{figure} ../image/cifar100_example_anomaly.png\n",
+ ":::\n",
+ "\n",
+ "In this tutorial, we will use the CIFAR100 dataset. CIFAR100 has 600 images for each of its 100 classes, at a resolution of 32x32, similar to CIFAR10. The larger number of classes requires the model to attend to specific features in the images instead of coarse features as in CIFAR10, therefore making the task harder. We will show the model a set of 9 images of one class, and 1 image from another class. The task is to find the image that is from a different class than the other images.\n",
+ "Using the raw images directly as input to the Transformer is not a good idea, because, unlike a CNN, it is not translation-invariant and would first need to learn to detect image features from high-dimensional input. Instead, we will use a pre-trained ResNet34 model from the torchvision package to obtain high-level, low-dimensional features of the images. The ResNet model has been pre-trained on the [ImageNet](http://image-net.org/) dataset, which contains about 1 million images from 1k classes at varying resolutions. However, during training and testing, the images are usually scaled to a resolution of 224x224, and hence we rescale our CIFAR images to this resolution as well. Below, we will load the dataset and prepare the data for being processed by the ResNet model."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 25,
+ "metadata": {},
+ "outputs": [
+ {
+ "name": "stdout",
+ "output_type": "stream",
+ "text": [
+ "Files already downloaded and verified\n",
+ "Files already downloaded and verified\n"
+ ]
+ }
+ ],
+ "source": [
+ "# ImageNet statistics\n",
+ "DATA_MEANS = np.array([0.485, 0.456, 0.406])\n",
+ "DATA_STD = np.array([0.229, 0.224, 0.225])\n",
+ "# As torch tensors for later preprocessing\n",
+ "TORCH_DATA_MEANS = torch.from_numpy(DATA_MEANS).view(1,3,1,1)\n",
+ "TORCH_DATA_STD = torch.from_numpy(DATA_STD).view(1,3,1,1)\n",
+ "\n",
+ "# Resize to 224x224, and normalize to ImageNet statistic\n",
+ "transform = transforms.Compose([transforms.Resize((224,224)),\n",
+ "                                transforms.ToTensor(),\n",
+ "                                transforms.Normalize(DATA_MEANS, DATA_STD)\n",
+ "                                ])\n",
+ "# Loading the training dataset. \n",
+ "train_set = CIFAR100(root=DATASET_PATH, train=True, transform=transform, download=True)\n",
+ "\n",
+ "# Loading the test set\n",
+ "test_set = CIFAR100(root=DATASET_PATH, train=False, transform=transform, download=True)"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "Next, we want to run the pre-trained ResNet model on the images, and extract the features before the classification layer. These are the most high-level features, and should sufficiently describe the images. CIFAR100 has some similarity to ImageNet, and thus we are not retraining the ResNet model in any form. However, if you would want to get the best performance and have a very large dataset, it would be better to add the ResNet to the computation graph during training and finetune its parameters as well. As we don't have a large enough dataset and want to train our model efficiently, we will extract the features beforehand. Let's load and prepare the model below."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 26,
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "import os\n",
+ "os.environ[\"TORCH_HOME\"] = CHECKPOINT_PATH\n",
+ "pretrained_model = torchvision.models.resnet34(weights='IMAGENET1K_V1')\n",
+ "# Remove classification layer\n",
+ "# In some models, it is called \"fc\", others have \"classifier\"\n",
+ "# Setting both to an empty sequential represents an identity map of the final features.\n",
+ "pretrained_model.fc = nn.Sequential()\n",
+ "pretrained_model.classifier = nn.Sequential()\n",
+ "# To GPU\n",
+ "pretrained_model = pretrained_model.to(device)\n",
+ "\n",
+ "# Only eval, no gradient required\n",
+ "pretrained_model.eval()\n",
+ "for p in pretrained_model.parameters():\n",
+ "    p.requires_grad = False"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "We will now write an extraction function for the features below. This cell requires access to a GPU, as the model is rather deep and the images relatively large. The GPUs on Google Colab are sufficient, but running this cell can take 2-3 minutes. Once it is run, the features are saved to disk so they don't have to be recalculated every time you run the notebook. However, this requires >150MB free disk space. So it is recommended to run this on a local computer only if you have enough free disk space and a GPU (Google Colab is also fine for this). If you do not have a GPU, you can download the features from the [GoogleDrive folder](https://drive.google.com/drive/folders/1DF7POc6j03pRiWQPWSl5QJX5iY-xK0sV?usp=sharing)."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 27,
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "@torch.no_grad()\n",
+ "def extract_features(dataset, save_file):\n",
+ "    if not os.path.isfile(save_file):\n",
+ "        data_loader = data.DataLoader(dataset, batch_size=128, shuffle=False, drop_last=False, num_workers=4)\n",
+ "        extracted_features = []\n",
+ "        for imgs, _ in tqdm(data_loader):\n",
+ "            imgs = imgs.to(device)\n",
+ "            feats = pretrained_model(imgs)\n",
+ "            extracted_features.append(feats)\n",
+ "        extracted_features = torch.cat(extracted_features, dim=0)\n",
+ "        extracted_features = extracted_features.detach().cpu()\n",
+ "        torch.save(extracted_features, save_file)\n",
+ "    else:\n",
+ "        extracted_features = torch.load(save_file)\n",
+ "    return extracted_features\n",
+ "\n",
+ "train_feat_file = os.path.join(CHECKPOINT_PATH, \"train_set_features.tar\")\n",
+ "train_set_feats = extract_features(train_set, train_feat_file)\n",
+ "\n",
+ "test_feat_file = os.path.join(CHECKPOINT_PATH, \"test_set_features.tar\")\n",
+ "test_feats = extract_features(test_set, test_feat_file)"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "Let's verify the feature shapes below. The training set should have 50k elements, and the test set 10k. The feature dimension is 512 for the ResNet34. If you experiment with other models, you will likely see a different feature dimension."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 28,
+ "metadata": {},
+ "outputs": [
+ {
+ "name": "stdout",
+ "output_type": "stream",
+ "text": [
+ "Train: torch.Size([50000, 512])\n",
+ "Test: torch.Size([10000, 512])\n"
+ ]
+ }
+ ],
+ "source": [
+ "print(\"Train:\", train_set_feats.shape)\n",
+ "print(\"Test: \", test_feats.shape)"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "As usual, we want to create a validation set to detect when we should stop training. In this case, we will split the training set into 90% training, 10% validation. The difficulty here is that we need to ensure that the validation set has the same number of images for all 100 labels. Otherwise, we have a class imbalance which is not good for creating the image sets. Hence, we take 10% of the images for each class, and move them into the validation set. The code below does exactly this."
+ ]
+ },
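+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "As a quick sanity check of the split sizes, assuming the standard CIFAR100 training set of 50,000 images with 500 images per class, the integer division by 10 used for the split gives:\n",
+ "\n",
+ "$$\\frac{50{,}000}{100} = 500 \\text{ images per class}, \\qquad \\left\\lfloor \\frac{500}{10} \\right\\rfloor = 50 \\text{ validation images per class},$$\n",
+ "\n",
+ "$$100 \\times 50 = 5{,}000 \\text{ validation images}, \\qquad 100 \\times 450 = 45{,}000 \\text{ training images}.$$\n",
+ "\n",
+ "If you change the split ratio or use another dataset, these numbers change accordingly."
+ ]
+ },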
+ {
+ "cell_type": "code",
+ "execution_count": 29,
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "## Split train into train+val\n",
+ "# Get labels from train set\n",
+ "labels = train_set.targets\n",
+ "\n",
+ "# Get indices of images per class\n",
+ "labels = torch.LongTensor(labels)\n",
+ "num_labels = labels.max()+1\n",
+ "sorted_indices = torch.argsort(labels).reshape(num_labels, -1) # [classes, num_imgs per class]\n",
+ "\n",
+ "# Determine number of validation images per class\n",
+ "num_val_exmps = sorted_indices.shape[1] // 10\n",
+ "\n",
+ "# Get image indices for validation and training\n",
+ "val_indices = sorted_indices[:,:num_val_exmps].reshape(-1)\n",
+ "train_indices = sorted_indices[:,num_val_exmps:].reshape(-1)\n",
+ "\n",
+ "# Group corresponding image features and labels\n",
+ "train_feats, train_labels = train_set_feats[train_indices], labels[train_indices]\n",
+ "val_feats, val_labels = train_set_feats[val_indices], labels[val_indices]"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "Now we can prepare a dataset class for the set anomaly task. We define an epoch to be the sequence in which each image has been used exactly once as an \"anomaly\". Hence, the length of the dataset is the number of images in it. For the training set, each time we access an item with `__getitem__`, we sample a random class that differs from the class of the image at the corresponding index `idx`. In a second step, we sample $N-1$ images of this sampled class. The set of 10 images is finally returned. The randomness in `__getitem__` allows us to see a slightly different set during each iteration. However, we can't use the same strategy for the test set as we want the test dataset to be the same every time we iterate over it. Hence, we sample the sets in the `__init__` method, and return those in `__getitem__`. The code below implements exactly this dynamic."
+ ]
+ },
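+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "As a short check of the class-sampling step used in `sample_img_set` in the next cell (the symbols $C$, $a$, $s$ and $s'$ are notation introduced here for illustration, not names from the code): let $C$ be the number of classes and $a$ the label of the anomaly. We draw $s$ uniformly from $\\lbrace 0, \\dots, C-2 \\rbrace$ and shift it past $a$:\n",
+ "\n",
+ "$$s' = \\begin{cases} s & \\text{if } s < a \\\\ s + 1 & \\text{if } s \\ge a \\end{cases}$$\n",
+ "\n",
+ "Each value in $\\lbrace 0, \\dots, C-1 \\rbrace \\setminus \\lbrace a \\rbrace$ is then produced by exactly one value of $s$, so $s'$ is uniform over the $C-1$ remaining classes and never equals $a$. For $C=100$, each of the other classes is chosen with probability $1/99$."
+ ]
+ },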
+ {
+ "cell_type": "code",
+ "execution_count": 30,
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "class SetAnomalyDataset(data.Dataset):\n",
+ "\n",
+ "    def __init__(self, img_feats, labels, set_size=10, train=True):\n",
+ "        \"\"\"\n",
+ "        Inputs:\n",
+ "            img_feats - Tensor of shape [num_imgs, img_dim]. Represents the high-level features.\n",
+ "            labels - Tensor of shape [num_imgs], containing the class labels for the images\n",
+ "            set_size - Number of elements in a set. N-1 are sampled from one class, and one from another one.\n",
+ "            train - If True, a new set will be sampled every time __getitem__ is called.\n",
+ "        \"\"\"\n",
+ "        super().__init__()\n",
+ "        self.img_feats = img_feats\n",
+ "        self.labels = labels\n",
+ "        self.set_size = set_size-1 # The set size is here the size of correct images\n",
+ "        self.train = train\n",
+ "\n",
+ "        # Tensors with indices of the images per class\n",
+ "        self.num_labels = labels.max()+1\n",
+ "        self.img_idx_by_label = torch.argsort(self.labels).reshape(self.num_labels, -1)\n",
+ "\n",
+ "        if not train:\n",
+ "            self.test_sets = self._create_test_sets()\n",
+ "\n",
+ "\n",
+ "    def _create_test_sets(self):\n",
+ "        # Pre-generates the sets for each image for the test set\n",
+ "        test_sets = []\n",
+ "        num_imgs = self.img_feats.shape[0]\n",
+ "        np.random.seed(42)\n",
+ "        test_sets = [self.sample_img_set(self.labels[idx]) for idx in range(num_imgs)]\n",
+ "        test_sets = torch.stack(test_sets, dim=0)\n",
+ "        return test_sets\n",
+ "\n",
+ "\n",
+ "    def sample_img_set(self, anomaly_label):\n",
+ "        \"\"\"\n",
+ "        Samples a new set of images, given the label of the anomaly. \n",
+ "        The sampled images come from a different class than anomaly_label\n",
+ "        \"\"\"\n",
+ "        # Sample class from 0,...,num_classes-1 while skipping anomaly_label as class\n",
+ "        set_label = np.random.randint(self.num_labels-1)\n",
+ "        if set_label >= anomaly_label:\n",
+ "            set_label += 1\n",
+ "\n",
+ "        # Sample images from the class determined above\n",
+ "        img_indices = np.random.choice(self.img_idx_by_label.shape[1], size=self.set_size, replace=False)\n",
+ "        img_indices = self.img_idx_by_label[set_label, img_indices]\n",
+ "        return img_indices\n",
+ "\n",
+ "\n",
+ "    def __len__(self):\n",
+ "        return self.img_feats.shape[0]\n",
+ "\n",
+ "\n",
+ "    def __getitem__(self, idx):\n",
+ "        anomaly = self.img_feats[idx]\n",
+ "        if self.train: # If train => sample\n",
+ "            img_indices = self.sample_img_set(self.labels[idx])\n",
+ "        else: # If test => use pre-generated ones\n",
+ "            img_indices = self.test_sets[idx]\n",
+ "\n",
+ "        # Concatenate images. The anomaly is always the last image for simplicity\n",
+ "        img_set = torch.cat([self.img_feats[img_indices], anomaly[None]], dim=0)\n",
+ "        indices = torch.cat([img_indices, torch.LongTensor([idx])], dim=0)\n",
+ "        label = img_set.shape[0]-1\n",
+ "\n",
+ "        # We return the indices of the images for visualization purpose. \"Label\" is the index of the anomaly\n",
+ "        return img_set, indices, label"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "Next, we can set up our datasets and data loaders below. Here, we will use a set size of 10, i.e. 9 images from one category + 1 anomaly. Feel free to change it if you want to experiment with the sizes. "
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 31,
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "SET_SIZE = 10\n",
+ "test_labels = torch.LongTensor(test_set.targets)\n",
+ "\n",
+ "train_anom_dataset = SetAnomalyDataset(train_feats, train_labels, set_size=SET_SIZE, train=True)\n",
+ "val_anom_dataset = SetAnomalyDataset(val_feats, val_labels, set_size=SET_SIZE, train=False)\n",
+ "test_anom_dataset = SetAnomalyDataset(test_feats, test_labels, set_size=SET_SIZE, train=False)\n",
+ "\n",
+ "train_anom_loader = data.DataLoader(train_anom_dataset, batch_size=64, shuffle=True, drop_last=True, num_workers=4, pin_memory=True)\n",
+ "val_anom_loader = data.DataLoader(val_anom_dataset, batch_size=64, shuffle=False, drop_last=False, num_workers=4)\n",
+ "test_anom_loader = data.DataLoader(test_anom_dataset, batch_size=64, shuffle=False, drop_last=False, num_workers=4)"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "To understand the dataset a little better, we can plot a few sets from the test dataset below. Each row shows a different input set, where the first 9 images are from the same class and the last one is the anomaly."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 32,
+ "metadata": {},
+ "outputs": [
+ {
+ "data": {
+ "application/pdf": "\n",
+ "image/svg+xml": [
+ ],
+ "text/plain": [
+ ""
+ ]
+ },
+ "metadata": {
+ "needs_background": "light"
+ },
+ "output_type": "display_data"
+ }
+ ],
+ "source": [
+ "def visualize_exmp(indices, orig_dataset):\n",
+ "    images = [orig_dataset[idx][0] for idx in indices.reshape(-1)]\n",
+ "    images = torch.stack(images, dim=0)\n",
+ "    images = images * TORCH_DATA_STD + TORCH_DATA_MEANS\n",
+ "\n",
+ "    img_grid = torchvision.utils.make_grid(images, nrow=SET_SIZE, normalize=True, pad_value=0.5, padding=16)\n",
+ "    img_grid = img_grid.permute(1, 2, 0)\n",
+ "\n",
+ "    plt.figure(figsize=(12,8))\n",
+ "    plt.title(\"Anomaly examples on CIFAR100\")\n",
+ "    plt.imshow(img_grid)\n",
+ "    plt.axis('off')\n",
+ "    plt.show()\n",
+ "    plt.close()\n",
+ "\n",
+ "_, indices, _ = next(iter(test_anom_loader))\n",
+ "visualize_exmp(indices[:4], test_set)"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "We can already see that for some sets the task might be easier than for others. Difficulties can especially arise if the anomaly is in a different, but yet visually similar class (e.g. train vs bus, flour vs worm, etc.).\n",
+ "\n",
+ "After having prepared the data, we can look closer at the model. Here, we classify the whole set: the model has to decide which of its elements is the anomaly. For the prediction to be permutation-equivariant, we will output one logit for each image. Over these logits, we apply a softmax and train the anomaly image to have the highest score/probability. This is a bit different from a standard classification layer as the softmax is applied over images, not over output classes in the classical sense. However, if we swap two images in their position, we effectively swap their position in the output softmax. Hence, the prediction is equivariant with respect to the input. We implement this idea below in the subclass of the Transformer Lightning module."
+ ]
+ },
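+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "As a brief formal sketch of this argument (the symbols $X$, $f$, $P$, $z$, $p$ and $a$ are notation introduced here for illustration, not names used in the code): let $X \\in \\mathbb{R}^{N \\times d}$ hold the $N$ image features of one set, let $z = f(X) \\in \\mathbb{R}^{N}$ be the logits the model outputs, and let $P$ be any $N \\times N$ permutation matrix. Without positional encodings and in evaluation mode (where dropout is disabled), we expect\n",
+ "\n",
+ "$$f(PX) = P\\,f(X), \\qquad p_i = \\frac{\\exp(z_i)}{\\sum_{j=1}^{N} \\exp(z_j)}, \\qquad \\mathcal{L} = -\\log p_a,$$\n",
+ "\n",
+ "where $a$ is the index of the anomaly in the set. Since the softmax denominator sums over all $N$ elements and is therefore unaffected by their order, permuting the inputs permutes the probabilities $p$ in the same way, and the loss stays the same as long as the label $a$ is permuted along with the inputs."
+ ]
+ },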
+ {
+ "cell_type": "code",
+ "execution_count": 33,
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "class AnomalyPredictor(TransformerPredictor):\n",
+ "\n",
+ "    def _calculate_loss(self, batch, mode=\"train\"):\n",
+ "        img_sets, _, labels = batch\n",
+ "        preds = self.forward(img_sets, add_positional_encoding=False) # No positional encodings as it is a set, not a sequence!\n",
+ "        preds = preds.squeeze(dim=-1) # Shape: [Batch_size, set_size]\n",
+ "        loss = F.cross_entropy(preds, labels) # Softmax/CE over set dimension\n",
+ "        acc = (preds.argmax(dim=-1) == labels).float().mean()\n",
+ "        self.log(f\"{mode}_loss\", loss)\n",
+ "        self.log(f\"{mode}_acc\", acc, on_step=False, on_epoch=True)\n",
+ "        return loss, acc\n",
+ "\n",
+ "    def training_step(self, batch, batch_idx):\n",
+ "        loss, _ = self._calculate_loss(batch, mode=\"train\")\n",
+ "        return loss\n",
+ "\n",
+ "    def validation_step(self, batch, batch_idx):\n",
+ "        _ = self._calculate_loss(batch, mode=\"val\")\n",
+ "\n",
+ "    def test_step(self, batch, batch_idx):\n",
+ "        _ = self._calculate_loss(batch, mode=\"test\")"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "Finally, we write our train function below. It has the exact same structure as the reverse task one, hence not much of an explanation is needed here."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 34,
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "def train_anomaly(**kwargs):\n",
+ "    # Create a PyTorch Lightning trainer with a checkpoint callback that keeps the model with the best validation accuracy\n",
+ "    root_dir = os.path.join(CHECKPOINT_PATH, \"SetAnomalyTask\")\n",
+ "    os.makedirs(root_dir, exist_ok=True)\n",
+ "    trainer = pl.Trainer(default_root_dir=root_dir, \n",
+ "                         callbacks=[ModelCheckpoint(save_weights_only=True, mode=\"max\", monitor=\"val_acc\")],\n",
+ "                         accelerator=\"gpu\" if str(device).startswith(\"cuda\") else \"cpu\",\n",
+ "                         devices=1,\n",
+ "                         max_epochs=100,\n",
+ "                         gradient_clip_val=2)\n",
+ "    trainer.logger._default_hp_metric = None # Optional logging argument that we don't need\n",
+ "\n",
+ "    # Check whether pretrained model exists. If yes, load it and skip training\n",
+ "    pretrained_filename = os.path.join(CHECKPOINT_PATH, \"SetAnomalyTask.ckpt\")\n",
+ "    if os.path.isfile(pretrained_filename):\n",
+ "        print(\"Found pretrained model, loading...\")\n",
+ "        model = AnomalyPredictor.load_from_checkpoint(pretrained_filename)\n",
+ "    else:\n",
+ "        model = AnomalyPredictor(max_iters=trainer.max_epochs*len(train_anom_loader), **kwargs)\n",
+ "        trainer.fit(model, train_anom_loader, val_anom_loader)\n",
+ "        model = AnomalyPredictor.load_from_checkpoint(trainer.checkpoint_callback.best_model_path)\n",
+ "\n",
+ "    # Test best model on training, validation and test set\n",
+ "    train_result = trainer.test(model, train_anom_loader, verbose=False)\n",
+ "    val_result = trainer.test(model, val_anom_loader, verbose=False)\n",
+ "    test_result = trainer.test(model, test_anom_loader, verbose=False)\n",
+ "    result = {\"test_acc\": test_result[0][\"test_acc\"], \"val_acc\": val_result[0][\"test_acc\"], \"train_acc\": train_result[0][\"test_acc\"]}\n",
+ "\n",
+ "    model = model.to(device)\n",
+ "    return model, result"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "Let's finally train our model. We will use 4 layers with 4 attention heads each. The hidden dimensionality of the model is 256, and we use a dropout of 0.1 throughout the model for regularization. Note that we also apply the dropout on the input features, as this makes the model more robust against image noise and helps it generalize better. Again, we use warmup to slowly start our model training. "
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 35,
+ "metadata": {},
+ "outputs": [
+ {
+ "name": "stderr",
+ "output_type": "stream",
+ "text": [
+ "GPU available: True, used: True\n",
+ "WARNING: Logging before flag parsing goes to stderr.\n",
+ "I1109 10:43:31.036801 139648634296128 distributed.py:49] GPU available: True, used: True\n",
+ "TPU available: False, using: 0 TPU cores\n",
+ "I1109 10:43:31.038146 139648634296128 distributed.py:49] TPU available: False, using: 0 TPU cores\n",
+ "LOCAL_RANK: 0 - CUDA_VISIBLE_DEVICES: [0]\n",
+ "I1109 10:43:31.039162 139648634296128 accelerator_connector.py:385] LOCAL_RANK: 0 - CUDA_VISIBLE_DEVICES: [0]\n"
+ ]
+ }
+ ],
+ "source": [
+ "anomaly_model, anomaly_result = train_anomaly(input_dim=train_anom_dataset.img_feats.shape[-1],\n",
+ "                                              model_dim=256,\n",
+ "                                              num_heads=4,\n",
+ "                                              num_classes=1,\n",
+ "                                              num_layers=4,\n",
+ "                                              dropout=0.1,\n",
+ "                                              input_dropout=0.1,\n",
+ "                                              lr=5e-4,\n",
+ "                                              warmup=100)"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "We can print the achieved accuracy below."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 36,
+ "metadata": {},
+ "outputs": [
+ {
+ "name": "stdout",
+ "output_type": "stream",
+ "text": [
+ "Train accuracy: 97.77%\n",
+ "Val accuracy: 94.38%\n",
+ "Test accuracy: 94.30%\n"
+ ]
+ }
+ ],
+ "source": [
+ "print(f\"Train accuracy: {(100.0*anomaly_result['train_acc']):4.2f}%\")\n",
+ "print(f\"Val accuracy: {(100.0*anomaly_result['val_acc']):4.2f}%\")\n",
+ "print(f\"Test accuracy: {(100.0*anomaly_result['test_acc']):4.2f}%\")"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "With ~94% validation and test accuracy, the model generalizes quite well. It should be noted that you might see slightly different scores depending on which computer/device you run this notebook on. This is because, despite setting the seed before generating the test dataset, the sampled sets are not the same across platforms and numpy versions. Nevertheless, we can conclude that the model performs quite well and can solve the task for most sets. Before trying to interpret the model, let's verify that our model is permutation-equivariant, and assigns the same predictions for different permutations of the input set. For this, we sample a batch from the test set and run it through the model to obtain the probabilities. "
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 37,
+ "metadata": {},
+ "outputs": [
+ {
+ "name": "stdout",
+ "output_type": "stream",
+ "text": [
+ "Preds\n",
+ " [5.4543594e-05 1.4208173e-04 6.6922468e-05 7.6413504e-05 7.7112330e-05\n",
+ " 8.7848457e-05 6.6820685e-05 9.9929154e-01 7.3219831e-05 6.3545609e-05]\n",
+ "Permuted preds\n",
+ " [5.4543532e-05 1.4208158e-04 6.6922395e-05 7.6413417e-05 7.7112243e-05\n",
+ " 8.7848362e-05 6.6820678e-05 9.9929142e-01 7.3219751e-05 6.3545544e-05]\n"
+ ]
+ }
+ ],
+ "source": [
+ "inp_data, indices, labels = next(iter(test_anom_loader))\n",
+ "inp_data = inp_data.to(device)\n",
+ "\n",
+ "anomaly_model.eval()\n",
+ "\n",
+ "with torch.no_grad():\n",
+ "    preds = anomaly_model.forward(inp_data, add_positional_encoding=False)\n",
+ "    preds = F.softmax(preds.squeeze(dim=-1), dim=-1)\n",
+ "\n",
+ "    # Permute input data along the set dimension\n",
+ "    permut = np.random.permutation(inp_data.shape[1])\n",
+ "    perm_inp_data = inp_data[:,permut]\n",
+ "    perm_preds = anomaly_model.forward(perm_inp_data, add_positional_encoding=False)\n",
+ "    perm_preds = F.softmax(perm_preds.squeeze(dim=-1), dim=-1)\n",
+ "\n",
+ "assert (preds[:,permut] - perm_preds).abs().max() < 1e-5, \"Predictions are not permutation equivariant\"\n",
+ "\n",
+ "print(\"Preds\\n\", preds[0,permut].cpu().numpy())\n",
+ "print(\"Permuted preds\\n\", perm_preds[0].cpu().numpy())"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "You can see that the predictions are almost exactly the same, and only differ because of slight numerical differences inside the network operation.\n",
+ "\n",
+ "To interpret the model a little more, we can plot the attention maps inside the model. This will give us an idea of what information the model is sharing/communicating between images, and what each head might represent. First, we need to extract the attention maps for the test batch above, and determine the discrete predictions for simplicity."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 38,
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "attention_maps = anomaly_model.get_attention_maps(inp_data, add_positional_encoding=False)\n",
+ "predictions = preds.argmax(dim=-1)"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "Below we write a plot function which plots the images in the input set, the prediction of the model, and the attention maps of the different heads in each layer of the transformer. Feel free to explore the attention maps for different input examples as well."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 39,
+ "metadata": {},
+ "outputs": [
+ {
+ "data": {
+ "application/pdf": "\n",
+ "image/svg+xml": [
+ ],
+ "text/plain": [
+ ""
+ ]
+ },
+ "metadata": {
+ "needs_background": "light"
+ },
+ "output_type": "display_data"
+ },
+ {
+ "name": "stdout",
+ "output_type": "stream",
+ "text": [
+ "Prediction: 9\n"
+ ]
+ },
+ {
+ "data": {
+ "application/pdf": "\n",
+ "image/svg+xml": [
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n"
+ ],
+ "text/plain": [
+ ""
+ ]
+ },
+ "metadata": {
+ "needs_background": "light"
+ },
+ "output_type": "display_data"
+ }
+ ],
+ "source": [
+ "def visualize_prediction(idx):\n",
+ " visualize_exmp(indices[idx:idx+1], test_set)\n",
+ " print(\"Prediction:\", predictions[idx].item())\n",
+ " plot_attention_maps(input_data=None, attn_maps=attention_maps, idx=idx)\n",
+ "\n",
+ "visualize_prediction(0)"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "Depending on the random seed, you might see a slightly different input set. For the version on the website, we compare 9 tree images with a volcano. Several heads, for instance Layer 2 Head 1, Layer 2 Head 3, and Layer 3 Head 1, focus on the last image. In addition, all heads in Layer 4 seem to ignore the last image and assign it a very low attention probability. This suggests that the model has recognized that the image does not fit the set, and hence predicts it to be the anomaly. Heads 2-4 in Layer 3 seem to take a slightly weighted average over all images. This might indicate that the model extracts the \"average\" information of all images in order to compare it with the features of each individual image.\n",
+ "\n",
+ "Let's try to find where the model actually makes a mistake. We can do this by identifying the sets where the model predicts something other than 9, because we constructed the dataset so that the anomaly is always at the last position in the set."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 40,
+ "metadata": {},
+ "outputs": [
+ {
+ "name": "stdout",
+ "output_type": "stream",
+ "text": [
+ "Indices with mistake: [36 49]\n"
+ ]
+ }
+ ],
+ "source": [
+ "mistakes = torch.where(predictions != 9)[0].cpu().numpy()\n",
+ "print(\"Indices with mistake:\", mistakes)"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "As our model achieves ~94% accuracy, we see only a few mistakes in a batch of 64 sets. Still, let's visualize one of them, for example the last one:"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 41,
+ "metadata": {},
+ "outputs": [
+ {
+ "data": {
+ "application/pdf": "\n",
+ "image/svg+xml": [
+ " 2020-11-09T10:43:37.728184 \n",
+ " image/svg+xml \n",
+ " Matplotlib v3.3.2, https://matplotlib.org/ \n"
+ ],
+ "text/plain": [
+ ""
+ ]
+ },
+ "metadata": {
+ "needs_background": "light"
+ },
+ "output_type": "display_data"
+ },
+ {
+ "name": "stdout",
+ "output_type": "stream",
+ "text": [
+ "Prediction: 2\n"
+ ]
+ },
+ {
+ "data": {
+ "application/pdf": "\n",
+ "image/svg+xml": [
+ " 2020-11-09T10:43:38.742709 \n",
+ " image/svg+xml \n",
+ " Matplotlib v3.3.2, https://matplotlib.org/ \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " \n"
+ ],
+ "text/plain": [
+ ""
+ ]
+ },
+ "metadata": {
+ "needs_background": "light"
+ },
+ "output_type": "display_data"
+ },
+ {
+ "name": "stdout",
+ "output_type": "stream",
+ "text": [
+ "Probabilities:\n",
+ "Image 0: 0.06%\n",
+ "Image 1: 1.63%\n",
+ "Image 2: 89.63%\n",
+ "Image 3: 0.01%\n",
+ "Image 4: 0.01%\n",
+ "Image 5: 0.01%\n",
+ "Image 6: 0.01%\n",
+ "Image 7: 0.01%\n",
+ "Image 8: 0.01%\n",
+ "Image 9: 8.63%\n"
+ ]
+ }
+ ],
+ "source": [
+ "visualize_prediction(mistakes[-1])\n",
+ "print(\"Probabilities:\")\n",
+ "for i, p in enumerate(preds[mistakes[-1]].cpu().numpy()):\n",
+ " print(f\"Image {i}: {100.0*p:4.2f}%\")"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "In this example, the model confuses a palm tree with a building, assigning a probability of ~90% to image 2 and only ~9% to the actual anomaly. The difficulty here is that the picture of the building was taken at a similar angle to the palms, while image 2 shows a rather unusual palm with a different color palette, which is why the model fails. Nevertheless, the model performs quite well in general."
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "## Conclusion\n",
+ "\n",
+ "In this tutorial, we took a closer look at the Multi-Head Attention layer, which uses a scaled dot product between queries and keys to find correlations and similarities between input elements. The Transformer architecture is based on the Multi-Head Attention layer and applies several of them in ResNet-like blocks. The Transformer is an important, recent architecture that can be applied to many tasks and datasets. Although it is best known for its success in NLP, it is useful well beyond that: we have seen it applied to sequence-to-sequence tasks and to set anomaly detection. Because it is permutation-equivariant when no positional encodings are provided, it generalizes to many settings. Hence, it is important to know the architecture, but also its possible issues, such as the gradient problem during the first iterations, which is addressed by learning rate warm-up."
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "## Your turn! 🚀\n",
+ "You can practice your Transformer skills by following the assignment [complete the transformer architecture](../../assignments/llm/basic/transformer-architecture.ipynb)."
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "## Self study\n",
+ "\n",
+ "You can refer to the following blog posts for further study:\n",
+ "\n",
+ "* [Transformer: A Novel Neural Network Architecture for Language Understanding (Jakob Uszkoreit, 2017)](https://ai.googleblog.com/2017/08/transformer-novel-neural-network.html) - The original Google blog post about the Transformer paper, focusing on the application in machine translation.\n",
+ "* [The Illustrated Transformer (Jay Alammar, 2018)](http://jalammar.github.io/illustrated-transformer/) - A very popular and great blog post intuitively explaining the Transformer architecture with many nice visualizations. The focus is on NLP.\n",
+ "* [Attention? Attention! (Lilian Weng, 2018)](https://lilianweng.github.io/lil-log/2018/06/24/attention-attention.html) - A nice blog post summarizing attention mechanisms in many domains including vision.\n",
+ "* [Illustrated: Self-Attention (Raimi Karim, 2019)](https://towardsdatascience.com/illustrated-self-attention-2d627e33b20a) - A nice visualization of the steps of self-attention. Worth going through if the explanation in this chapter is too abstract for you.\n",
+ "* [The Transformer family (Lilian Weng, 2020)](https://lilianweng.github.io/lil-log/2020/04/07/the-transformer-family.html) - A very detailed blog post reviewing more variants of Transformers besides the original one."
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "### Research trend\n",
+ "\n",
+ "* Attention is all you need; Attentional Neural Network Models | Łukasz Kaiser | Masterclass (video)\n",
+ "* The Narrated Transformer Language Model (video)"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "## Acknowledgments\n",
+ "\n",
+ "Thanks to [Phillip Lippe](https://github.com/phlippe) for creating the open-source course [UvA DL Notebooks](https://github.com/phlippe/uvadlc_notebooks). It inspires the majority of the content in this chapter.\n"
+ ]
+ }
+ ],
+ "metadata": {
+ "kernelspec": {
+ "display_name": "Python 3 (ipykernel)",
+ "language": "python",
+ "name": "python3"
+ },
+ "language_info": {
+ "codemirror_mode": {
+ "name": "ipython",
+ "version": 3
+ },
+ "file_extension": ".py",
+ "mimetype": "text/x-python",
+ "name": "python",
+ "nbconvert_exporter": "python",
+ "pygments_lexer": "ipython3",
+ "version": "3.10.4"
+ }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 4
+}
diff --git a/open-machine-learning-jupyter-book/llm/image/attention_example.svg b/open-machine-learning-jupyter-book/llm/image/attention_example.svg
new file mode 100644
index 000000000..45fd2897a
--- /dev/null
+++ b/open-machine-learning-jupyter-book/llm/image/attention_example.svg
@@ -0,0 +1,9628 @@
+ image/svg+xml
diff --git a/open-machine-learning-jupyter-book/llm/image/cifar100_example_anomaly.png b/open-machine-learning-jupyter-book/llm/image/cifar100_example_anomaly.png
new file mode 100644
index 0000000000000000000000000000000000000000..7e06e5a5d24157731d1976feca48d9bcb492e50c
GIT binary patch
literal 155392
zcmcF~MNnPg((QqRySqd1;O_3ho!}na-QC?C4(<-Y-66O;!QGudckoZ{o4l%5uXcA2
zy1v?b@~!Ib)oVv6%1a``;lTj_07PjiF=YS%_|E~&Fi`)ZTHJI#0D$J|sjBI$Z0JT}
z?`UUcX>Cg4>|t+8V(M;b1^~FPRcBZ_l5;19e2vh&!mJP=5kOb2Zsf;pZ^N@RvQ+*u
z`#q)2B8`C?4cN%*`SzUc_c`P9*{3m9%s
z_i^!JzV`GZqSSfxsBUNfjuR=jRdDNiAJ5_Q^L@U%7WBwJ`xHV3oO=1t_eCOGiDrHj~4^toB62lHj5=xAnf=HV(TZ}=(vy;R#b&d>e2
zl~L^NZLjCs=R?1sMzXJ;;iCOlL(bJqSI4H8KnH%Y^6aa)s=KeMOw{3dUH^gC5D~g~w*tMx2&x_5)ix%!Ed2?^k(~_P->3Fe{)eW9SJit$
zyW5Ss=@`8l=iF=$o{yih-nHx7XVoihx3L77WK>a|fBimuKWnkSlCF-niqVF!>mRBy
z&tyaxCKQkEvx+RFI=8!5b#{oiw;#jw{foyK2ufHuIk@`YvyHUhfjTc2PKR9kSLU>P
z`)^+0GK0-ALrmb`Lh%}h@PbMYn3YE)f}rRW+CkFLg<5O$EX*YA{JA)|HB(-E-6NE;
z>?f9O5fV7_2lpjM!i_S_*Zai*1%|7U8EQ^eaW44m%gWVDr)m_Od|nQBrCxVb>w4&rto-hft4p!2DD#;1ULrehG5A`pJsbvc^XmE*Gs9`d^ao~Q
zxl&G+Q^{)ktx9KFu2l)LhNx?DJTs_TYt8rm(Q&z(DzkKd;z-&iRbctJal_+}RI0OQ
zN7Qx;i>%aHwz9b?m`zShn}D^6Ex=)^I0Bn`>NjLw^7y3cU;9s}Z
z@4h(A7w4`5AvDJwl^K7&R?NgR#ZGx3@
zlqUFUnZ{(7^?54X-}XS&4ZR>kEr+U9#wB*K*zF1=^=>AJkRKBnp29WbhS;AC%R
z%A_Y6=rUeR{00^Ib@C6D#2jz0?a24L1^FW`i3!)SR-V!lK9~vaf}nSnlIfv-_%|QP
z%*>1Sl12ASLre?tHq&X9-g#iv2>gkV;k?2(waGM)b!A}$w=2DAIh}h
zsHhED9RI4evHN`q&lRAen7An0d+1~oZ*l)M`neiKWzTm^$F7m;G672he$hPFaT4
ze!o}i&0s=dwO+hn7X5{m*qUejUfr^LvXX{}svTzB6i4j`L0;tw27B?)M~`nj{}>T-
zq)f-k9BLy2Pzc2h-|>nXEjX5+bVawaF!Sh*b1;-h`E_N
z4mXFwX1B-HTsmA74iR=GyiydA0&J+uU4QXj8AGaJ^_r>Mb2JP^;(Vy%MCBS_yJ<50
z@oOEAKpUJxb~iE`_n**m?4z3I4RfkkM>gT$a5S+>>mD$%7||jn5)1S>$w~7_zI^
zid_a=*rty5F5{K%>i@}1K&=ST);1u=)bfwyjdoIAF*US)^vDO`vK$(v-N%|w@-Of%
zIOiW-eFH;xh2}9B)hiJm><8zuHI7IaZ;URKALIMG{L77IAcmqJoV{^Rc+0&6aV>ox
zfPQ1rh6)*By-H%FNFTmGD@W&7&Nwk?z-kS0M#@tHc!)Y!)u10{q&+5K^ouC_28!XT1-mR^-CVAP;2
z35;+{m`)5iJEP@6io%6K@zsDibDKMR!8xCbpfilF7l-EbM3fN8JVV&9zGc4Uvylw-
zgoe@on#BC=b!Z2rtr>k}c0;cCkcz!DLR!ac@KaufRjFWNEq~PqxD177^CuPx+(dkj
z<>Qj`P}
z$+{h%^q^tSpp2jPNuBcaC_nsizzT67S$}eg?`Gnn4@vP+DL9o7ex<@<;&0L1j#e3c
z=WdXfsgY~~q+!!@{xm0BRR6o71Deyi;AAB6+`X?d22YH(=wZy(j@13Pv&
zg%sCMN1Jw3G(C)iO$-~XMN;3Enx`5qHAJ2^J&u3pQX;thX>AgV^hQ({)3Fjwg&iiQ
zgs#l91QVFi
zJO3kNdO^)5)l+FTw0C2+Xv~IG&4Y{?OlvI`HrRjs9c9rBXhFpG>
zK$amf6P8hZCF*W2%qt^m3;eell5SHfJhgQ+%;SJ#|3m>~dR+i8%*Xi%5wK6_E^4j@
zGrL>mzu!A4M$L$iX=^V7>ivx0p*4V^J6e_{y@=kvjK?VFjRWhwasLCDC&FQNPraY4
z8i-}GRKK$4jH{~d8$HMwvbZ`EA>aoWL0QwWy+>k-zvB@mWHF^4Q;%Mg@+E{1@$oq@
zwIK7&y9Q43wigm5!jk4r2|uBbOBJ!1Ft1HuC|fVXN}F>|#t_!uAgq>X?s6*|4ny+X
zcKIkuGKf>k*>l6(ho5DkR>GnuaYS}ds5=MnuEPfeTg_G?b?pI74&p8)a7PQ=i=YRE
zn=)5D`A)+@aZ~Ip@DJv7I9SnG!V2F*ryI<44=1MZBSUG2dclbc!QCYBk-(~RFPL|jZ&BK_SwN@ut5wA!N_q-$SNg#mzZn$>{DV$
zjqcl|kkT4d-^QF9W=hB3cxl`*Pn>M>?<*K>JO0rt={!2|40Zk$T}anLNal;^9go3_
z(Je(&dytSZ>-%1h09Q&)C{VIr_tG5MX++{+a!NPNhlP5_&t
zKMRF&?OGf6z%@0PtBh4uY+c}?%IBwB!V40
z9*AokNLK{OUP*-Y%y5Cv-Z9H?55bQLvu6NTeZ%xE#-`dMnxY2O<&IQ`Eum~Np&xkf
zwP>*-bs>6Y^eH!N&MhR3DLYd!ZyhcbxaqDVcc{v~3QU<{JPAJi7;;$o(6Ck;FlFB8?
zq#MtDB`M1ghIGA)Cs}1ihi3-8lBtCEv{E4=jCRLR3Ktz|fA5v+P0pEPC5fISeH@V
zQB;V{c>aXZ(ugY&9tJkgMu=q_20}*-u%8G_uhh{>G~Yy%%j1xv(CiQ25;YvoR>yra
z>?J^}93Nf4CAS7+=EG$Qv_H@5g5O@c`fg~cA=9QDJgcijKH_}iQr
z1Qf4jOsD9&!>Iid=~07zgsp$(Nx5-|fv#9^)iU+|Gkzd*
zz;|a3K+J%-A++dPov|D$I8K48vRInVPkUUI7sk;oS7;C
zC8O7(L2^pTNk;7l_*0C#eR-Q1${T{v`*XFno>`YVSd$%>O9gzqs6{aPl^}{cDyrvI
z!g@_<%)%>$F|7>VN>gms;nRHg_ZB3zvmH~DVsJ1js7hvfjqz?&6WLXJ2F{^Ur6&h)
zGk+SRw4A>xhd{@K#m70aHtE2yIf5cPd&^+p-`u~mTirp{vBQLHeBW&-&rv29=D|Dk
zHn7kk7|6c|eBC(Idi!mC+x9h&1ZsO@Z4AkbZC7^Z&Fh^{4~g6fal(wby;r)ObxCol
z88q>DAg>0>!b+EPjNEbPw#Hj{A4hf9h*d2gk+0ZB>yhfHc(DL}mf*)oX{5-aT~C-2
znhZ$pgC0b(>~T7J=>r|;XAISL6lbUZR>9dWUY8^ert*0%R6wvO
ziRVkTxEV>MiT8*>VIDr?b6^WOM)5S)W@dfHvNoFSLYN!q=M?Qoc&`6E)___-No$5f
zGDs?h33wuptvlhqS#lqAfvt8i@o^6xJrNM7InV|6?8y6MZ|C`eT_fRKVHa~$0svqk
zmZGAH(xRgO%Rm7D)CoQb{89r#1Vj2N#o}~?=Olaa1(D&9N~n_6BB->%+68LRNwR{$
zl!($%QAIo2_SeSsr#pL9l|hvxFuwj=+&nbDr6Rs$;l3O3ab1pdc*S+N`71IqPo3fb
z*}9BM&>WbPbWwq|#}X)p(FU4ksmyfJ((bXfUeC{-*!x;rN1p`zzow1+6)18IL$yft
zsB+h;aI&8w$g8L;2ag7eD0SnX6V4(h?6TpJ;Q3gMN>rQ%QH|>V49nm*7M7zbph>Fz
zDD@`YZy5K8YE-XiIOKnU7v=Wxh&W?|j_(D>A{7n}q~%Z)fn?TpQKko32wv`9xxJ&c
zV>sIQCm0i%TEr6yJ`=xRE{!WhcSsj-{dqB3Ck_%az_wVhetTPK!Kd>7!fl(=`E(>q
z|D@s9Lv$Y&2vTq08^W!$q@FwP!#Lob)%7hkZj;_{R)F8OchD0lxn8L0>4KA2T9^hMoG!?N978mkbt4Fx!qT@sijCuv}ItKUD^_vsB5S_
zb-6+1bFP@EnI`tZAiMFp#u5T^_qqRX>@zXwG|KON#Bbr~#69-tzYJ$Y@MB(Z_U7u@
zL;L@Ho3Axrz=9Ls$bT8S6#svxoc=!?9Ql9x&U**Wi=>~h+D*^9!WZjSw=X0B#veR{
znnY3RO*-(G-=^Nl?nn2XQv~%VtQa6%=yOd!U?49p4-p{q;Dkx?=B@SsR4St#{XDsr
zGY~`)`RIH`Z1N!hBdHm&iF;lXgo6`m>IqdAB*yD`d;Wf^kg&$cx<12%p6;Iwb2JgX
zJQ0{S5uG*>IByA+3`Q|9lUa8aIHSLP{r3#NyR&QmE{zig;*7^w&z!y#c7jJVLNT&1
z+<-BXEYb%3yyLmiXdBXLJl&~3lWKBRFX>07>xJ5f4@e*?CurC6!R}4}tMv=hbTr@)#s97YwAc1{mHL7(;3%1lG`Y*LliRD4lzW;(4Y
zOlU#RioZ2XTIEYmEi40gag-h|oZTz;nO&EwPG5>ydKdZ7wC60af9p6rF#*X~i{06o
z?I4iz`f>QQ^Ke1v_j#9h!}xW}_?+hVg8lxI_Z3~M-?Qe1wB2=R7mh22ke_>YJ`yjS
z911D2o`G8S`f{^;;~T!6xAtREUg`_B$L?G@%&$kE#rZjh=kxRGi}fA(Yv1on_wLs9
zE%NK*>scSUQtwMWPF?Kwi+QoRrkfIFo%O9>sJm`~9o0qPrN3y<{|gDAsLO=pru(yJ
zxM$&-E!WDY3F_4fPkp!Bw!!>*ZM*!@MZNih_pN#ez`}aSXE?sPl$y-^H`=nH+3Z$Q
zH*Y_;J}N(av!{dhQQ027PaUpB*`5wEt5$r7xC17VR_uZY_Vvsc5?Lb}$^4G1nJs(d
z_fgs7$B6#AXMyW>*~&K1^~)lU(?K>`mmJ0U!N<*|67Cd&Wa3lLd|@7ivdhM2WgH-Wg`
zZM+Wm*k!N||5z<#cotHP3o90#43q6Y2{HZ1-C__fN%^Z$jtYgv9~Oj5vJD&~u5-!d
z4xk}bYG5B-zIgltzDO8J*o$rdGz>cV)rDP01gIbeI!N$mK{GAooz1X
zA^f;Z%PV&NCR4x6K@~Nn_1$&rtQsX3j3lK;GSTn3#^WWg<}3Q^?(FNZBV@`YG*G$^
zLnHTfyKN+4L`4~*0e;+erh%?0oZl5JvdO3ElWHvwe(=gCPDQn#P+$s!M8pV+*<+(O
zJvG`ghq;>da6Z^FqnA4XEdVX(Lu+t*AWs|+{4wV%lmH`zKmnfaJ-%cxx9qJJH?(Mc
z!_hN(iS75}+C%u<2?pgm%6IC|D(dd;vf6z(%Hd@512#h$bjQr^ww7Myn{F=pMr0_B
z7fKJWiW_5Z3@gjZq1B%p*bzABKhl*Y4x=Nem`05xXgI=TZ>-2sdbA7YVxO~#H7Y-o04%T;3=)sJ^E7ND7S0^f89o8*4YJKd>}5@>R*38AnFVH1(XR*1d_3;
zVgq=Eg1d0SD`wUyca`d{BEOp^(}>Vs^khl0l7v@<&I6D$d{BW+u8Y
zI`wNWcv6)WAB|>^azV*ZpnsmLEc2LMid0DOlgY^|?op5IVy#JB7vjy;1;xdMqs~zkojqg_B~~aBNvO%(s~cWQ3DJnx+HPh|bfbH^A5F6g
zi>W0gzsnRn!e1?3H9mh7!h}o?Vp1+0jkk_CK$WcT5*ZEtT?{@L4$hSc4hCEV0nTUZ
z^&v{=lxzLems2bz?%I4b|HKfe_;5TDkX(%mm%uXl@S2a3g6tn@0Ym5KWNQ?5d>Hm!
z5rg{T5g{NRI+42Nundd_9vqeytE`?naZO8ACU^DDL#yu#0HT+oum;!wwgK%TCs2J*
z!ostJ9p@_`cY~BQR;555Me?j`UpoBrYJ+j*O2|KBHT97fewcFQ)xucF&kxmOurNZd
zMCaZZ)UMDft*)1mw$z^Dv-qxE!fNr{Umrmth`Jk(%Xb9jA2x4|qIGJCkFn>7B)j4A2ibkJq4BeT%T^j4f
zX~m5TqEF`L*TdEK3wCxCQ;R}MPM8STOq}sOK@D&VTBwRqea>UAXNqQnEwkp}hxyCg
z9v^P|z=##aOq&da5z&$Y}0m)$)+)sG%#+2cBF!B3ml40xCKZ?Ca;lAag^@55*!Za1*AGxR8u
zOYd>%S@*}AJs!G#Z<}A~Ax*VzYc#JoDj%`%U;W}^(B0P+hScmP#Pj16Mo{?vwo=jP36*VOyvluV7Fa1&
zz(GC@D|u;f_IpPA7}KvMMb+PIz}zkvL%H{9jE2NuX51|-b_G7RawBuc?@?!d(-{e@
z#+hg(lj%pn3bk~;FV6jpdFt=Cp8bCt0OPsU4G0St>b+wPkmB%LcAV$M{^%OfP&7uP
zldR-peQE9-4axBzZqhO-IheDQQzxAA@qN!f#6ZFgx0;2I{Kg$N?r*n4tlB&>Z(`Xo
zj+S{VXF2G~Yd_{%&68f*H)N_Fn);C#4rNVBoCM)g#OB~AL`%Kj`L4Ma*9
z)Q4z6^Fav@-4Se9J03%+@E>V>D>CCU(YAn)jR|oqha6fyo>R7^H%87Sw#i=zG&zwo
zj8`c0CDB@_a}BuEr0qwc3kgvyP)$MZt$|aAjPjSU+=jK*QluTyYW01mT&G*fvDy4N
z?grn`owdELv_+L$Or%*({)K+NqR=N!X;^QYVA(t*cnSBm5
zw7FUrGb@Wct{r%%sMRoG=Y&&Xw0*2TI^6g5kf;k6M`$er+cFB4|Zk2{>`Atw`T
z=3$?Zkr2VG0P8fZY34P_?G>-1IDRSI0(n{7tG`aUpdpAyNkC}oT_9A!&?<*>)+_ZR
zE~+AzWYVd>!K7*;*khWj8vQ>gs_n_WF4doYewfE)WQ133w{fnBemX;nw=vUMy?*#0
zTp0xg)cAcT`)(=3M3N%($GVAIrPPmoR2jiOeA&VIu+p5GE}4C
z>r;PphnqI>pVZk&u&I_rL?pP*^yt_O>BCw%?N`Ule#{BN#evhM&P{6Np`q
zk;2%D
zT6^&CDL{1J6gM7ZUXmlsa841l7R18D*!-YIP^Ecf$h+}Jr$woqKVd7#adOQ!Y-l1Z
zv_8Xd%TEFUjf3%MW)V9ssnsg>drPL2pt$NMwpNybJk$tfMwHNDU~&OMC`_>7yd?E&
zOAIp{?GDgtaJQb!mOB#CuWRUcIBJ^(>=Lj5nI8g}UzG@TBGr_X8-j$eCm$PlkMgqq
zXb=8*WcjK9BViikL{2twxF%vc5lHE{l);)(O%cgvXOwSxvaA~$Y=9x&xpQdnr3%mq
zX-ktQBOH4;9>r)yz|%bRy+{1)uPROkN2;?V?N?M;A#Hh4=)aMBb&Mg4iNe)H5xP)P
zNF@trFaH!1o}~ySk1uS2Gz7gF+dA})(DaMN^A{@xi0253pi~@=V)RUBJwCHbJs34T
z58|NkIUqL#VkFqxJD~8rdlpa@{_0&D9I|LSwwr{!GyUb0j{UHu)Fny!MHH|w5~2M{
zNv>uE6i9A+SIl+-m6=L?L!m*_LA5d)PR1$q=(o&q5SZesW|f~XF|b^>U*|l33$BPV4QU7IF?QQv)gAZCD2um589}
zkOZ^@>KDVf(n<{~%8SNmZBmb%t<;8`N($|)@4I&Fou$oOJR$hsdj^#WhJNV+WvhwC
z!Bi{obSwW_!rQuEa+6%(E3AHxIx?VTsOdrq6WTi>NzY|7CHcLe*aQW2JX8Y%mf;E|
z>Zjr?J>cE=`**NqZhaN&()>Sc!T-b7_3H$ko?zGB!#q{CJl&GZ&`Hx?Q%(9&yXByj
z>kLk{wpFU1oiX)AH>zrh@CV<2`u$8zp&xub_;oGa|7eJfexTfV*cE2`5L8VyJPH~LC*q`?Co
z>B*ZayCZ6x(ys;`BnkAXX!_r=G%*JB)D&zK(xX;Mr=F9uI|YF>^6{_ysyr}HnNQj(
zY)t$Q(>KNiaC7PFuYD$Cu?5P?yloNR5L~)g42t<})Y0GLp;X#31YB6@Vaxa%4rrLv
z3ZI1@i63d*zuV=Ais2w?E}-$|4kieBt15=KnQ=5v&%RSN^oJe%?fF4cJjJ)0Ki_6v
zxE6SG6Em!oxEJR&UNa5PBh`T{f_^KlP!y!$^rCEO%Na2Sa-=cTMVl@i%f6nbn?AxV
zK3}ngU5=^WglWaM@1ohN9b#zHEE(Ofub}9kt$ze0gz`Me->5_P!-ym<3o+9uH;8na
z%KxiwdJOOkX_$EeIoED}zjp-RPawZ`6&Rm81aodq=5Z^6$skZgl=_!j@om>bTc7Cu
zae&5pl-9*fdoC`bKq%60S=SLUL-C*$^6-izO^E)~P16%Jfk>@e(O$^~!Udz9+kxnX
z7Dx%WOHcMKgZNJbMyzi}iZXg!c_VTgp*1>TEJHz7QUSse;8mfOKu3-`?b(-oYgL!9X_s&1`dB6TeUo!1$
zpifW3J!>!+$JGX>BMnSub##3i821%p!W7UK6_USH6*oX`z!77~tmTV;d;p(|gS>LH
zf*@jbkHc^yo*?rR&;mUpTIw681kOUtD^~eIDD&52^r}7o(K|C+1_hQStXm{tQUoBtJM)>WtLRpkfTif{8?d>mxcfJ(6L_h#>mahW-FKx!e;b`#Rb=_-6aaUJ!>k;m
zwlG^)l4iV%@=u8`nOm0D*s4lz*0%bk*7U>e8Br@SyLj#b0$3rr9pCC*_-!
z>Jzk31td6_+-`y*wSQS0rT~8ozghR$!BsvjGbz<@Hm!GB&VV84&_csVe+^pz@@hvWI9F<-nM)pC0K^z7;Pbnoi-
zbnZH5)?W;nI_Q=_*R8Igky43FM=N7+J5*-i^(EWH{lt-iORN}j@qmJj#Uda1@n{~n
z^x0Bf#tM@r`n#2NM(w|$U)iD-V$~5TIHIG
z3WTa+tAhbu1V0#uwP~WyJ>R~WNle4uu^94KbLq?5I(-Hzt0Vi>
zS7#SFRi8{C#U5eps4*6(43otxd9lE^gCJ!&BQL=iR6xQ;iELc58uS+}i5+L0e`9IY
zuaaf@Q@5p8%4X>2PNa61TELom!nl*au%`
zl#W)>ksRY<_2pJ*`5ZwOJ1cu@<`OM!c-O|!rH=xNCHmx!dl;MAQde*B;(*djQ^Cs)
ztOd-43^L5JwsmagARV*cD96iIrD}KjX?Zzo<9^$zs~ZPEn5sQLhYsw#N@$t^K4i)z&!rGV$E
z;u-=8xDl+hH%pT&(u$$B@Tbhw+EtZ?*hHJtqeqHTk8J`5@q)2K=0!DSs~6C&>`T-E
znO1hq?MnfQ1*m`}t0Ifo;1P7JZ>e}kS;LjH@U2oEfIB;6P+?XiRC(eYN+-KFH^%so
zPhK}HXfKa01=e)C?t*t~
zyVl*SXYb-8^yv)q>MCnd4)YdT5iWD9^MOgxotYkwNqUkp&G98>tORDYPjwm!q^G|O
zm9M9-;r;47d}i9mLd67Ty6)iBAri6KkO(G2sXrO*lHtI^=*L(r_X3HYJDmzi9
zSsQuY1;JrmHCp~$FRjDGCiNbUquhwDHXn@?yU31(3ByD^ddP&8<{P)wa}Le(J{!0U
z#I#k0OS^MmQQYYzF->|?gI{duSPgzkVexWGOB_US>f$cEXIT16_(i}G3R5VtL?%=|
z&@X;ZZCOx2HkO8P8THDiz?Gw(J)@3pim*VZr>F1G-nrx5ZJ;}o?r3{A69^#U6GaHS
zwXpENawx&8)KS}a6)x~jAlm4_AOQlDu%F9H!Hnyw%vRsQI_D0A9*S;4wnirVl8XcD^
z#KNdcZ4m+|H#1(lQ>mOZ`|qM$Ju
zCyzgW$ezx(R->eh9!(1<{itZH&GWv>^Cv-7BSi3SYIyI~Qw1_&-Qr-AxO~SB;SsM+
zoDtA@LG~AZc>*gNaHjNL={Y%87Y&c0plc384p^vnQ-x5m>Eq!5aOjR|ssy_0TeL}g
zb%(BMrg>!AYnx@KL8i?|HUi7;%iMx8BXGXK3K{qwm1OFUWy)O2zVDr@@ox
zC49ng@lKG+kTf1QoQu!>bI@YoW;Tu)E$K@{Vv`2n!r{_&;RX${Zk`ys*A58UfRwYU
z6dVRM8L0BsLjV2*I-r-!r2-`@buu4x|0B8r8?II+Vtt^UThF@?qAq-Hj(rB^ite2!
zk!DE^N;*uTw(UXuZKWZAq6V8gK0{sX&YukKANP^^U-xtZAGc9p<=Ctln#(H2ED~;<
zwhh6*x#ole;s1mwlk;4P6f6(eZzd04p*7ZHXdSKyo&s0U{n0n8O(Sc*y}?|+cfJf7
zD~vPay11DH6@ZB-#2R-%AV!qG8BLfOU&ScVg^yeBSUS4WvIgbwQ*s!Gi4AL;$+sXC
zA2d_XMXS;W>>3OOEavmseJIX4fl6T_
zkD|g}5K2F=s&XzqmEN2lcY3CNNK^_MRrtCUVD-z(z#E-rN;XFzX2%#I@q_*lieB~u
zfuwkZCs0Qtdrmq60OgIP>r{-tskLsHlC^%#gPL}Vn`nCro9~_;LdV2^4I;g3OOKC_
zMpR?+qwDYnE5XEO3)JC|`U^Mp^oyHj>A~b)#7os1MGgI&Fy&k_C`4{vYctRBe%bsY
zlu<=dN~??*ZJQcRe;3zb+4m6*B4}fjd>5so3~`KBuk>i$o*iWpGbTlTWxsbgC#m3|
zh!vt89fd`4MDE8(A!bYG@@<_*OtIJgc-9aAq#
z7b0AGkeLwK*iYj1_`6+w0LuTt4iXegLMtO**mUK;h8#*6mJ}sw@W34J>C9IV)Noyc
zjT|ffB}&JHFo}WQem?YXYAc02a_RHPaL_v%mie)&6^#J0W_x+N2z`3O->SWG@4%yT
z>A>vNhg{a_yU@1pP?K400GXy8Rw--Yl93z*o9fj~q91xVL}23kj|=6(sBnb$Xr6
z^Qr-7X(9Q*WLFcn_W8QHMJOkc7999SY+)ks)Pto{nSw?x=$Cb508KSR#8bhQfgv8d
zF`|Ly5{tQ0U1KImCE_@@@ittG&4BBAooQuEQB3ASnIbNh3&Ck(Ea5LHOyU>xQ_=m^0ER~Pv_ncZu$HP0rmCefpJ!k
zhP6}ceC$|)qzisUR|^SDMo_jGw9a~c-_*X-iOQNV2t8uRoykCK2)hTG4dz8TE4L
zhcsCT_L=8%i#rr`0>$o*eZq?_>kjK2Kke{qI(J=-H%#1QW%_*4kGorG<0vP>9}G9%
zHfbuQG1RI*L~gayL8If;lSOntA(SgbcU1Yr?1UnTAlffCN{Snl=T7S=7L3Z*7dQ#y
zS-tp;WxjKDR1(3mEAyn!@ljV39F`DiuH+fZAxepdPU8Ox#ZpA9;uIP~p_N+0kC-F5
zGi1EIl0c{quz=aZKN~H$0JD0o`L5QYSkQV
z8;lk5kk4uoK4)Y1F8ve#gNHR_jv|O%saVNZZHysux`RBv0x3{z4ySJkMZ@bmwugEsd_L3~=cYv`4me;qi)D9E`Quu)dzgJ!MM+Zwmoec~K90jaL$DV|rW#u}
zYZDiDCdW?ANX}L}X$25j$2jrZ05@;CjBtpM!E@#*nxN%|vY>1|CWJBe(C87rq+3;hCSnMVXX)uWf%
z6>*l%S(f~Pd&allL?6^-mqV3(2s@F;JcT=-$vlH=X|!PrmAh4Q6_exXw|(>2vU~A+
z|H9{A4MVH9*{;6|^r4GMh;%(t`L{Wss+tIN(XD5IKm50rGt7ihfhx(cZ=Y=YY1Y|Y
z^)F`2qaU;s{j^)D#cw%7+JrArWAYeGb%aV_wwqwdXtHY*D!?!<-14yc97stxofF`~
zPpL*mZ?rNm|K0=wCk?O&Y%XfpscJ7-xLhviRZ-)&E7lviBNvj%1TIP%cmWpI2wTH(
zX{ZipjQ`28&Z6f%`x_Y2k8sy*MlL+>&JUL;qRpIUkZ-B(9qOkCIu~bW4};D(+(zUI
z?D-cK>#{;X*>Y>G3~fKB-F}+{#`ib%fXq7Yv;bK_@zD+s4
zN1FatG-6~sMH}_X{g(m@g&VMQKB;`9j6ZLGPm4lMyg)v$Dgl+By)*o(-1By@Bx+F9MK)uOmOCG~4Q(RN5haQjJ=KS-Or;hDLRVvN6J$iQT
zNBAr?qXzb*2GC+r4|(g5o^+Q#OSa0l8VV*6Di{hPz6)UrtrGlGl>5dEQQnm77x4>J
zpyzemu$v1Yvr&K++8`-f82(ZCza}%5dsKa;`SuCD`~ups=?Jdwf`)wcBb}*#LK8Kr
zAcvez%Y9U-2>J|4)h|DCG_->ATZT%bwP7UyK%gg{A{h^93-Bqn1>isFc7C5w>$Z7L
zHO+4OY1l%LZ1ioundGIls*TldiAW0UT!Bj<(&N5%Kq8h~LDO=xdobPKDNzQS<{uV2l2zBHrR%X>0wvBSkeNDWdvFoYhux{(#
zvvvL0y?eHfsK0&w;<=f0i8JVUo(1{aut)S#5Dk+EM5GKr-R)M0Yn0c>feqV^IL2(LG^
z#cWY$wVHUod!PSD`h2h)Ec!PJ(=)0jcxO6lDe7O%s)wMH03?h=g7i`(P}TPU!bYZ`
zRX`^Sr%JAGl3S(hS;~QO=^8QZ+VGpF$sxBM^61#KT}lJRjI~$H!R49c%?zm+D{#J^
z#>MJ!nwkfpX#eL>L4ku1&-C0$Q0VNU%bEE;pk`BBl)l#1S;OjEvU43iWw@*(=*Le>
z9zi!p^gs@d3Tzd1Np`pglR-17g*Z`Yl11dI+i+B=;-cWA0>AQJV;uNc{9B@m~~WkHdgZ!#m3m3
znSzoBy(Wn&TI=CL4>}GaVeKtQMASPGbT+KI4y$&1xhjcd^gc}N4Y5IV@Lv15SxOJ~
zPra8ndHqklG+rm^^5Mon1&N=@{1j63fipMtq{86Phr1qmj~?IN3(#9k)b+)iUXP5_
zpwI3lFqAxb;IBFPo3dY0b|BQDD;+qWc(CM+s8f@F+FYq~vbLnDwM7F1I996l*V-Ec
zH(_6^wYRmgdPyFoeRP+O4vhEE`XP8Ngt)-!_INaW@z|A=0nK2kujzp^<6j9#X6t;=)OZ-!lIs~tQL&~!jva)V&o}fo_DK9IqkSyH{7nBi7LfvZ=^&E
zc3U`e&RCZf>tezEvP@WJHc8*am}Uc1>nUShCoF5mG&9oVQWF?Mv+pKJOWA6DYQGXg
z=@j}N^vpwVKwHeesMbJKIbyY;4Gf1ufW6j2jTx?5+$iDJsgi)f-Hpb+LqAd@b+^DO
zqKJE3bbS}+WOZoEvS2-(a5|lFIvaqhua|j2S$d6B*$ZyB4PU>0#n&&N@%8JBX`XPq
zy`$`#)rsso%Dz=&+Jn?oA23&s3JCPWf1%jOP9IBJHCQjF%CNf4`LeEfdAWF_U3;8P
zE0$%!G|gtX!L35Pv3;fNDEmG*A=W^;X8XoWx_)PW8y#w&@E|-SI)WdvI<01btb_T+
zhj?x{y$||)eV9J{9big6X44E0bKI5Cl4O1%y|?IKIo)bwG&Z`X@l8U*&1T?Hn>*Lq
zCIA}&U_lPTC5095?rOH-MUB;|4uq|ZM77RkMnbmiHz&(MassD}XbXBSI4;G+n&qExOUzSiVENShdwwo
zk}Tb8!1pIT*kCDwgi}ALKl+{|s0PkgTw3S$-rDOHTQIK&6MGNBC?--VB*kj2iXSuk
z&sV7*QJ3zTDx-p?(bBl}bkZR0yIVi0fuI^(4Q3J_rL9!Q`bmm)rP}MKq7^zUTi-3f
zV~?7+5c}~{MW=^Mdj*mVQ0i{3#Bx^_(+cMUz^M|%psX8uHKIDtB{JtZre^#B);`}t
z?sSyG=g+STZj^u_L%qb|h<%f3iI{-CE?Z&I#EZm#?OrT*;vPW-l
ze{{n2g;sGk$ae5zSXHx16)D8Cze$58KWa44vw!68W>TNWP|y9b51$6qHMd-Dv1!Wgq_ZNcPFL*Npx
zIZs$->z$LmmgjaWD7Ota7Hr#sZ7;zxSCKM-Gg)t*Gyx3k`;PN%$L3CW+ve_XyXkYw
zCe2+&wKir{3x4Mr^PDj+R-Z6ABb8^y^DeO;#&(zFcs*ycnQ58zZ%zPgs{V=+7<4+qMm#KYzyOe_gSD{)~B^
z@cuU8dVR<3dd2k}P}!j+aYZm&wd2|uNj@@Nga{`%x=`fy+>He
z$n%V8DLC!s==Zk1UO>1?du!9zQ5zE5~zq<
zzSnHGn=Pc|K+OujyFI!yiPWvHp&xjhaIDSX@Fe%1A|E6lz6(f^^lPSKPF}vJwal^}
zg8${Au%^5M4}@h
z9X$D~0Y(4L;!K=0R;nH!1pwC)%g3h4$JQQ^y>tt9u9oGFzbVx3Hv4RE$AQ`q+k0O1
zenj0#6`&5F5-f>OO?x|O*PB99?~51{-$%QXsOr!f%tKJXNMbTd0V8?4Uv4($-h*tGprF@|VO3FQ;8uuuS%p7$PWf)2t*dP1KMLlHNY%~iko9egWTWl5EpAG&H&Mtu(+6!}4w+#(GEonHt{K|I
z^nCc8xNz1W&;nC#S;|_7pOd+_&C`r|UaZl2xmiZCY(UxEBtm;JH#6pw0jQiZ<~3uQ
zQ)6>P%+0D4lZZ$KQH^@`B4+PZ#7vl{j5H@qQ?j7H0Vp;wH4*Y`(mD$WJUf`2DUcnz
z1F30V{Btb}SnVP2pWTduWs2
z%!wPA`Vna#-PINyGmZ$_d~a}^67oD_Syr4*CtNNUoX#f)Q|HFskT<=n8sD<-Sl0#f
zJYkv>I03mzkQ`WS8wLCSkumx+1!tuNMPR=L^m!bLgAq1)L@%2U1;fC#`wszV9RZR~wnO!rpPSiC~KEYACx4QYY6!
zrTbkI???4#-3qvwLO!w{rbEzqXy%B>aECzaj{sB}vxYIz;~PWG9%^qv70{lILKT
z(Hdz|D^9P{i_S3PDCBRwku3PJAc53sfD(5%C1nB&2d6t@r3O3TA)t2|o~OK%&w<5j
z6-p^smc^1&^E*~Iv9RwhDBFg25XiI1Wq0SlvV)jVX6t39MWA!G>~2mTr#BZvbB9OhtAntI@fu|Ixm=~1?w^+dE0}?O1;0o<6p_L?YiwK*UkFRh}D)Bw^}Dv
zBxF#kJ%tE|&cpUBy>*MlL*xN_$d-!qIuv)KsuU%W-ibVa2uN|y{8Q}xq~1iALy!#}
zoO!!jFNXkbG*qdE1*#TYiFbxl^m^dhjLC?b-c%EnfvSHFCL&$E(V1C|yb$RSjP>PU
zRJgRnLbYwCLtB)V{dd4q!IU#&M4{H!Wrh@8YZ{QOa^;ZFt?$jDkt$oe3s>)o2?%L`?fA9Ex(!La?CpxS8Gf<*~U
zvOWlWj~U%ZEYx?@G7O7AT*NSX_A!NKl_@pL@p+!H%nL}(we5B**xuia15Rb&zLMB*1TxwV<0**
zGgT5qi9soWb24|OFrWVL5eo5B%3dS3;(WGdw5QV73;cM4OO06)~1w#WnC~$
z8Pk+P(_~x3x7!t0w!UF^VCvfyP#*?F)a$6-&bkNJ8rckKJ^PPtw)f2H*PJwFCGf{l5LYQAwUC}#Qs
z6LIGR_&@>l2oyam>cw|BhF
zE8dqkye*5l00P)=8*cAcYv)-Ma$?MuN#fl|2sG
zd{-P;{j0fhu8TJaqo^6Nh|7Q3
zqpT#|z@oZ3v}V37=n>p?DUNa=?lmX1SW2T=!#7P*erqlCT8OITAgfg)l9uJQ8YZ-Q
z&|r2w0@q6GRLx9KH%Kk{As2VI#;VnYu1m11#0*-Xx;vN}<-(7peFz9ayDX0mu7aQg
zGu0cPc9t&4`bgTdVzgn`)*|&zYY-7Y2P_9>s(luB;AYIOI56@SA-?7(N+6n`#1PsI
zfRYZicu^&c=9EpKOr4F2$$E_dsj`4xNiN-Xqq-fppL2&@4(1XG450*}?8z7nvNA#X
zoorNhxx}5sw6y)8y7U!~tBNN912grYUt6tlZncME_o|TAvd4ju>NU{^>vA2{PD8Il
z?fuxK*m@zif^sufinAKt1-a?ZXiCyu9)speJE&^4{vb$`advYTV~*I`T2R776DFE4
zrx_DZnDdOpj7@J~B_w5VVHDb14@ooPNul1>lw1`^vDYyUvOJ}pw5)7Yb*lFL`?sWh
z9L^&Jm6O#OcSfq4wp3#pZkp+;DAFC!qFqM)w*XSdX-Yo$y|f*rWQCjafWoszusGonovsuLCK61{5iKJz=(*!aOD9nUR-dOw^pZm8;hjl~TZ|
z_Q+85?(E&@p=Y;353jiKsUM3JfNC$6&Pvrz+D1LwO(~)%qFtgF^yz~O)mT30fm{{X
zT3tnx_x*@v>UaKVqqyCvuhvLebJ`2m(+P_MsAX9(ud6eYRtq&xGeo2pLRwc!uiF)<
z6ztm#x9hv#+@-Ox_892hZ$|sw!Hv9GwCm3kH=_VVt+(_BrS-E^)!8VwJHMQpr0;UE
z+NgD1F)s^NokDBvXCnY6PKMhDpllIj>y-O&*cTJ(#v>D%jJ-_)G-{p5PHU<8H
zRMPi-G11ZTI(%wvU}!y(9CR_8q_0UhLOlY7785e4QG9cum}Cybyf*Uy6@_qV+4L$A
zYPPU!n@{4(NbT+Z2X)KX%GAP}S#z>S;?#8awQm^#R)>_#@MNAcGBczU+^#q5w;SHS
zE-2gPh2kbPY%<|epoM_Kkllb$Cuz*w0N~!Yh&K%S-
zeD%VS8sH{payiJht)9i@RG7e+!J4hE#DPt5wmVAuDDJ6b+P5*+686ENV^C
z*Bhw|Gn8!}S^vNZ6Hl0U#>^9DUN9#EREZOkSbtAeOLB|sZf3!)F0wIF+`$w`iHaPF*AOJ~3K~%OxP#RlDNGA%k
zP;997Hvm$xlW~|hTEsRqQrl>?+E>#ztX#i8_IaG6{>Wxp9~O3xz-uhP?jWM~36YR}
zs5aRafP3RbNMn7j*DJ2qg!!6qeP_%IW17s%zk(=c@ocIuVB!)Av8Jvh0@4Du*3yQh
zgAqo~#!_W>?aNC~F-hw{LEy>2*0g5K=LuymSWXk>b;7hvnC1-f8iQKESK7i;>s+K|
z2dic1F4km|l3QImBLxO>@g~EGkvR6Sh>N1OVs`@_3>0hNNYyeO3|0K$g9b}zm+EnE
z!aIBasHxQmrPf!W7Wy(LaL!-=%X-53d}~y)F;X?jEP(8IzrK6(-hxuLwy?JzBe?fR
zQH+ja%vCySE{jvn)+9AqfT}gaR=c=Y&=We*pVtMK^NN@A2``s3USADF8GxGI7C02a
z`E_2TJ|qUDSqUB|LcrHfw>-&Dvwwv{bN4&;4=D%7^4P+yCgzAyDzFk
zzdfaHntg@L(^Y*R-g4j*%O^RNx
zYM`ltDCX?X+!{5rJGSL1VV1eiYbH0cu;dSC!s)!W+NrmpUGAryI|j)jlJ#0
zgn`Y?a9K?%4T7l4`;vp%q9`|$a?`fEd^vfth=UY9)cp;;$vT
zWORRPn44i~Ol4`n&iRGfnA8m<5*xQrc?b$_V#Fp#CwH=|0eO{<+HrDk%2^%7T4V3N
z7ui0Md`}K03^*pD^kCp1=(gyBd*hx8o+20gC;0e
zCnLpd{<#OIEjh#DPJkK>(L$%{dI~Fu+Fg3tec8m%J6FFpa(0sN18SlXO!be}FyXbk
zV1~_86-^amvC45TLQq1*8ov>g>2JiP^bwJOuGSn9XmZ$7sLQME5&Cn9$I)IlE6!q(
zn&jD93owubn!LZAs{LJb_H#KuIMpE1l1MybrWsROG4qU>7po0TGp00Q=D8(mNvQSL
z6x!{|HjD9|pkjhEQtYz6-7Lq|8PnR}xF*o61E4!ks>(pDj2w$MJ4at1A*J_wtNWy?
zyYLk^m0)W&tPGN5fJ#i`sI|$Kd{l1nSCsZ?Fn_2;cxLy}n|cvfKN-67$A5@_?zKAy
zCSLEFY7TW1hZ?9mn_wR_rZT2w0z5-o*&h)reyq$w-_T$Tm54J^yJeZ`4eEMX5k)S7lI+1;Mzv9JDYHhN
z)QXO6k?`icqE;Xa{1BkJQ-lr)K^-^F)||FB;hiuq3+8pjG|$M>1Ww6y?Nu&hflx{r
z#DwLvLc{{VY9rsAG*tj8nBFpOx2wxS2^-zqGWxz2S%*!bOEpuX*8gd4wH+1^CeP@%
zj_sv%U=_gB>9pc}I^l9LyYtuA7rb1~IGs+I=NUPtUdIs|3m!n!Adr==3AcV)rPWH^
zJNU))#Yb85gMDYC-RZzgK4P!+upa0`GIKb`gRD;;Bwjw+T7M#gB?@}LfYDujXHUMH
zY?v*n!re-nf>eWi-6aXmaH^)UM>h+Q2Bf%&l8C7F@=*2`qYk2k6(Zbfe)eK!IXP{;
z#=_T*#zLjsdi7Km%4A9Yb0FktG9Z==ROKmKK$+l%IT@GB8J|CY!Iv*z@cFZ~oY?m*
zDhfTxWYRtZQ4Dsl!D{OXCvQ4Vj5+1j*nC|V2T=Z4jfF~7Szfkmr+6L1W&mp6Hk8{2
z6a^Dw&J)hdYRUrZF^p{&>|_$iQWT}^wrAIUBWIH^Co|5d>kw6UwQ7yth)46aVkWOO
zDQ)kfz*I+*1pE+?6702*esI|2CW&jwCzl$bvNi#b7>UhhnvlHTR;eZ~#?sq+MX60s
z-XNTZn7~xEpEYSnz|#+_^Ex%I#|t9KQX9xQrObJpw?R+Fy<4pZH>8Jg`O8KfD|1QoU|MhjF5r1iRT
z5XG_0i?LQD#qWNyuZ7;|y2`^^%~su$s^Lj&H0zD33{1HVw-SKV$|bZWXov@^|L+Z2
zdutS5J~@~`1owIkAgbvJ&NGy=+|qE
zl&g~b=TKEjr0(KeJdscxehLF++0(iy|eDPhAlJt`3^6R^cAA3X(BTykJf%W?rzQ
z)m=(wdrlJ))ph}dqzOqf3hjf?PT|RYVkAb3QEeAVAT0o85Q^21I@A_bx8rY`cLy{y
zN^od|sBO=yQfKM%VIi@5(jA&AP}2&cgaeRr0^1bKO=|tO15&YwzXnRwQyD^|ZG;Bq+Nr*F2Y-tIQ!X|aqpf;;Artldaft7Xhw@vhr3;QVPNvcvYB5F(w&|ga}Lm{v7(C(1QuuVs6eSz0L7$e2l%Wm#X
zdBav1RIJ~UdJ@E#qTJTSR+P+c9z)fHg^(zN7{(Au!89cc!V|SQeLkIhj%Q3cAv18j
z-P{$jH0d#FNxK?5Wk8C+Wa|g1g?5=4OUgJ+vrBl5iMpK6IIkz1PiHLi94v~xAi7i_
zRjgj?x?$gTs3;N$Q<`y_orNljZCA``$HqHKvCQf2EdH(vo)U5v@RX4!n`4~VuTc%=
zorS{KFRYeqOUR&ii|?Ng1a>LdQMyF7?niK=3r`HiM!j^j)hFsLZ9EAMHBioMcF#oK
z$hJ1eM6$o5ba1EZeXA_tWHQ(0w%xj~QoARrK}~B;TxE*t^0`{;RgGAPyhUk=Z#P
zffNy;ICvr@W22PX+N>TF-u0|ICh)50rPZ*=kPxX7DIj;_D8T%$xZKvF6l-=lJx6gHU65uAw?wzH`
zj^`&BsoHO-iS}9@SeK3}Xh=y(ONQHb-a{NCI<`Qm<_9#yj|a9%4=T=Dk`fdJAIV9yKP48;BxJ#^Ba-EFIv=@5fQUPuh3v-kU(7HpqjF2l75{u0-rEb(Vco2=)SCQ-BNRE93o*vCQOs
zo`6JxluXu_S&)3ON!6sHx|&D<8jzAfJDog3kaI$IAe9%wv{;U*)a+leAha@4^C@FK
z&me^Xsme&r6NJpz0e6g4koE-zse&kWo6eM6?v@yw8M$PXRIC<(ckEHt7dMlpIaZ+f
z=AB&D2)pTrIrfLj`)HK*rHoYjg)LTa&MCJ|wHEb*YVDRu`|>oIHFQo`mlMK(d&`C#?G#{$@u0uTOG$~vF51jikuU+ZF3_Lv%0rn
zQ@bs{2+k*$_Pw0(`f|qW>nkoV7pwnTOv*?UYItf5__5=<9<1zJQ^P9lD=r7q#z=jC
zFR8ztv~Mg7{uXG`KjveHM$t6bS0A&rEkET!gLtnjZJprJeMhP}Kyg-TD?9SOIU8?s*z62_%9dO(AmywSCkIi~CDB!`q^VU&
zN}0X3bi(<3!sTVf%a=1wrwQwlF=s|*#oO06Y`d9HY}vRk^7C9{Y6>VU_a
zr?2yj^Sa`4GA8zNz8H`?pRMk3FJ?#V^$PGlFXgsjzi!yx&0q*cnzS!@EnC?Yw_R{c
z8?J1}r@Nx;8?M`qTiI=$nFYB>ll0`Az>{CUgeH#C(BDa;E8}2u->s0T)d&699xJM?
z#sj-|DTlYU|?t@hu(Rhjn1KU
zUlI)scbk)JXrkLXTgUdX+E`xwj+pUSW8#TiqwRuB{0I-WFxQ^t-R
zn@%XS_d46=Sg39`5u3eBV=T$%o@fE-j-q?Jo`TwhTe?{W_es}#?Nv33Cz=3Hz#t?H
zH5jHjJ1m|YY;06+4Ga&UKyP&x5>m(xXAVZJ&Bn`HQ4+_41vxxlBauvM&
z$Cig<51&U}wwo=mYG+^E_B*@Qvjz^(Y?8g&-$kpH<>s|88D+m=-`I7>0#rtdtA#wd
zg{f^x>OiVKsAhA$p1{DAB4sn7ND1IvHS;EkQ_o_hiIJC#`7~j@neKHtEtt;}rX?dS
zwop@|-Wfee!KmGg+s$y2xWJ(v6ncvG~5pDSh$#XrBF0ah-mv
zR*Hg4V>WqTHdd~qh59Ia|I^xk$7FtH@th|s
z{;w9K5+trw7SVL(!}bySbx3uZEJO3sk;(e&4tnHOa`2bAlZZ5F+&v4XA#>TX^*;!v
z26r{;W$y8QPzr&VAQ%&pG?M|8w`Q>#^iEw%`It
zMx?ZR-v2SDPq$rCMv%by0e}QkD!Wu&p6+S;3#Y2QFH&X%$%Sx)RTOG|xtBwu#EAVcNYGA@n=l4p+`u3|rn)>=Kr?tnIPbK3
zFIOa6lEkx5U!9@fN%3a6dkBNO9gt_oSXpW$$Q<1WajIeX86&k-WxPnf*S_fhR?>c?
z%bFpY?Hgam^7Omodive5erEk?17t}Um}@2?ffakWg-TMUB)3*DQ4#@GUSJDWmqFY8
zqg2+v7h~8MY_aCtmS=Ci{p8ZV1FalrY**Zeb%vVo+%Mu`vU8wr%~b@Q{{2Fj`LHVx
zXD0q&Z@qHQldjO5NG??>FJ>HPH>tQ+DzA+zwu}PxyP}751t*a>xwE>(xAv9)Rv`7y
zP7M=zi!3sg_8T_nQ2laK{~18H_cr_eTlDVd^@+m^d5NuTw^a*~
zK=MEFEH|{x-wp%aZhqesm{Ph-ss~=iSZ&x8|u=f$K$|nzdi7H9C&{_(R<%}E`}^d)a7^pP;2!PW=l9bj%NOj(?HPU
z@jxkr=dcFpG56HMc=qM!kI4eHA
z?E3Sw;`yx06Oyhd+O5@?=Gq$G9|wLtPP^V7cz?5mzVkTHs`aNBBVmWXDA&F38=ucl
z{Px>#_}hQ}4S)aJ-*MmF%E;ZjAgU(c8&;Bg9tXHq`yL{|C1h~q`pWKL>>iM#Lc3ig
z1KK^{hLO6b{43$wPJqu%9Li!UhwJ+WJEl3eh=^K3o9_64T{WdOwCXj^weqsD8Oaf8zTc@PN&1AAX
z_ruo)j^T%<_fvGr1dR<%gA;%MM`#68X)-HK{}WdNYe7LpqXUgj9Q;5lhmGgORdxc9
zkcCa~VfP!#VsbR>N*+=oioJt7QW=+&HnT#@^Thi;o1~*r{LSkVm_@x`8p&xD%?I7B
zR5p+Eik11Ag)n99{`0YrE6e;iTGru``F|jgT75RVP9^u;(2G-nGF$W-u}btJ#rZwo
zjBefr+8&~P?(nK9pOL{~r4;j;-2KLNU3fk(P%9?UV?r$zh1f+heIBrw)r;v|C#oF~
z(`!(+Sw-aNH0ryILvrb0M8<}J*O;&prL``l$c6W!b`X+EC}+Vj4wLyYVLSw_zM;hd
zx_JUv!JKqxDoy2JYFfh(xKyY3Lh@BdoD-aKAD8-=Y`&)U@T{Fy51$b6gev?QB=vG{
zk!~fm+pkJKsW8aTj;LT*{4gb|en4v|^~PQA`7;u;d#4z!9XQ|K@ZK+o+#ra@jyDG|
z4Ihsa{k|bl%shsLMdZ>z*GEU7#JbEw@;I$L^gPd$nB!!E42(f9#lsGy9%i&Z&jZKN
zEMTJ0><$}!V5p+I=c9*2Vswj-P3eUpKM?E;l79Ezur-O-B#Sg102e#;1&2)_^|RI2ZpGaFlO}kCRDQYcdrFhw*|YOH
zf!F=s@KYl^TW;I+piTx?tP`*Hds^{(5&P-vD=$MnzYY08|&ap3A1LJ(iRq#d^?)
z?SQ#~Cs|(I{Y{hbWy|bJ+>chNisrHP-JgOB2q)gt*TbZ!93L=i8HVEL^`4oauLaE=j1){43XND)UJ`?8jy2>@W6YS?yrQw1A(lzK
zv&uX(k+e6bals2mAl8Si3WXo(Ime;XE5ktG=W~*ob`(!7SZl
z$r^R~o!C2VOs73W(7UXk!QfvG2RzA%vPI0+-YZGWVE4o+kqs7yC26Vn(Mn?|z{@N@6J@=;
z6ZZZP1WXp#vpS=6UPOyQicUfcKgldq0dcTY0i?bbW(A0nED3}$Sdu338<2{FDRwXw
z0UL#yc~ZEd@`1)D3Y}>DK&1m+EEP}8cxTc3m*BpD=jXuZv*Y=^fO3P10V%4sk8=t4
zngeY1i05QkU=rIO$oy<7G`%~T=wf~GpUnio8DeDNQxQ0@z?4-s%t-xhBbFb5r#wWb
ztWwk^jg#TZ5q}-w+0~UXh$MLa)-mhB8^0Tfw0T?C8lK8#=AnNX!Z8`C;c@1aR0iZ~
zt?6rH^7!t|1icneM~DhTt3`Q}uqErEqLTGOooxQ6)%7wMNHOArhzJSMtc&9*Ga$EtsqMr3$DWMuihJ|Ik)?fC
zzj56cp3f(0u`)?2=39B3)fp6p^lmJ`MB0{C0_i4Lmgq?0H7yG7!4RMi50}XD4L8O4
z7ffJOltMr&kjDWX0HJ|!0B8Ue&=^zNr<@q6P1<*u2f#a(
zXzK*S7nX-o0o?EsPJu$k(g<6>NJh|#KYC#v<
z_`4`Xt!#O8!951fJqp#x5VI)e<@_MV{9IXbA4RUbshxj%o)A@x4RDz(!RR
zc>07(Grz21peaqujM0yF>tu8%D#kd@=flA*TZyM7dlH~e3zcz+ud1ZSFLgAzTnXXR
zbD6h-ml0g-UL=q6whl1sLgVJIo|5MQ{ZLnZSg3+{*P0{=DNjJfGa)bmssL5ISgF~J
zn^MA*R-g+<%=ZLQG<}b=fYd!13fkp;{wJ6p$`8Dyp|Z)aDm_s7fkr2Y1cHvP9T)`Z
z9sOopm*Vp|@bPiu<9UJ04Xzzj2Y4uo0HuZzF{bdr-q0Lw4Pv|3gr*&P5~x8?*sC|FY8R0fwk{~jQQE||eQmV%i+T2A-8_p1HR^hnJ(=NFVYk-WNG
zf!5dfvcvvic1+V_OV~f~uT;hQcZ$@5H36SJpk2)Oaa|WaTg7oOq?x>?RNJfpe~`=4
zlpYb;Tl|n7-%~sg!vWH8aM2aogGrOkM90G0MyJ~iBuf&POn4LF7t^DIEiPS!0BwLa
zi&I4bSMP!;E0Hc=g~Kx>u05x%&t}OYUpvBRdZlDalys-+{lKa(o~18Ru76Qb!K)U2
zrY8C3x(uOXGA(0t^8)pb-fy%T6B17=gIZXHT3<(H%xKVB8LbkoqvGCx)Ma0fF*|z#
zNVV2*oQr)rjsv9{6V^N+Lq+VLYr$QM#YvML^lNnclrnu)=g}<0q7>I1(~L-$K_=51
zCMOwV;C6jCdr$yz@nJyO!Z;>~ngEKnV2@WuXA|Di?|Il(a$-a=&vY_c9);Ium;(r<
z)W`^j$iNCO5&Iln7_>UPCXk{XNS){Tqx-FZl5sbuqfwgG>PtwLj~ZxIIIJIEXUI(W
zFV}(?gBxAV@v**WIvD6WaM3JIWn;iWUUciR;;_^&(>`}-_v`MEdPCS+?zI%C*EoQ8
zc~dZJulK;j0KC?3TVpNMhfqOiP;DS;01hw>lnT_Fa92XDgu55i+R%>E3U&|cg?oED
z@pv;PHr8j3^_j`bP**5H$YcXhs~ZSpt_9-KB9rUYtIc7E`Nb543C)uN<;@sfcECP{
zcT#naX&=r$+|lcm!qZ%al1W6OxN
z?Y!_@At0^>gM=|qAj3SmItUWwn9iQmLZHUpNMjPm2u$I`J{gEIiJ>9@RfsYrCIT_+Mpr&l43}pU+vhE5+0%eR9Bi|d}x&mDj%q9tU;p#l@1g%kXDd*
z+Fd7%TXA0l_jBO+d2iR})2s$u2a1Y^KDjrHy*5OBAVLu@k?UV%mXbztB#4|{rIc{D
z4yrUkkgVyyzRaPvv}Brc@zLwU%P7O8vz5q0ANzUX9r-w^4YB>DPy~0*dQdmwvzywr&3i(1SFXBwbvzQ-XPFFv#z+j{ZZQ!aN*SIhK
zyxGx4nHj`nt&JjywL^R_j6!hv9}1-Ygi>(B2tP5)ejx*w(G_1zg5O!y=_>*4#`}==
zwd;xVDtJ5X1LI=-O&q_~8W%xmcsvT;9+nby)QZa9v!G^~8Xo;XdJh*bFwe80%SW|H3vs8cNVP{yzc
zVZSkYo0Bi1V1Nja8!8RZV|q)76qN}_VcbW0$#{|NOQrItmy$l61%p$3(+IeAtpo
z>p^J35HVoI6=5u<5-H2UCr_*2pH&Evu-2l_a
zh0otU@!Q7}A0HP!KA!k|UikdH@cg{+ye`RR1|EyVOh9@XD;avLANm07_;=w2U>K
z7p2^7o*gmC52MqnfatoKpw-n8B#VNlEa{V<-Xhc%g}S2QUbd=G%^eoReK
z>Dw7VcTd!7h{kK+AZ3>n?b60|K3eRyx=cDGp`4Ti2$|nYcj1z8mpN!Bg@f{n#s-|u
za^m!R$^s`0q;uX!wv^00{V-HQL^KEA5ri=%`l$UclH3bG^ZOcp2NEImM@BbmXxosKxZf
zyhH;u7&Outl6XZz61iYHCT(nGeh#Lf)qqqnAQd2rxq-NY%UI=_?$1!xEgCiMbpsT%
zEeuoROevi?4299h6^80Kv_>m;`s9CnB*?nzO=JB^@25#m>XjnqovKe=Vv*!GoRzXf
zKR*<^reCF@iplIqty8KnFbdpYYjftRS;6p{#@7kNB!wBMr|iYVeX(#Z-%vXogg1zb2ak9e?SAetn|lK@EVd6>pmOt%WP%Pl7Q2Eo^zwP!YAA&8DdnEmNQLp9TQ?&vQ8<7f#h))K1&dC}eV@@1Go#LR)`sN4sPaaVy@X_?
zfU^88GWI0C6sYx@2)~7~o_cvnO1L0#8
zkcae^Ng%|jcQy@3ofGEyS7w;&BSb2Ge>1@1;Y{$g!Arpd;3Zc3uNv0>>9C27j--EH
zoV(yt7|>`->-ZW-PXAPnus(b@jU{6^Z=n)2J6Gbyl}L^ICV|c(ZHpsv&
zH+exJ@Apw#*ile;ppf9GWLG1!N~lbW!cE3c_~sGo{r+D1^>Tg-2OaoCmDpVU(8mKa;R*&_r#!8h2E*;F>h2Emg-vEICzq=z|;ucxdW-;MUQjZOtg7s;=Y#zqO8EzOy4n0KhDDr
za_4E2^;(UQa%zYDP^eRH9;Kum{T?3O%0M+AOD)ag_)X#`s`J-}v@aG)o^reZ53lX<
z`XY%AFdYFP?rCrj+v+Ey`MN-b%VE>KKY>&j%7baj)PDw29)n-KbhgGMoN3Db;_v(^
zBUQP)tan-5wgyDx9t@S
z#l1T`578#`#eG8qKoMgU`^>agXssr3i_aa)ccaYyeN*c3lF3Hdf;QY6YSTq!MJZN7
zZvK~wQWb5Gv!)f@(;{nasK<%(d~+amqSnK%R#7UWaCpvS72Jxwhb8q{9Y@9K$(g4!
zQulq&;Q?-V=aeVB-WNVUKdspEJbjNaAZ0@V03zNmcwyX6A7*&^)QGIJb(;(YIDEFn
zQ~W}{me$+6M*L{WOwzr1kfu~e!AahenBR%==5UwXe^Eiy{IF+W!un~dB`Z?Lt=*-J
z9pe&=0gPZ3hMmKifZ;%js`;8K3K}<0S7k_q8aR*=9T<9p#LGYBb_a!fLA;Mg#41bF
zJs1%`4T5zmI#VCf?kbU3*H=HDqz*O*Q5uYtRxp{Ne;#^Eb0(@y<^Ujc88ex0+yl?g
z%K+5J2R?rLz~BG=X>8Qh@%i*}=i7R$h%Eq$E2D-t%g;K6;q@a6L|OZR?DE911~eaPI8-Sk-H0zQmvMkkKmPhk{H<0?PWqn@>s~<6AInsQ}aB`tr
z0#fl|P%eVkT_6~VG<^`Ro4~BN?h`}C#z?ipBs+}+wFoL($UtL7D=?V|t#Y_vpsciX
zX_R$DgL95vR&Z=CvPWrhgfh6i&(f%?dkj`-oIB@1TmV#vG=QtOJXhbCze@Y|t9XI1
z^;PK9awH9~>et9?Q1Zs41&T1eFC2$uFRP6eYVX&ff6aaOCH8bh-6DaxIXRj#0tZvW
zP4-c)%GNW5TIUHvRDeR>QeQ!GfV4v$K}?P8?HT%eb)=`~6=
zZOZyh0LT_gzxR6?Q)E|sy_fO@G0PjnxVp_&WIg3{F2CO>G9OJhrBP!PrRp(Sin_n#u{}6
zLoo)=GU(y!YHbHvGf7`CQgbaUbpe{VZ**yYvb0ZX!BLBcyV$d2KE>+86laB0A!2Jn
zFjA#fa4{e4aW=fYz2P_ys0#Xh;raZ)^Ya7OXSMI&8G|ty-IX9~&qa69|
z^QEwe+`Fy`NNuJD+PObl!^3Y^?Aa|3m`_WOx9VGD8I{UKG>FX)OveJR*9im&PR{Ul=}NF|W+J{Bi^69q3
zY={TCExAH4FaVUHpr+trT6!*Oaj1S@QbdwOFNe>kK^0udlj<63JzNHMpw!y{&dNpq
zQZD|xA`x1CKW4T~CrScIUrC6UAu7F;sQxAL)o(!5GImp@x_DhdG4{SSJ~Zo0Th1X`e{^{Q<1JCufN7Pd??DF-u}BLLszTfTMXB
zLRN1cmm(OlC3QAi!WnAz4zkFB))~&h%BH9KNoQ|6sC5NI>WNVnocu#9ErA
zsg>qwYIa|m-|H}-Qx}jb{^zevUbhJ$Xa!Yp;T6r#D_M+|4#WXeK}$f&fY*}B1duw~
z@iS>(>YZ8}&WofkDJ_(~(d1`a7`@hIEg7vVW}Qi!`7m3#u0_sk(vEOS8gQdMhwO)@
zI!33Ic~2d*K|;Z3&6wID9w5Q1gC(TUgsoxcCcliv=Ta
zF+kQT<2VY=qvCN|g6I3&f$Q_cxF6<490PaAdR1pQ0xLjXRzzae8kWB5nxEpQBhi&g
zr9boZo4lsXubsbHp(wepY$|pl$F6{>q7u2J$PMpgp$Y~Z+Ps(rPlBy44xDWb8IZ1C
zqae76QOiKT26`#z)w;xl6cqxGur-8n~VoCh_t4
z#7D^bK0feyb=*U6-_ex$d13wFUK6(tAzGey_d
z7fn`$={cKwioPguUIr)n0-*2$#5DQPl7tbF(CqgrDK^k!UMbnZx?~BSvCR|th$axM
zn~!dOk3O6-f4?VvQj+KM>IPHD|0W=%UnT*$c+eYcwunkin()Ge#_SQ{EmskaKPf7KKXz{n|zu!DK)*OxjBd}0sd$y$VRXQy3DSG9zG5jT++eoB`xEn3viXJdbohvmaEFp
z$_^j^77OJOw)bupmWRoGf~B!gpdTQhJd?^quu7$Shx7rxO(w>0FjX0kRsoU8_8GXZ
zCjEO@MElV!p|e#&t(Fy03(y+jI0_zT!~6RKqu;oD$9;E{(b4aYAv!~NR+5|prjSuk
z4ji)Rbc*}thyP(%`qvs{t?sCm?zrTS_Lwa%R3(9MnMq-0fVYWwqn3cw0nk8WL8Sqv
z?#Z7~@@i{p9|NTm`pvj2qZS{Iy4<&yj>-%vY$dclFtMx!@5NzH1yf_Kz9gh+dcLOg
zc=h?LV%6l4_j3bNfpUeqq)^xvhlgsu#Z+v`S
z_;_CCi+Vos@%hBv2fAPeDr|D#J@?}!YGZ*Y3*+e1u0QuIn8c9?HVuA>=~{y+XS9nY
z0JAcxnmmd&&tZ~&y~@_0KUy<6MZNYEsq*E|nm{W#Zs~>L3H6UGYsCigbZ6x`4|`t%
zCXfIFy(n-vGy9p1B7AvoH@}f!_Nz`1=RqaLJQ)jhaKq6KJkEEV#~Y6JKr1H-HxP>T
zrgs79?H!)pvE@}en&o&|s<%nBmZc$NefkAR$=1U)hpXSDCM=KJ0#oKgjDd`I&9S7w
z`WU_rS1=HO?weEUZVL~*CoIQ+3?5o!F7kK=;+`t={r$dS?7tC6=|A~r`sVff$nqr9
zA(|f?Q?`@ulob(7H(ksmTnk&0jUQ-@nzuK}9i8TR20!F>GIjL67{HaLFIxs0HJwK5R39bTsqvwc|iL
zt1(jt<7gJiUL8c00<_9_92M8wfpMRZapSr=u5zPv#uMy5Tr5vn0+;r=xtC{%8K{ab
zAoVBuP%zJcijBs!Aqh0}nJB4k{{WMwi>?
zU_-4NofP+N?_Y5_Zed1e#wZ1&3}`h6AQxUePUI$bHXj{JTkDgj?pa!53SD48M;PLh
z*Nk}2LX4Z0%kqDOG?&@@QGPKzE!A1}=KEdSe7z8o9KnRiVpp*U+J*N>&oIS=r^oLhCzK?o@}C3`>&
z=;A{OrQ>AcbGM;`D2erzo$+O3qAC~ET5;3^M{PLkfwP(i^k@$QQuRck1f+C82Bq_g
zq~Mg;7hmEEUoDmhELX<>03ZNKL_t)V&w29lz$V6>?(hgraoTrAX3A@;<1oK_-;&
z^A5+LolKsmJy`;1FTqatw0XNM>4Ip$vDHjx)ko>|0(h3x_g$s^)UT`-%sCN
zMALvf1wwHsV$6rWR+Hq|Wiid!h&UhUTxjD@3cmy*SO*Bt^rjMrt5(u$u^8QuTzTQ0
zj<|*$pu6C@2A-FJspsWDYHmoj5S9Y8W>*9^6bDhkMvd-2KN~Y;hX|nFjCzPqf(L+V
zS^HGLJj@rRx=Kai8%@EKQPpXs
z;gQ-W$=*^QU|nc>6Q*>J_Ggahg`$e*g^>~uzk;bbxYDH0+uk+zXt6)*tWdTtADiIH
zN_QNx4kl{Pcx?ux`rW-xDLp%rp)BjOqE~CXeqpC3BjqWMN@}4CWQQRVkLwk&HB&I9
zn+uTo<~p@KW>A&^NcS#3_oCWbV4f?H0qr+P1_}hN5RPWar-dy+uvNz02bc?Li-A;`
zopHfPO(4}6=UMS~ACP`Rb>LZVW6>`AzPQs$AoBx{=E=6CQ<>~w>N{`89|BS__M(v4
z@GP|r0o{uFj^g(ce*j2flX9{!g)`VGGEY-=Uy$-Z<$+2Ag>Mjbmz1u`k={F)fX0rIko!L_{rNMWZ)^poo-ui9G_Yaz}
z=t&mmEGb}tAV8UM=e0Bony8kbCeDHk-xuyX&ITQy&yJ4?raboi^NEkoPfI&=*`@kG
zLSLGjL|01OOlTvQHDxg6N}2F{i8FHy-lFv5pL1}PZ!a#{&iv5Qm6B#ZO|TyF?fLMZ
zt(gl*<(U6(4vpJjZZSL9dXpU64c!3MChgM{ytB!VmjZMPTX@AHDg767Xf^4sxq)O$
zgXRwr*%L)GCAZav^EmNnCmzR%vp#UNH`D~A7}eKg!cc+Az4qu4gIq~*Z~LA?c#(B#
zB=+eHo(PZRI!j|kzGF!-HHCx_?dL>GZB3txhf=Asof8v-IN5g{OF04Ypp#W9KhH+N
zL`j^Fy0cQEYPCbq;a`WhL}zq3W;z^@
z4wR(sfxwi76lb+o8k)=ds>%9lD<~Kq7824v1%{X`#4ouAo|oYHwAasT;J$Th+g4am
zz$u>C<%jIyQvERkNMZJ5NF+UxqIEWQ1%d`UXcS`829^dY{vgou?wGU>;9`=T<_Dn~
z-Z17ZK3r2$*tgQYe7NH^8)8-v%beETX=Rl`=-E&oGHy
zmM~4Ola-3;wob{`79Sk4($xD!Iv~Cy5&L~sc*Y$sMqDw(4pvj@l@^Lv=tyNH$`QQ7
zh3)ff2BM}^Eh5Epx@8op`qhS-C!s`augf)G(qUGYhlfmf5$R;Q{6K5J&Pz9)#h}R~
z+Xh)(+IO5M9&Zo4zrW*rd$a5Dj`Mure4GZj;#mx7AMMh<$v(#rB};qL#=p9%c!H@`
zZTdHXR4q%gl35=B_w)M0K+1ezJpXPEk9BeNVY$2y
z>mH)bT$?A^eh-LR2$=xF)=sBGb_O;vQtHW#?L5&OL`mus%hEn(o)PC?AM8BpO@|M^
zw^c0hFNz*DEQk@5;ovV0d=p5;VL(krsvbElT`etbD5RVGdZgfrA;C!X<&&z`om1TV
zbhqO509MFdngv>v!e}AyD~pwCbxHeloPdhUQ~i-2x{W@;=td!Dq_~9W)B(Deo)d60
zqudUne2utEL2q@c1=yz>q>O?(`)C+yzE3CyZro0JDF%k7ULwu_74y$BhRbDwL=M$M5ln0^%pQnw9RgO#$rsCa$i^ZE3s^9u6LF(h_7t6C&6
zQb}Ww`nCpS0x6jRQL{QlO`%|C1c7TOU6va@Jo`L?p~_yE&99`C_KBb0ms#4US(;7X
zcc3xKK-v7W*y?jy
zH96mTKJa$F;j9m|`amfsT796F6HqeCh_7K*CE%)&F(#gi&`V-6&2@9Kmm=He
zIm>pPedFaK{CbAV(ms5Z_L2H9Q~azL%$(EUI)mc6|5;^vTcr|YOxk|s3(y`KpnJpk
zk{S`<@){OG5r!zEM7bJe5c_*nZe}pruTS!$FJ354
za*>%rl%jaOv~?C->k6C5<0{gZ@fFd_DMeEWcVw&k;i^PgYOzC}UzD!dMK?n}BqeGn
z_6`+I-6|IX)B*-pFp_OVGam@w@Oct%FPC~jks4{jsuKVtgenH(Q
z)ry7n9IZgK&Y_aM2LYn&XFkr;82+=G%#RhLSeolm38R_^XcS{!YI=sJAJu&mtk|T~
zR?Xrd$_s^AWtC6;=wLwGkuldmkPlhd56_G&AGIRch_bI-XE>34E8@~nUENrz115%f
z5pbcAN;(7`9fb%NmI@%AA*Ka{+HvAgLUmtM3F(g4GuVKs&MrCYbYeeTxsP`;xz_bG
z6CSBfZ>2>v`@W_rxFYs%&97elz#i*NF;IdrRuhYnc@Vi;jQ~}cT8G|CfVuwTl(zgtaD;65pIg|yXuc+q
z_eZYF{pgVt7ojiwX;oQ{Zum#LN!
zu`@^j=hvjy)jP&>&!A+3SOvTe%;tLS_7sZJU7a5J9O9Hg+Lv>kCKK?Mp5$CSN#q}6
zs29G&QjmiQo+nbdoo()3p`mGT=oXj6Fy}~o{R?o+e;G*W57ukKtF&v;s+99)wusdg
zeMnC4ZKyh;z9;UmLp&8rJwl*nNkyPRNjgq3;AHZ;;&I0Yq`XUp%be`*Obu36{Sat3
zHcD>m;L)q2Ujv`dfzM~h=hGOe>-GZMv=Fz(eh3X#%0N|2&Vw>jRg3}L1Ln6HmfojK
z;HD@I5V6B|ap(AwDI^eu6~%6G*EjmGOPms2hh6fb=oN*Usr!&mGUenw-;^D|MzLqf
zU~5Xol5Ap@`2BS^WWMiWvA6fw<$bd^siWWbTy&z;ca`j|pEn}H4S;oA*KLP^=!kMK
zg)#h)Gr%4;Axqk8hY^ZMR9&G4N|0g)t?I8*N(4(mV4fO8nonU=FXWBbaGr-V>k%T_
zlf9IZgo123T%47ub)XhTFE(T#7jvsRO&ke!D<9R&OuTBnj(dB1~M%92)#&`=N{(q
z=kYaVqvCE(NTDz^|E@Ktc7Rkpn!Z(kZQxqd6@bcpD@@xEXGvRhGeIxVkUIc*+YgX?
z=`f6poLEYb8_Ucm!Shv>JUKP16`T$6vhDls?v=a%=^zph7b^2-nbGPZY~I4PXGV;De5O{gOn+*WxgBxl+?jwu=IJV>0vKBaa>xUjt?w#oKPHe$ud@s^`C}%d8xr-+`JKiTX84;v7;pt~
z1-Jp_29;)CN63r6mk3$i)_Rb^)RL;@P9<^`&Sa}J=cncpXH5*Y_x-S^T-w-gS`k$D
z$a~e~Oq!CoNRMYc8oH*KvR!`J=qI$3dqk7!F%N6}z
z$*O_1zKQ#oW%&a)2{p~Lf~kjizu7{Xj~9l@yXW$H9CiWBXU3j0P6fl;*+1<+N=f=Y
zYl(&jlWI5~=9zYd!l-$uJX``^9v`1$q@WtWD{Bh*7yjo@0jXD}R)6Q~e~$R^^1IkJ
z_|VU*q*%&aJV~=_38_D7$^D#>aoL^7N*W{DH)1BBcsfui$>8eJrs~twi*C?Fo2h
z#AVX0!)JEaXaT=6JShMmrTg~(&-KK<$Y2&$
zngm6<%*3thdExndx*qw0j2k*UM%m4eG4zsU;+DY-PK=Vm?E9!!x-p8A!3B$}P>VqE$gJ!z8S=ppN14Kc5nf?hAHiExjPO
zR`X_^1IHLHUu4`DO|z@Ll{rj7-S^aR9tR$e2i||ZXh?-UDHs^F-i)_q|`uA;qAM!LeN>VbN2aVA$OYFRF
zJg?g&z9w5`3{-#rYqL}~1Ke$bC$8&foxZ~Oj8xHCakHW^(4C#?MNvsnNaq%BEL0Rw
zuEPw?3GM<=Y&upS()@|Tfu>HN>6=VlZA245MVAfK%YogaKr!oemBtu8IC20*B;c`z
zQwBzs%qDKyIr#AVGKPH8OQuxjVbOpTPcJYP_xrE~fRs|5MJO1hSpm8nP?z$POZh3f
z;;2qQ#Z;C=77>(^0%LSd)WxNVn*5n@AI`yBq%JZ5E%rQe1xmFp$9e!&K&1l2z7wpy
zDl@f|K-=;dF90>%heg>VE{nZdlPuH5nx7Thwbtl#R!P8dmN0KY4>X@eyKjxPCDClg
z`FYp`>nu`dGAZhSJu?CYt1(llV#sf87jcm!&s5Hm`)Va2-NPp+z>g2!C%firBs+kL
z()cpv;F5q;gx}B_#M8plSpsLisD@T8Xz4uOa0VMyPBf1?w`aBh!;`49UwejotYl)_
z_xb|Dl!%Iu;TMy3GRwDTDoG$pR9vCG(hEQWb_G-^e%5xCS3dAlwu^++FAkXTAq-F2
zb01&y?b&0**+~|(0A{KC95rLIPz>GEdvjW${x?ed^qYVB`oF)!4(#)=q_
z(QdND7U5MCh?o}gKFOK@u%q`IyV7Fqqq2>hq+O#S^Sfc^Y_fC`V4%k-F*zI!G?r35X}#Ypx0wj#ynZHvP&
zUxiAavOWtH7+`a@+nkm$lfzVa9tb(^aSx!7VjyZrK^5^Cuh*CQ0k}nZs9)uUiHJZI
zPTlpv%prhcDF8bYHRHb-qx0+uv4u*P{4)-}ey>I+S!)C5u_Rm?4BqINCcw;g`tx<1
z2hPWdx3@REzrW+{*E`<-?HAtP-|={VU&m@n>%~i~u?-p=NO>Pve8yu4ga>HQWWfx?
z)%mq0cGlEW^$Mcw{<%x}(Onrq55capFZ~q5eXK}&CQB6J08SAeQM}H;zvjYIex7lM
zY+81#19?27ktXGb^Vz|XWqsQM8;_$8tsFF$xH2R44J$Q)I=hZuBt_-!=Iy!9=ecl5
zsz}%TRu{`Q-+*Z&)IIEl)DLBr_NQPoV5f<#^|##^_l5hqa6O;q#qswtBG5}^>u5U<
zoTtb12Y~W>h`D%2i_XKx7=q4<=FHTc7^&(#guzOwKdS+zHXnx{V520NA;vO#IwAc~
z*|%_7N*$Z`SNNb(A!v>xP?wBr3@v@gARaY5hE+L0FcKrRJQO^Q>T@tT1Va^IAMz1l
z1oIUKWz{7QmRY(T;3>&bgON(frCE)!
zsxFW`nXVLxd(0VAa74-bCRfyS_@%j@c>vs?RG`Gu>#1ZNlbNO6JyKa>zq3-Jt8Yq#
zW)Ik6cHRFJz_I|PP
zyu#W+)C^Sk-2p4TCGhB2YEy=g;4S@FWr2EOw`5}Ra~}ky`eiz3oh2X68$TF
z7y&6nEr{a271vWys^UH^)lUXPq;g~YZpzH93Jq2_-NZOHz+O;SX#!*(C;ce-(3_w1rEVvxOq
zE5+5r4@NHSyKZNsR6x=Jy-laWyQp6C}+Q%j*tK#JBv
z---dLqZJ&rnmJ!(J1K2gRTq$2G2jIhQmxvpN}v>s%Bx{~PPzJGw5I1R{80yvq1(nQ
zBA&*@5YjD`jbf@-D-LI*-rnEv_Wp)n@9%j3^~>3*_mnZGEA&EES+dTl_Lx4lAt?=v
z2L`VIgOFbZkP7K;$oe7`aF!3Qyjzr?0x8um=$?J?Uul}oN%~q=3W2IWKP6Z{2U6h!
z-GG$NDRxZpB2)#@`Ql%b=nO6+)pT&+Wvq`v%p^V`&<2zut!`e~XcJVHJvYtA8&m|V^$YxW*MlMF^?x$bZ<)yIW?KTVpON*oI@
zS}S;aG&~*$-rgQ~yuEEeid|AW*Hpjvx`<2q(iKcpx;XLi6ktmAi;83V-5f-jbn(Z6
z`}kWnIp_ozHDpvv-ZZE4QOJv8J`I=q&i55ScP5G<7&I_k9isEYD=UpI1;nTfvpM*7
z(9@Jb`LQOe=IygEN)ll8&Zc-z~;56$x`6FjZp7EVA)Cj+ID
zpS9)lx_U>ajy{Yz%XI~Ff=ihOGKrtML~%*`5{SwvZ^^iyeNju|7yq1-WC>f;b?i56
z(qIdpGHIVW)uxhT)|0$ecljc1a?KRBAest(|9C!sR3^m1x;UUMehF+ZG4ZBAvu0^!
zf;bDon@@+lH*ZPxG^O;VITF1DP!O$-x7bIbRwOQ$-%mfLl0Ba+UP
z(*FXG$~QyC_L?TInpup01fY`knmwIoF+vNptb=-D6yhKS1rIB)<(raxh`#MV&Wx)O>Gun=fOfvWkLIJ!Hq3U>xG%`{gxq#%zkqRD6svX%q6se%d>x_a1M&mPly=hU%W`HVA6R~LYi4OR
z7DE(8yo&)EtI&Q5&veES$Mjo=F-epYQiwo<&q-iUo9uj1
zNVVaZHvf5w*B+Er(o2;5fDu5Fq@QN{;i(E(>#^^-e$`f&Y%IJws!18L9!~N5tgt1d
z5;4x>#MvI()%ZZ=hC+2Rzj~%35m