From dc4f7d4dd307dbc2fb19188bfc484c221391e296 Mon Sep 17 00:00:00 2001
From: =?UTF-8?q?=C3=81lex=20Ruiz?=
Date: Tue, 23 Apr 2024 16:27:00 +0200
Subject: [PATCH] Remove unused file and improve documentation a bit.

---
 integrations/README.md                          | 98 +++++++------
 .../pipeline/indexer-to-integrator.conf         | 33 -------
 2 files changed, 36 insertions(+), 95 deletions(-)
 delete mode 100644 integrations/amazon-security-lake/logstash/pipeline/indexer-to-integrator.conf

diff --git a/integrations/README.md b/integrations/README.md
index aa860f8a439d0..e141452d7a8b5 100644
--- a/integrations/README.md
+++ b/integrations/README.md
@@ -1,18 +1,18 @@
 ## Wazuh indexer integrations
 
-This folder contains integrations with third-party XDR, SIEM and cybersecurity software.
+This folder contains integrations with third-party XDR, SIEM and cybersecurity software. The goal is to transport Wazuh's analysis to the platform that suits your needs.
 
 ### Amazon Security Lake
 
-Amazon Security Lake automatically centralizes security data from AWS environments, SaaS providers,
-on premises, and cloud sources into a purpose-built data lake stored in your account. With Security Lake,
-you can get a more complete understanding of your security data across your entire organization. You can
-also improve the protection of your workloads, applications, and data. Security Lake has adopted the
-Open Cybersecurity Schema Framework (OCSF), an open standard. With OCSF support, the service normalizes
+Amazon Security Lake automatically centralizes security data from AWS environments, SaaS providers,
+on premises, and cloud sources into a purpose-built data lake stored in your account. With Security Lake,
+you can get a more complete understanding of your security data across your entire organization. You can
+also improve the protection of your workloads, applications, and data. Security Lake has adopted the
+Open Cybersecurity Schema Framework (OCSF), an open standard. With OCSF support, the service normalizes
 and combines security data from AWS and a broad range of enterprise security data sources.
 
-#### Usage
+#### Development guide
 
 A demo of the integration can be started using the content of this folder and Docker.
 
@@ -20,16 +20,17 @@ A demo of the integration can be started using the content of this folder and Do
 docker compose -f ./docker/amazon-security-lake.yml up -d
 ```
 
-This docker compose project will bring a *wazuh-indexer* node, a *wazuh-dashboard* node,
-a *logstash* node, our event generator and an AWS Lambda Python container. On the one hand, the event generator will push events
-constantly to the indexer, on the `wazuh-alerts-4.x-sample` index by default (refer to the [events
+This docker compose project will bring up a _wazuh-indexer_ node, a _wazuh-dashboard_ node,
+a _logstash_ node, our event generator and an AWS Lambda Python container. On the one hand, the event generator will push events
+constantly to the indexer, to the `wazuh-alerts-4.x-sample` index by default (refer to the [events
 generator](./tools/events-generator/README.md) documentation for customization options).
-On the other hand, logstash will constantly query for new data and deliver it to output configured in the
-pipeline, which can be one of `indexer-to-s3`, `indexer-to-file` or `indexer-to-integrator`.
+On the other hand, logstash will constantly query for new data and deliver it to the output configured in the
+pipeline, which can be one of `indexer-to-s3` or `indexer-to-file`.
 
 The `indexer-to-s3` pipeline is the method used by the integration.
 This pipeline delivers the data to an S3 bucket, from which the data is processed using a Lambda function, to finally be sent to the Amazon Security Lake bucket in Parquet format.
+
 Attach a terminal to the container and start the integration by starting logstash, as follows:
@@ -56,7 +57,7 @@ parquet-tools show
 
 Bucket names can be configured editing the [amazon-security-lake.yml](./docker/amazon-security-lake.yml) file.
 
-For development or debugging purposes, you may want to enable hot-reload, test or debug on these files,
+For development or debugging purposes, you may want to enable hot-reload, test or debug on these files,
 by using the `--config.reload.automatic`, `--config.test_and_exit` or `--debug` flags, respectively.
 
 For production usage, follow the instructions in our documentation page about this matter.
@@ -64,55 +65,28 @@ For production usage, follow the instructions in our documentation page about th
 As a last note, we would like to point out that we also use this Docker environment for development.
 
-#### Deployment on AWS Lambda
-
-##### Creating a .zip deployment package with dependencies
-
-To automatically generate the zip file, run steps 1 and 2 and the run `make`. If you don't
-have `make` install, you can continue with the steps to create the package manually.
-
-1. Create and activate a virtual environment in our project directory.
-   ```bash
-   cd amazon-security-lake
-   python3 -m venv .venv
-   source .venv/bin/activate
-   ```
-
-2. Install the required libraries using pip.
-   ```console
-   (.venv) pip install -r requirements.aws.txt
-   ```
-
-3. Use `pip show` to find the location in your virtual environment where pip has installed your dependencies.
-   ```console
-   (.venv) ~/src$ pip show
-   ```
-   The folder in which pip installs your libraries may be named `site-packages` or `dist-packages`. This folder may be located in either the `lib/python3.x` or `lib64/python3.x` directory (where python3.x represents the version of Python you are using).
-
-4. Deactivate the virtual environment
-   ```console
-   (.venv) ~/src$ deactivate
-   ```
-
-5. Navigate into the directory containing the dependencies installed with pip and create a .zip file in the project directory with the installed dependencies at the root.
-   ```console
-   ~/src$ cd .venv/lib/python3.12/site-packages
-   ~/src/.venv/lib/python3.12/site-packages$ zip -r ../../../../wazuh_to_amazon_security_lake.zip .
-   ```
-
-6. Navigate to the root of the project directory where the `run.py` file containing the handler code is located and add that file to the root of the .zip package.
-   ```console
-   ~/src/.venv/lib/python3.12/site-packages$ cd ../../../../src
-   ~/src$ zip ../wazuh_to_amazon_security_lake.zip run.py wazuh_ocsf_converter.py
-   ~/src$ zip ../wazuh_to_amazon_security_lake.zip models -r
-   ```
-
-The instructions on this section have been based on the following AWS tutorials and documentation.
-
-* [Tutorial: Using an Amazon S3 trigger to create thumbnail images](https://docs.aws.amazon.com/lambda/latest/dg/with-s3-tutorial.html)
-* [Tutorial: Using an Amazon S3 trigger to invoke a Lambda function](https://docs.aws.amazon.com/lambda/latest/dg/with-s3-example.html)
-* [Working with .zip file archives for Python Lambda functions](https://docs.aws.amazon.com/lambda/latest/dg/python-package.html)
-* [Best practices for working with AWS Lambda functions](https://docs.aws.amazon.com/lambda/latest/dg/best-practices.html)
+#### Deployment guide
+
+- Create an S3 bucket to store the raw events, for example: `wazuh-security-lake-integration`.
+- Create a new AWS Lambda function.
+  - Create an IAM role with access to the S3 bucket created above.
+  - Select Python 3.12 as the runtime.
+  - Configure the runtime with 512 MB of memory and a 30-second timeout.
+  - Configure an S3 trigger so that every object created in the bucket with the `.txt` extension invokes the Lambda.
+  - Run `make` to generate a zip deployment package, or create it manually as per the [AWS Lambda documentation](https://docs.aws.amazon.com/lambda/latest/dg/python-package.html#python-package-create-dependencies).
+  - Upload the zip package to the bucket. Then, upload it to the Lambda from S3 as per these instructions: https://docs.aws.amazon.com/lambda/latest/dg/gettingstarted-package.html#gettingstarted-package-zip
+- Create a Custom Source within Security Lake for the Wazuh Parquet files as per the following guide: https://docs.aws.amazon.com/security-lake/latest/userguide/custom-sources.html
+- Set the **AWS account ID** as the Custom Source's **AWS account with permission to write data**.
+
+The instructions in this section are based on the following AWS tutorials and documentation.
+
+- [Tutorial: Using an Amazon S3 trigger to create thumbnail images](https://docs.aws.amazon.com/lambda/latest/dg/with-s3-tutorial.html)
+- [Tutorial: Using an Amazon S3 trigger to invoke a Lambda function](https://docs.aws.amazon.com/lambda/latest/dg/with-s3-example.html)
+- [Working with .zip file archives for Python Lambda functions](https://docs.aws.amazon.com/lambda/latest/dg/python-package.html)
+- [Best practices for working with AWS Lambda functions](https://docs.aws.amazon.com/lambda/latest/dg/best-practices.html)
 
 ### Other integrations
 
diff --git a/integrations/amazon-security-lake/logstash/pipeline/indexer-to-integrator.conf b/integrations/amazon-security-lake/logstash/pipeline/indexer-to-integrator.conf
deleted file mode 100644
index afd7712413ddf..0000000000000
--- a/integrations/amazon-security-lake/logstash/pipeline/indexer-to-integrator.conf
+++ /dev/null
@@ -1,33 +0,0 @@
-input {
-  opensearch {
-    hosts => ["wazuh.indexer:9200"]
-    user => "${INDEXER_USERNAME}"
-    password => "${INDEXER_PASSWORD}"
-    ssl => true
-    ca_file => "/usr/share/logstash/root-ca.pem"
-    index => "wazuh-alerts-4.x-*"
-    query => '{
-      "query": {
-        "range": {
-          "@timestamp": {
-            "gt": "now-1m"
-          }
-        }
-      }
-    }'
-    schedule => "* * * * *"
-  }
-}
-
-output {
-  stdout {
-    id => "output.stdout"
-    codec => json_lines
-  }
-  pipe {
-    id => "output.integrator"
-    ttl => "10"
-    command => "/env/bin/python3 /usr/share/logstash/amazon-security-lake/run.py"
-    # command => "/usr/share/logstash/amazon-security-lake/run.py --pushinterval 300 --maxlength 2000 --linebuffer 100 --sleeptime 1 --bucketname securitylake --s3endpoint s3.ninja:9000 --s3profile default"
-  }
-}
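For reference, below is a minimal, hypothetical sketch of the S3-triggered Lambda function described in the deployment guide above. It is not the repository's `run.py`: the `AWS_BUCKET` environment variable, the simplified OCSF field selection, and the "one JSON alert per line" object layout are illustrative assumptions.

```python
# Hypothetical handler sketch -- not the repository's run.py. Bucket names,
# the AWS_BUCKET environment variable, the OCSF field selection and the
# "one JSON alert per line" object layout are illustrative assumptions.
import json
import os
import urllib.parse

import boto3
import pyarrow as pa
import pyarrow.parquet as pq

s3 = boto3.client("s3")


def to_ocsf(alert: dict) -> dict:
    """Map a raw Wazuh alert to a simplified, OCSF-like record."""
    rule = alert.get("rule", {})
    return {
        "time": alert.get("timestamp"),
        "message": rule.get("description"),
        "severity_id": rule.get("level"),
        "metadata": {"product": {"name": "Wazuh", "vendor_name": "Wazuh"}},
        "unmapped": json.dumps(alert),
    }


def lambda_handler(event, context):
    # The S3 trigger delivers the raw-events bucket and the key of the new .txt object.
    record = event["Records"][0]["s3"]
    src_bucket = record["bucket"]["name"]
    key = urllib.parse.unquote_plus(record["object"]["key"])

    # Read the object: each line is assumed to be one JSON-encoded Wazuh alert.
    body = s3.get_object(Bucket=src_bucket, Key=key)["Body"].read().decode("utf-8")
    alerts = [json.loads(line) for line in body.splitlines() if line.strip()]

    # Convert to Parquet and drop the result into the Security Lake custom source bucket.
    table = pa.Table.from_pylist([to_ocsf(a) for a in alerts])
    local_path = "/tmp/alerts.parquet"
    pq.write_table(table, local_path, compression="zstd")

    dst_bucket = os.environ["AWS_BUCKET"]  # assumed name for the destination bucket variable
    dst_key = key.rsplit(".", 1)[0] + ".parquet"
    s3.upload_file(local_path, dst_bucket, dst_key)
    return {"alerts_processed": len(alerts), "object": f"s3://{dst_bucket}/{dst_key}"}
```

A production handler would follow the repository's `wazuh_ocsf_converter.py` and `models` modules for the full OCSF mapping; the sketch only illustrates the read-transform-write flow from the raw-events bucket to the Security Lake bucket.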