[timeseries] Add initial support for elasticsearch #99 #164

nepython · 2020-07-15T19:23:35Z

Checks:

I have manually tested the proposed changes
I have written new test cases to avoid regressions (if necessary)
I have updated the documentation (e.g. README.rst)

Closes #99, #68

Need Help
There is one test (test_get_device_metrics_csv) failing on travis (it's probably caused by cardinality aggregation counting None from date_histogram as a value). I am unable to find a solution to this. In real case this should be rare as can be confirmed by the passing of test_wifi_hostapd.

PS: Query posted on https://discuss.elastic.co/t/cardinality-aggregation-always-returning-one-extra-count/243462

nemesifier

Good progress @nepython, see my comments below.

nemesifier · 2020-07-25T02:44:10Z

README.rst

@@ -385,7 +415,12 @@ call in your custom code (eg: a custom check class), you can do so as follows:
                    "MEAN(buffered) AS buffered FROM {key} WHERE time >= '{time}' AND "
                    "content_type = '{content_type}' AND object_id = '{object_id}' "
                    "GROUP BY time(1d)"
-                )
+                ),
+                'elasticsearch': _make_query({


why don't you change the code so that calling this functon from the configuration is not necessary?
You can loop over the data structure and call it when it's initialized, this way we make things easy for users and we avoid them come to complain to us in the support channels 😂

Yes that can be done and I have done the same for built-in charts. _make_query is just a utility function which will update the aggregation for a default_chart_query defined in openwisp_monitoring.db.backends.elasticsearch.queries. So queries returned via _make_query will always retain the structure of default_chart_query.

I wanted to leave the option of directly using a dsl query with timeseries_db.query, exactly like how we can query InfluxDB directly using the same function.

A full dsl query will look like this,

{'query': {'nested': {'path': 'tags', 'query': {'bool': {'must': [{'match': {'tags.object_id': {'query': '9a39a5ae-146b-4a50-b113-f9381b8c1721'}}}, {'match': {'tags.content_type': {'query': 'config.device'}}}]}}}}, '_source': False, 'size': 0, 'aggs': {'GroupByTime': {'nested': {'path': 'points', 'aggs': {'set_range': {'filter': {'range': {'points.time': {'from': 'now-1d/d', 'to': 'now/d'}}}, 'aggs': {'time': {'date_histogram': {'field': 'points.time', 'fixed_interval': '10m', 'format': 'date_time_no_millis', 'order': {'_key': 'desc'}, 'time_zone': 'Asia/Kolkata'}, 'aggs': {'nest': {'nested': {'path': 'points.fields', 'aggs': {'CPU_load': {'avg': {'field': 'points.fields.cpu_usage'}}}}}}}}}}}}}}

So, if we make _make_query as a compulsion, we might be cutting down a user's freedom to query via DSL. Personally, I would like to give user this freedom (this would enable him to just put a query like above in chart configuration and it will work) 😄.

openwisp_monitoring/db/backends/elasticsearch/index.py

requirements.txt

openwisp_monitoring/db/backends/influxdb/tests.py

openwisp_monitoring/db/backends/elasticsearch/tests.py

openwisp_monitoring/db/backends/elasticsearch/index.py

openwisp_monitoring/db/backends/elasticsearch/queries.py

tests/openwisp2/settings.py

PabloCastellano · 2020-07-28T10:54:29Z

docker-compose.yml

@@ -22,6 +24,45 @@ services:
      INFLUXDB_DB: openwisp2
      INFLUXDB_USER: openwisp
      INFLUXDB_USER_PASSWORD: openwisp
+  # clustered version of elasticsearch is used as that might be used in production
+  es01:


We don't need to run Elasticsearch in a High Available environment. Testing HA capabilities is ElasticSearch's job 😃 . We can simply make sure that setting up a multi-nodes cluster works but IMHO it is enough for us to run tests in only one instance.

WDYT?

I agree though there are some problems that I was facing with elasticsearch docker due to which too I am using two nodes 😅. Can you please check out if it's possible to run elasticsearch on a single port (I am not sure about this as I could not :/ ) and then I can adapt. Thanks!

nepython · 2020-07-29T18:02:30Z

Currently, things to do:~

Fix cardinality aggregation returning an extra value for wifi_clients causing test_get_device_metrics_csv to fail.
Elasticsearch tests are currently consuming ~120s to run. This can be reduced to 75s by retaining indices. There are other optimizations too which shall be implemented together in [timeseries] Optimize elasticsearch #168.

nemesifier

Currently, things to do:~

Fix cardinality aggregation returning an extra value for wifi_clients causing test_get_device_metrics_csv to fail.

Please resolve this asap.

Elasticsearch tests are currently consuming ~120s to run. This can be reduced to 75s by retaining indices. There are other optimizations too which shall be implemented together in [timeseries] Optimize elasticsearch #168.

We may do this in a second step. We make it work first, then we optimize.

tests/openwisp2/settings.py

nepython · 2020-07-30T18:15:48Z

Currently, things to do:~

Fix cardinality aggregation returning an extra value for wifi_clients causing test_get_device_metrics_csv to fail.

Please resolve this asap.

I need help, mentioned in opening comment 😬. I am still trying to resolve but couldn't find any fix, will question this on elasticsearch discussion forums if I am unable to in the next 1-2 days.

Elasticsearch tests are currently consuming ~120s to run. This can be reduced to 75s by retaining indices. There are other optimizations too which shall be implemented together in [timeseries] Optimize elasticsearch #168.

We may do this in a second step. We make it work first, then we optimize.

Ok

nemesifier · 2020-07-30T18:59:43Z

Currently, things to do:~

Fix cardinality aggregation returning an extra value for wifi_clients causing test_get_device_metrics_csv to fail.

Please resolve this asap.

I need help, mentioned in opening comment . I am still trying to resolve but couldn't find any fix, will question this on elasticsearch discussion forums if I am unable to in the next 1-2 days.

I see that you mentioned it, can you please write a detailed explanation of what is going on and provide links to the external info you mention?

nemesifier

@nepython how's it going with fixing the build here?
Please focus on completing this ⚠️

nepython · 2020-08-26T06:20:10Z

@nepython how's it going with fixing the build here?
Please focus on completing this ⚠️

@nemesisdesign, definitely I am working to fix the build, working mostly on https://github.com/nepython/openwisp-monitoring/tree/issues/99-add-initial-support-second-timeseries_db to try out minor changes but so far no good. I asked a query on S/O in the morning (due to no response on elasticsearch discussion forum 😶), it can be found at https://stackoverflow.com/questions/63590693/elasticsearch-cardinality-aggregation-returning-one-extra-count.

As far as I can visualize, the clients Chart sometimes returns an extra count and sometimes it doesn't. In the current travis build one test is failing due to this. Locally, that one test is always passing now (but it was failing earlier 5-6 days ago just don't know why it is passing now :/).

What I am trying to figure out right now is why sometimes the clients charts returns an extra count and correct count the other times. Is this even an issue on our end or on elasticsearch's (unsure)? I posted a detailed status on the PR and a request too to provide me with some JSON data or access to any instance in which the user is ready to experiment with elasticsearch (as you had mentioned in our last call) few days back on the IM channel https://gitter.im/openwisp/openwisp-monitoring?at=5f3f709f49148b41c9619185 and reminded the same personally but didn't receive any response 😓.

Right now if we are in a hurry to merge this PR, we can do two things:~

Disable clients chart for elasticsearch and state it in README as Work in Progress.
We can include the code related to clients chart in dev with a short note related to this problem of clients chart in a section say Known Issues and open up an issue for the same, while skipping that one test for now and adding a comment above it that this needs to be fixed.

This is a big add-on and something I have worked really hard upon so would really like to see this getting merged though even after a month I am not able to solve this one single issue (which is blocking everything else) :(. I am determined to fix this issue even after GSoC and see this module getting released and be helpful to others 😃.

Closes #99

nepython marked this pull request as draft July 15, 2020 19:24

nepython force-pushed the dev branch from 0900380 to 57b0ded Compare July 15, 2020 19:44

nepython force-pushed the issues/99-add-initial-support-second-timeseries_db branch 3 times, most recently from be49865 to 184f7a6 Compare July 17, 2020 19:04

nepython force-pushed the dev branch from df41092 to 8581ee9 Compare July 17, 2020 19:09

nepython force-pushed the issues/99-add-initial-support-second-timeseries_db branch from 184f7a6 to d383f17 Compare July 21, 2020 16:45

nepython linked an issue Jul 21, 2020 that may be closed by this pull request

[timeseries] Add initial support for second timeseries db #99

Open

4 tasks

nepython force-pushed the dev branch from c2f6ae8 to a2f88d9 Compare July 24, 2020 18:30

nepython force-pushed the issues/99-add-initial-support-second-timeseries_db branch 2 times, most recently from 6205616 to de11205 Compare July 24, 2020 20:22

nemesifier reviewed Jul 25, 2020

View reviewed changes

nepython force-pushed the issues/99-add-initial-support-second-timeseries_db branch 3 times, most recently from c789856 to e07bc63 Compare July 27, 2020 18:29

nepython linked an issue Jul 27, 2020 that may be closed by this pull request

[enhancement] Prepare travis-ci build #68

Closed

3 tasks

nemesifier reviewed Jul 27, 2020

View reviewed changes

PabloCastellano reviewed Jul 28, 2020

View reviewed changes

nepython force-pushed the issues/99-add-initial-support-second-timeseries_db branch 5 times, most recently from 7bad1b0 to e98535a Compare July 29, 2020 17:32

nepython added enhancement New feature or request timeseries Issues / PRs / tasks related to timeseries database labels Jul 29, 2020

nepython marked this pull request as ready for review July 29, 2020 17:57

nemesifier reviewed Jul 30, 2020

View reviewed changes

tests/openwisp2/settings.py Outdated Show resolved Hide resolved

tests/openwisp2/settings.py Outdated Show resolved Hide resolved

nepython force-pushed the issues/99-add-initial-support-second-timeseries_db branch from e98535a to 9c53057 Compare July 30, 2020 13:54

nepython mentioned this pull request Jul 31, 2020

[feature] Add a check which inspects device configuration status periodically #123

Merged

nepython force-pushed the issues/99-add-initial-support-second-timeseries_db branch 2 times, most recently from b5a9d0d to 9a80da6 Compare August 6, 2020 17:13

nepython force-pushed the issues/99-add-initial-support-second-timeseries_db branch from 9a80da6 to fa5759a Compare August 6, 2020 18:41

nepython force-pushed the dev branch from a6d3c0a to f6c7591 Compare August 7, 2020 08:16

nepython force-pushed the issues/99-add-initial-support-second-timeseries_db branch from fa5759a to 319ec3d Compare August 7, 2020 08:36

Base automatically changed from dev to master August 7, 2020 23:31

nemesifier changed the base branch from master to dev August 10, 2020 21:13

nepython force-pushed the issues/99-add-initial-support-second-timeseries_db branch 4 times, most recently from 93039cf to 91a648f Compare August 21, 2020 06:33

nemesifier requested changes Aug 26, 2020

View reviewed changes

nepython force-pushed the dev branch from bf71f80 to 017a192 Compare August 28, 2020 08:56

Base automatically changed from dev to master August 31, 2020 01:21

nepython mentioned this pull request Aug 31, 2020

[enhancement] Prepare travis-ci build #68

Closed

3 tasks

devkapilbansal force-pushed the issues/99-add-initial-support-second-timeseries_db branch 2 times, most recently from 1a5f91b to 624987b Compare April 8, 2021 06:37

[timeseries] Add initial support for elasticsearch #99

044a29f

Closes #99

devkapilbansal force-pushed the issues/99-add-initial-support-second-timeseries_db branch from 624987b to 044a29f Compare April 8, 2021 06:47

nemesifier mentioned this pull request Jun 25, 2024

[monitoring] Adding influxDB 2.x version support #274 #584

Draft

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[timeseries] Add initial support for elasticsearch #99 #164

[timeseries] Add initial support for elasticsearch #99 #164

nepython commented Jul 15, 2020 •

edited

Loading

nemesifier left a comment

nemesifier Jul 25, 2020

nepython Jul 27, 2020

PabloCastellano Jul 28, 2020 •

edited

Loading

nepython Jul 28, 2020

nepython commented Jul 29, 2020

nemesifier left a comment

nepython commented Jul 30, 2020

nemesifier commented Jul 30, 2020

nemesifier left a comment

nepython commented Aug 26, 2020

[timeseries] Add initial support for elasticsearch #99 #164

Are you sure you want to change the base?

[timeseries] Add initial support for elasticsearch #99 #164

Conversation

nepython commented Jul 15, 2020 • edited Loading

nemesifier left a comment

Choose a reason for hiding this comment

nemesifier Jul 25, 2020

Choose a reason for hiding this comment

nepython Jul 27, 2020

Choose a reason for hiding this comment

PabloCastellano Jul 28, 2020 • edited Loading

Choose a reason for hiding this comment

nepython Jul 28, 2020

Choose a reason for hiding this comment

nepython commented Jul 29, 2020

nemesifier left a comment

Choose a reason for hiding this comment

nepython commented Jul 30, 2020

nemesifier commented Jul 30, 2020

nemesifier left a comment

Choose a reason for hiding this comment

nepython commented Aug 26, 2020

nepython commented Jul 15, 2020 •

edited

Loading

PabloCastellano Jul 28, 2020 •

edited

Loading