performance degradation LDES Client + Repository materialiser #660

KVerduyn · 2024-07-04T07:56:05Z

Performance of the LDIO docker container is slowing down, with excessive memory consumption (10GB+), resulting in docker container dropping out due to heap space isues. Failure is after +/- 5 hours
Two pipelines are set up, one from geomobility, one from telraam.
Last Logfile is attached.

System is running on a hetzner server with 16GB memory on ubuntu linux, shared cpu.

LDES Client.zip

_LDES_Client_Telraam_logs-2.txt

rorlic · 2024-07-05T08:41:55Z

The issue can be reproduced with the attached configuration (gh-issue-660.zip). The problem lies in the repository materializer (Ldio:RepositoryMaterialiser) because the issue is none existing when using a no-op output (Ldio:NoopOut).

The left part of the graph shows the memory usage with the no-op output. On the right is the heap usage when having both pipelines output to a graph DB using the repository materializer:

The beahviour is most-likely due to the repository materializer components using the same graph DB connection. This results is one of the pipelines not being able to send the output to the graph DB. The following queries can be used to check the number of received version objects:

select (Count(?S) as ?telling) FROM <http://geomobility.eu/> where { ?S a <https://implementatie.data.vlaanderen.be/ns/vsds-verkeersmetingen#Verkeerstelling> . }
select (Count(?S) as ?telling) FROM <http://telraam.net/> where { ?S a <https://implementatie.data.vlaanderen.be/ns/vsds-verkeersmetingen#Verkeerstelling> . }

We noticed another (smaller) issue while investigating the above: the output counters are incremented before the members are actually received by the graph DB. This results in a mismatch of the above counts and the prometheus ldio_data_out_total counters. See github issue #661.

KVerduyn added the needs triage Issue needs to be evaluated by team label Jul 4, 2024

Yalz added this to VSDS Backlog Jul 4, 2024

github-project-automation bot moved this to 📋 Backlog in VSDS Backlog Jul 4, 2024

Yalz assigned rorlic Jul 4, 2024

rorlic mentioned this issue Jul 5, 2024

The prometheus counter ldio_data_out_total is incremented too early #661

Closed

Yalz added bug Something isn't working performance and removed needs triage Issue needs to be evaluated by team labels Jul 5, 2024

Yalz linked a pull request Jul 23, 2024 that will close this issue

fix: #660: Repository Sink: performance degradation #666

Merged

Yalz added this to the LDI 2.9.0 milestone Sep 10, 2024

Yalz closed this as completed in 5aa2f97 Sep 20, 2024

github-project-automation bot moved this from 📋 Backlog to 👀 In review in VSDS Backlog Sep 20, 2024

Yalz moved this from 👀 In review to ✅ Done in VSDS Backlog Sep 26, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

performance degradation LDES Client + Repository materialiser #660

performance degradation LDES Client + Repository materialiser #660

KVerduyn commented Jul 4, 2024

rorlic commented Jul 5, 2024

performance degradation LDES Client + Repository materialiser #660

performance degradation LDES Client + Repository materialiser #660

Comments

KVerduyn commented Jul 4, 2024

rorlic commented Jul 5, 2024