[CONFIG-129] Ctlstore reflector metrics inconsistency in prod euw 1 #106
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Check out https://twilio.slack.com/archives/C0324FBR4Q5/p1677021062766279 for RCA.
In order to make this change effective to every node loading ldb, we might need to cut a tag and making every terraform places updating the ctlstore-ldb module version. An alternative to this solution has already been achieved by creating a different metrics by counting the pods running that has app tag
ctlstore-reflector-v2
: https://segment.datadoghq.com/dashboard/5hd-biq-s3m/ctlstore?fullscreen_end_ts=1678395211411&fullscreen_paused=false&fullscreen_section=edit&fullscreen_start_ts=1678380811411&fullscreen_widget=2374545259291630&from_ts=1678380795743&to_ts=1678395195743&live=true