-
Notifications
You must be signed in to change notification settings - Fork 44
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix VaultAutopilotHealthy alert name and time window in alert descriptions #1157
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good to me, please bump the chart and run hack/helm-docs.sh
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks, please also update security-apps
(in a separate PR) to include the new version.
0c69cbd
to
ab67db6
Compare
@adfinis/helm-charts what do you think about bumping this chart to |
af851c8
to
73f97ea
Compare
Hmm read fix and this big bump and was like why? Are you sure this stuff is 1.0 ready? |
Looking at https://semver.org/#spec-item-5, we only need to care about the "breaking" change if we have |
73f97ea
to
2adb3a0
Compare
- alert: VaultResponseFailures | ||
expr: increase(vault_audit_log_response_failure[5m]) > 0 | ||
for: 15m | ||
labels: | ||
severity: critical | ||
annotations: | ||
summary: High frequency of failed Vault responses | ||
description: There has been an increased number of failed Vault responses in the last 15 minutes | ||
description: There has been an increased number of failed Vault responses in the last 20 minutes |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
How has this changed? It's still for: 15m
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
might be because 15m on the check + 5m in the query add up to 20m?
Exactly. That is what is keeping @hairmare from going |
I honestly believe that we depend on too many downstream charts in our |
i believe that as well, but this is vault-monitoring hence the reasoning doesn't apply |
a7240f7
to
eed5b77
Compare
I would like to improve the alert name for the autopilot.
Also, I noticed a link that no longer points to a meaningful location. We can remove it.