Terraform module for deploying Kubernetes monitoring on Datadog

linceaerian/terraform-datadog-kubernetes

Datadog Kubernetes module

Requirements

Name Version
terraform >= 0.13

Providers

Name Version
datadog >= 2.25
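
The version constraints above could be pinned in the calling configuration roughly as follows. This is a minimal sketch, not taken from the module itself: the variable names `datadog_api_key` and `datadog_app_key` are hypothetical, and the way credentials are supplied is an assumption.

```hcl
terraform {
  # Matches the Requirements table above.
  required_version = ">= 0.13"

  required_providers {
    datadog = {
      source  = "DataDog/datadog"
      version = ">= 2.25" # Matches the Providers table above.
    }
  }
}

# Hypothetical variables for the Datadog credentials (not defined by this module).
variable "datadog_api_key" {
  type = string
}

variable "datadog_app_key" {
  type = string
}

provider "datadog" {
  api_key = var.datadog_api_key
  app_key = var.datadog_app_key
}
```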

Usage

module "datadog_kubernetes" {
  source            = "linkbynet/kubernetes/datadog"
  orchestrator_name = "AKS"
  environment       = "test"
  customer_name     = "mycustomer"
  customer_id       = 1234
  asset_lbnref      = 1234
  asset_name        = "mykubernetes"
}
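
Beyond the required inputs, the optional inputs listed under Inputs can be set globally or per monitor. The sketch below uses only input names from that table. All values are illustrative assumptions, and the call is an alternative to the basic one above rather than an addition to it.

```hcl
module "datadog_kubernetes" {
  source = "linkbynet/kubernetes/datadog"

  # Required inputs (same as the basic usage).
  orchestrator_name = "AKS"
  environment       = "test"
  customer_name     = "mycustomer"
  customer_id       = 1234
  asset_lbnref      = 1234
  asset_name        = "mykubernetes"

  # Global settings applied to every monitor (illustrative values).
  notify_to         = "@monitoring-teams"
  renotify_interval = 60
  custom_tags = {
    team = "platform" # hypothetical tag
  }

  # Per-monitor overrides, here for the host_cpu_high monitor only.
  renotify_interval_host_cpu_high = 120
  extra_tags_host_cpu_high        = ["tier:node"] # hypothetical tag
}
```

The per-monitor `renotify_interval_*` inputs default to `-42`, which the table describes as "check renotify_interval". Leaving them unset therefore presumably means the global `renotify_interval` applies.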

Modules

No modules.

Resources

Name Type
datadog_monitor.cluster_capacity_cpu_high resource
datadog_monitor.cluster_capacity_ram_high resource
datadog_monitor.cluster_failedscheduling resource
datadog_monitor.cluster_pv_high_usage resource
datadog_monitor.cluster_pvc_pending resource
datadog_monitor.daemonset_pods_not_enough resource
datadog_monitor.deployment_deleted resource
datadog_monitor.deployment_no_pod resource
datadog_monitor.deployment_pods_not_enough resource
datadog_monitor.host_cpu_high resource
datadog_monitor.host_disk_high resource
datadog_monitor.host_load_high resource
datadog_monitor.host_node_disk_pressure resource
datadog_monitor.host_node_memory_pressure resource
datadog_monitor.host_node_not_ready resource
datadog_monitor.host_ntp_sync resource
datadog_monitor.host_pod_scheduled_not_ready resource
datadog_monitor.host_ram_high resource
datadog_monitor.host_reboot resource
datadog_monitor.services_endpoints_address_not_ready resource
datadog_monitor.statefulset_no_pod resource
datadog_monitor.statefulset_pods_not_enough resource

Inputs

Name Description Type Default Required
asset_lbnref lbnref of master cloud service any n/a yes
asset_name Asset name any n/a yes
category Category of the monitors string "283" no
collect_logs Whether logs are collected string "false" no
custom_tags Map of custom tags to set on resources map(string) {} no
customer_id ID of the client in Idefix any n/a yes
customer_name Client name as in Idefix any n/a yes
daemonset_pods_not_enough_threshold_critical Critical threshold for daemonset_pods_not_enough string "0.9" no
daemonset_pods_not_enough_threshold_warning Warning threshold for daemonset_pods_not_enough string "0.4" no
default_tags Map of default tags to set on resources map(string) {} no
deployment_pods_not_enough_threshold_critical Critical threshold for deployment_pods_not_enough string "0.6" no
deployment_pods_not_enough_threshold_warning Warning threshold for deployment_pods_not_enough string "0.9" no
environment Specifies the environment any n/a yes
escalation_message Default escalation message string "" no
escalation_message_cluster_api_operations_error_rate Escalation message for monitor [cluster_api_operations_error_rate] string "" no
escalation_message_cluster_capacity_cpu_high Escalation message for monitor [cluster_capacity_cpu_high] string "" no
escalation_message_cluster_capacity_ram_high Escalation message for monitor [cluster_capacity_ram_high] string "" no
escalation_message_cluster_failedscheduling Escalation message for monitor [cluster_failedscheduling] string "" no
escalation_message_cluster_pv_high_usage Escalation message for monitor [cluster_pv_high_usage] string "" no
escalation_message_cluster_pvc_pending Escalation message for monitor [cluster_pvc_pending] string "" no
escalation_message_daemonset_pods_not_enough Escalation message for monitor [daemonset_pods_not_enough] string "" no
escalation_message_deployment_deleted Escalation message for monitor [deployment_deleted] string "" no
escalation_message_deployment_no_pod Escalation message for monitor [deployment_no_pod] string "" no
escalation_message_deployment_pods_not_enough Escalation message for monitor [deployment_pods_not_enough] string "" no
escalation_message_host_cpu_high Escalation message for monitor [host_cpu_high] string "" no
escalation_message_host_disk_high Escalation message for monitor [host_disk_high] string "" no
escalation_message_host_load_high Escalation message for monitor [host_load_high] string "" no
escalation_message_host_node_disk_pressure Escalation message for monitor [host_node_disk_pressure] string "" no
escalation_message_host_node_memory_pressure Escalation message for monitor [host_node_memory_pressure] string "" no
escalation_message_host_node_not_ready Escalation message for monitor [host_node_not_ready] string "" no
escalation_message_host_ntp_sync Escalation message for monitor [host_ntp_sync] string "" no
escalation_message_host_pod_scheduled_not_ready Escalation message for monitor [host_pod_scheduled_not_ready] string "" no
escalation_message_host_ram_high Escalation message for monitor [host_ram_high] string "" no
escalation_message_host_reboot Escalation message for monitor [host_reboot] string "" no
escalation_message_services_endpoints_address_not_ready Escalation message for monitor [services_endpoints_address_not_ready] string "" no
escalation_message_statefulset_no_pod Escalation message for monitor [statefulset_no_pod] string "" no
escalation_message_statefulset_pods_not_enough Escalation message for monitor [statefulset_pods_not_enough] string "" no
evaluation_delay Delay in seconds for the metric evaluation number 15 no
extra_tags_cluster_api_operations_error_rate Extra tags for monitor [cluster_api_operations_error_rate] list(string) [] no
extra_tags_cluster_capacity_cpu_high Extra tags for monitor [cluster_capacity_cpu_high] list(string) [] no
extra_tags_cluster_capacity_ram_high Extra tags for monitor [cluster_capacity_ram_high] list(string) [] no
extra_tags_cluster_failedscheduling Extra tags for monitor [cluster_failedscheduling] list(string) [] no
extra_tags_cluster_pv_high_usage Extra tags for monitor [cluster_pv_high_usage] list(string) [] no
extra_tags_cluster_pvc_pending Extra tags for monitor [cluster_pvc_pending] list(string) [] no
extra_tags_daemonset_pods_not_enough Extra tags for monitor [daemonset_pods_not_enough] list(string) [] no
extra_tags_deployment_deleted Extra tags for monitor [deployment_deleted] list(string) [] no
extra_tags_deployment_no_pod Extra tags for monitor [deployment_no_pod] list(string) [] no
extra_tags_deployment_pods_not_enough Extra tags for monitor [deployment_pods_not_enough] list(string) [] no
extra_tags_host_cpu_high Extra tags for monitor [host_cpu_high] list(string) [] no
extra_tags_host_disk_high Extra tags for monitor [host_disk_high] list(string) [] no
extra_tags_host_load_high Extra tags for monitor [host_load_high] list(string) [] no
extra_tags_host_node_disk_pressure Extra tags for monitor [host_node_disk_pressure] list(string) [] no
extra_tags_host_node_memory_pressure Extra tags for monitor [host_node_memory_pressure] list(string) [] no
extra_tags_host_node_not_ready Extra tags for monitor [host_node_not_ready] list(string) [] no
extra_tags_host_ntp_sync Extra tags for monitor [host_ntp_sync] list(string) [] no
extra_tags_host_pod_scheduled_not_ready Extra tags for monitor [host_pod_scheduled_not_ready] list(string) [] no
extra_tags_host_ram_high Extra tags for monitor [host_ram_high] list(string) [] no
extra_tags_host_reboot Extra tags for monitor [host_reboot] list(string) [] no
extra_tags_services_endpoints_address_not_ready Extra tags for monitor [services_endpoints_address_not_ready] list(string) [] no
extra_tags_statefulset_no_pod Extra tags for monitor [statefulset_no_pod] list(string) [] no
extra_tags_statefulset_pods_not_enough Extra tags for monitor [statefulset_pods_not_enough] list(string) [] no
irp_cluster_api_operations_error_rate Define IRP for monitor [notify_cluster_api_operations_error_rate] string "http://docs.managed-containers.lbn.fr/#/administration-guides/kubernetes/orchestrator/IRP/IRP-cluster-api-error-rate" no
irp_cluster_capacity_cpu_high Define IRP for monitor [notify_cluster_capacity_cpu_high] string "https://docs.managed-containers.lbn.fr/#/administration-guides/kubernetes/orchestrator/IRP/IRP-cluster-capacity" no
irp_cluster_capacity_ram_high Define IRP for monitor [notify_cluster_capacity_ram_high] string "https://docs.managed-containers.lbn.fr/#/administration-guides/kubernetes/orchestrator/IRP/IRP-cluster-capacity" no
irp_cluster_failedscheduling Define IRP for monitor [notify_cluster_failedscheduling] string "http://docs.managed-containers.lbn.fr/#/administration-guides/kubernetes/orchestrator/IRP/IRP-cluster-failedscheduling" no
irp_cluster_pv_high_usage Define IRP for monitor [notify_cluster_pv_high_usage] string "https://docs.managed-containers.lbn.fr/#/administration-guides/kubernetes/service/IRP/IRP-persistent-volumes-usage-too-high" no
irp_cluster_pvc_pending Define IRP for monitor [notify_cluster_pvc_pending] string "http://docs.managed-containers.lbn.fr/#/administration-guides/kubernetes/orchestrator/IRP/IRP-cluster-pvc-pending" no
irp_daemonset_pods_not_enough Define IRP for monitor [notify_daemonset_pods_not_enough] string "http://docs.managed-containers.lbn.fr/#/administration-guides/kubernetes/service/IRP/IRP-daemonset-missing-pods-in-daemonset" no
irp_deployment_deleted Define IRP for monitor [notify_deployment_deleted] string "https://docs.managed-containers.lbn.fr/#/administration-guides/kubernetes/service/IRP/IRP-deployments-was-recently-deleted" no
irp_deployment_no_pod Define IRP for monitor [notify_deployment_no_pod] string "http://docs.managed-containers.lbn.fr/#/administration-guides/kubernetes/service/IRP/IRP-deployments-no-pods-in-deployments" no
irp_deployment_pods_not_enough Define IRP for monitor [notify_deployment_pods_not_enough] string "http://docs.managed-containers.lbn.fr/#/administration-guides/kubernetes/service/IRP/IRP-deployments-missing-pods-in-deployments" no
irp_host_cpu_high Define IRP for monitor [notify_host_cpu_high] string "http://docs.managed-containers.lbn.fr/#/administration-guides/kubernetes/orchestrator/IRP/IRP-host-cpu-high" no
irp_host_disk_high Define IRP for monitor [notify_host_disk_high] string "http://docs.managed-containers.lbn.fr/#/administration-guides/kubernetes/orchestrator/IRP/IRP-host-disk-usage-high" no
irp_host_load_high Define IRP for monitor [notify_host_load_high] string "http://docs.managed-containers.lbn.fr/#/administration-guides/kubernetes/orchestrator/IRP/IRP-host-load-high" no
irp_host_node_disk_pressure Define IRP for monitor [notify_host_node_disk_pressure] string "http://docs.managed-containers.lbn.fr/#/administration-guides/kubernetes/orchestrator/IRP/IRP-host-node-disk-pressure" no
irp_host_node_memory_pressure Define IRP for monitor [notify_host_node_memory_pressure] string "http://docs.managed-containers.lbn.fr/#/administration-guides/kubernetes/orchestrator/IRP/IRP-host-node-memory-pressure" no
irp_host_node_not_ready Define IRP for monitor [notify_host_node_not_ready] string "http://docs.managed-containers.lbn.fr/#/administration-guides/kubernetes/orchestrator/IRP/IRP-cluster-nodes-not-ready" no
irp_host_ntp_sync Define IRP for monitor [notify_host_ntp_sync] string "http://docs.managed-containers.lbn.fr/#/administration-guides/kubernetes/orchestrator/IRP/IRP-host-ntp-not-sync" no
irp_host_pod_scheduled_not_ready Define IRP for monitor [notify_host_pod_scheduled_not_ready] string "https://docs.managed-containers.lbn.fr/#/administration-guides/kubernetes/orchestrator/IRP/IRP-node-notready" no
irp_host_ram_high Define IRP for monitor [notify_host_ram_high] string "http://docs.managed-containers.lbn.fr/#/administration-guides/kubernetes/orchestrator/IRP/IRP-host-ram-high" no
irp_host_reboot Define IRP for monitor [notify_host_reboot] string "http://docs.managed-containers.lbn.fr/#/administration-guides/kubernetes/orchestrator/IRP/IRP-host-reboot" no
irp_services_endpoints_address_not_ready Define IRP for monitor [notify_services_endpoints_address_not_ready] string "https://docs.managed-containers.lbn.fr/#/administration-guides/kubernetes/service/IRP/IRP-service-endpoints-address-not-ready" no
irp_statefulset_no_pod Define IRP for monitor [notify_statefulset_no_pod] string "http://docs.managed-containers.lbn.fr/#/administration-guides/kubernetes/service/IRP/IRP-statefulset-no-pod" no
irp_statefulset_pods_not_enough Define IRP for monitor [notify_statefulset_pods_not_enough] string "https://docs.managed-containers.lbn.fr/#/administration-guides/kubernetes/service/IRP/IRP-statefulset-missing-pods" no
isprod activate the monitor on production on Sentinel string "false" no
isprod_cluster_api_operations_error_rate Define production status for monitor [notify_cluster_api_operations_error_rate], usually managed by global isprod variable string "" no
isprod_cluster_capacity_cpu_high Define production status for monitor [notify_cluster_capacity_cpu_high], usually managed by global isprod variable string "" no
isprod_cluster_capacity_ram_high Define production status for monitor [notify_cluster_capacity_ram_high], usually managed by global isprod variable string "" no
isprod_cluster_failedscheduling Define production status for monitor [notify_cluster_failedscheduling], usually managed by global isprod variable string "" no
isprod_cluster_pv_high_usage Define production status for monitor [notify_cluster_pv_high_usage], usually managed by global isprod variable string "false" no
isprod_cluster_pvc_pending Define production status for monitor [notify_cluster_pvc_pending], usually managed by global isprod variable string "" no
isprod_daemonset_pods_not_enough Define production status for monitor [notify_daemonset_pods_not_enough], usually managed by global isprod variable string "" no
isprod_deployment_deleted Define production status for monitor [notify_deployment_deleted], usually managed by global isprod variable string "" no
isprod_deployment_no_pod Define production status for monitor [notify_deployment_no_pod], usually managed by global isprod variable string "" no
isprod_deployment_pods_not_enough Define production status for monitor [notify_deployment_pods_not_enough], usually managed by global isprod variable string "" no
isprod_host_cpu_high Define production status for monitor [notify_host_cpu_high], usually managed by global isprod variable string "" no
isprod_host_disk_high Define production status for monitor [notify_host_disk_high], usually managed by global isprod variable string "" no
isprod_host_load_high Define production status for monitor [notify_host_load_high], usually managed by global isprod variable string "" no
isprod_host_node_disk_pressure Define production status for monitor [notify_host_node_disk_pressure], usually managed by global isprod variable string "" no
isprod_host_node_memory_pressure Define production status for monitor [notify_host_node_memory_pressure], usually managed by global isprod variable string "" no
isprod_host_node_not_ready Define production status for monitor [notify_host_node_not_ready], usually managed by global isprod variable string "" no
isprod_host_ntp_sync Define production status for monitor [notify_host_ntp_sync], usually managed by global isprod variable string "" no
isprod_host_pod_scheduled_not_ready Define production status for monitor [notify_host_pod_scheduled_not_ready], usually managed by global isprod variable string "" no
isprod_host_ram_high Define production status for monitor [notify_host_ram_high], usually managed by global isprod variable string "" no
isprod_host_reboot Define production status for monitor [notify_host_reboot], usually managed by global isprod variable string "" no
isprod_services_endpoints_address_not_ready Define production status for monitor [notify_services_endpoints_address_not_ready], usually managed by global isprod variable string "" no
isprod_statefulset_no_pod Define production status for monitor [notify_statefulset_no_pod], usually managed by global isprod variable string "" no
isprod_statefulset_pods_not_enough Define production status for monitor [notify_statefulset_pods_not_enough], usually managed by global isprod variable string "" no
message_cluster_api_operations_error_rate Custom message for monitor [cluster_api_operations_error_rate] string " This monitor calculate the API error rate per operation type.\n In case of anomaly detected based on error rate history, an alert can be raised. \n \n" no
message_cluster_capacity_cpu_high Custom message for monitor [cluster_capacity_cpu_high] string " Theses monitor verify the cluster capacity, by node role, for the memory and cpu usage. \n \n" no
message_cluster_capacity_ram_high Custom message for monitor [cluster_capacity_ram_high] string " Theses monitor verify the cluster capacity, by node role, for the memory and cpu usage. \n \n" no
message_cluster_failedscheduling Custom message for monitor [cluster_failedscheduling] string " This monitor check Kubernetes events and search for FailedScheduling events.\n This event means that the cluster does not find a suitable node to schedule a Pod.\n \n" no
message_cluster_pv_high_usage Custom message for monitor [cluster_pv_high_usage] string " This monitor detect if the cluster Persitent volume usage is too high\n and will need a modification to be apply.\n\n Disk-free threshold formula: (total_size/(log(total_size)*20). Monitor formula: disk_free - disk_free_threshold\n\n The value should be higher than 0.\n\n {{#is_alert}} The disk saturation on the persistent volume {{persistentvolumeclaim.name}} has always been critical during the last check period.\n\n It seems that there is currently a space problem on {{persistentvolumeclaim.name}}\n\n Cleaning up the disk storage at least {{eval \"humanize_bytes(abs(value))\"}} in order to have the monitor UP. {{/is_alert}}\n\n {{#is_alert_recovery}} The disk space usage on the persistent volume {{persistentvolumeclaim.name}} is back to normal now. {{/is_alert_recovery}}\n\n Please escalate the issue to the team in charge.\n\n You will probably need to increase the PV by following:\n https://docs.managed-containers.lbn.fr/#/administration-guides/kubernetes/orchestrator-components/persistent-storage/?id=resizing-persisentvolumesn \n" no
message_cluster_pvc_pending Custom message for monitor [cluster_pvc_pending] string " This monitor check if some PersistentVolumeClaims are not bound to PersistentVolumes on the cluster.\n \n" no
message_daemonset_pods_not_enough Custom message for monitor [daemonset_pods_not_enough] string " This monitor check the number of available Pods in a DaemonSet versus the DaemonSet number of ready Pods.\n It is in alert if ready != desired\n The DaemonSet is special kind of Deployment for a Pod that should always run on all cluster Nodes (monitoring agent...)\n \n" no
message_deployment_deleted Custom message for monitor [deployment_deleted] string " This monitor if the Deployment was deleted during last check period.\n \n" no
message_deployment_no_pod Custom message for monitor [deployment_no_pod] string " This monitor check that there is at least on Pod in each Deployment.\n \n" no
message_deployment_pods_not_enough Custom message for monitor [deployment_pods_not_enough] string " This monitor check the number of available Pods in a Deployment versus the Deployment desired number of Pods.\n \n" no
message_host_cpu_high Custom message for monitor [host_cpu_high] string " This monitor check the CPU usage on each Node.\n \n" no
message_host_disk_high Custom message for monitor [host_disk_high] string " This monitor check the Disk usage on each Node.\n \n" no
message_host_load_high Custom message for monitor [host_load_high] string " This monitor check the Load usage on each Node.\n \n" no
message_host_node_disk_pressure Custom message for monitor [host_node_disk_pressure] string " This monitor detect if the Node is in a disk_pressure condition.\n \n" no
message_host_node_memory_pressure Custom message for monitor [host_node_memory_pressure] string " This monitor detect if the Node is in a memory_pressure condition.\n \n" no
message_host_node_not_ready Custom message for monitor [host_node_not_ready] string " This monitor check that all cluster nodes are Ready.\n \n" no
message_host_ntp_sync Custom message for monitor [host_ntp_sync] string " This monitor check if the NTP is correctly synced.\n \n" no
message_host_pod_scheduled_not_ready Custom message for monitor [host_pod_scheduled_not_ready] string " This monitor compare the number on scheduled Pods on host vs the number of Ready Pods on host.\n \n" no
message_host_ram_high Custom message for monitor [host_ram_high] string " This monitor check the RAM usage on each Node.\n \n" no
message_host_reboot Custom message for monitor [host_reboot] string " This incidate that the server has been restarted recently.\n \n" no
message_services_endpoints_address_not_ready Custom message for monitor [services_endpoints_address_not_ready] string " This monitor check the endpoints status for each endpoints of each service.\n \n" no
message_statefulset_no_pod Custom message for monitor [statefulset_no_pod] string " This monitor check that there is at least on Pod in each Statefulset.\n \n" no
message_statefulset_pods_not_enough Custom message for monitor [statefulset_pods_not_enough] string " This monitor indicate that there is not enough pods available in the statefulset.\n \n" no
new_host_delay Delay in seconds before a new resource is monitored number 300 no
no_data_timeframe_cluster_api_operations_error_rate No data timeframe in minutes for monitor [cluster_api_operations_error_rate] number 2880 no
no_data_timeframe_cluster_capacity_cpu_high No data timeframe in minutes for monitor [cluster_capacity_cpu_high] number 2880 no
no_data_timeframe_cluster_capacity_ram_high No data timeframe in minutes for monitor [cluster_capacity_ram_high] number 2880 no
no_data_timeframe_cluster_failedscheduling No data timeframe in minutes for monitor [cluster_failedscheduling] number 2 no
no_data_timeframe_cluster_pv_high_usage No data timeframe in minutes for monitor [cluster_pv_high_usage] number 600 no
no_data_timeframe_cluster_pvc_pending No data timeframe in minutes for monitor [cluster_pvc_pending] number 2 no
no_data_timeframe_daemonset_pods_not_enough No data timeframe in minutes for monitor [daemonset_pods_not_enough] number 2 no
no_data_timeframe_deployment_deleted No data timeframe in minutes for monitor [deployment_deleted] number 2 no
no_data_timeframe_deployment_no_pod No data timeframe in minutes for monitor [deployment_no_pod] number 2 no
no_data_timeframe_deployment_pods_not_enough No data timeframe in minutes for monitor [deployment_pods_not_enough] number 2 no
no_data_timeframe_host_cpu_high No data timeframe in minutes for monitor [host_cpu_high] number 2 no
no_data_timeframe_host_disk_high No data timeframe in minutes for monitor [host_disk_high] number 2 no
no_data_timeframe_host_load_high No data timeframe in minutes for monitor [host_load_high] number 2 no
no_data_timeframe_host_node_disk_pressure No data timeframe in minutes for monitor [host_node_disk_pressure] number 2 no
no_data_timeframe_host_node_memory_pressure No data timeframe in minutes for monitor [host_node_memory_pressure] number 2 no
no_data_timeframe_host_node_not_ready No data timeframe in minutes for monitor [host_node_not_ready] number 30 no
no_data_timeframe_host_ntp_sync No data timeframe in minutes for monitor [host_ntp_sync] number 2 no
no_data_timeframe_host_pod_scheduled_not_ready No data timeframe in minutes for monitor [host_pod_scheduled_not_ready] number 2 no
no_data_timeframe_host_ram_high No data timeframe in minutes for monitor [host_ram_high] number 2 no
no_data_timeframe_host_reboot No data timeframe in minutes for monitor [host_reboot] number 2 no
no_data_timeframe_services_endpoints_address_not_ready No data timeframe in minutes for monitor [services_endpoints_address_not_ready] number 2 no
no_data_timeframe_statefulset_no_pod No data timeframe in minutes for monitor [statefulset_no_pod] number 2 no
no_data_timeframe_statefulset_pods_not_enough No data timeframe in minutes for monitor [statefulset_pods_not_enough] number 2 no
notify_capacity Who to notify for capacity alerts string "" no
notify_cluster_api_operations_error_rate Define notification hooks for monitor [notify_cluster_api_operations_error_rate] string "" no
notify_cluster_capacity_cpu_high Define notification hooks for monitor [notify_cluster_capacity_cpu_high] string "" no
notify_cluster_capacity_ram_high Define notification hooks for monitor [notify_cluster_capacity_ram_high] string "" no
notify_cluster_failedscheduling Define notification hooks for monitor [notify_cluster_failedscheduling] string "" no
notify_cluster_pv_high_usage Define notification hooks for monitor [notify_cluster_pv_high_usage] string "" no
notify_cluster_pvc_pending Define notification hooks for monitor [notify_cluster_pvc_pending] string "" no
notify_daemonset_pods_not_enough Define notification hooks for monitor [notify_daemonset_pods_not_enough] string "" no
notify_deployment_deleted Define notification hooks for monitor [notify_deployment_deleted] string "" no
notify_deployment_no_pod Define notification hooks for monitor [notify_deployment_no_pod] string "" no
notify_deployment_pods_not_enough Define notification hooks for monitor [notify_deployment_pods_not_enough] string "" no
notify_host_cpu_high Define notification hooks for monitor [notify_host_cpu_high] string "" no
notify_host_disk_high Define notification hooks for monitor [notify_host_disk_high] string "" no
notify_host_load_high Define notification hooks for monitor [notify_host_load_high] string "" no
notify_host_node_disk_pressure Define notification hooks for monitor [notify_host_node_disk_pressure] string "" no
notify_host_node_memory_pressure Define notification hooks for monitor [notify_host_node_memory_pressure] string "" no
notify_host_node_not_ready Define notification hooks for monitor [notify_host_node_not_ready] string "" no
notify_host_ntp_sync Define notification hooks for monitor [notify_host_ntp_sync] string "" no
notify_host_pod_scheduled_not_ready Define notification hooks for monitor [notify_host_pod_scheduled_not_ready] string "" no
notify_host_ram_high Define notification hooks for monitor [notify_host_ram_high] string "" no
notify_host_reboot Define notification hooks for monitor [notify_host_reboot] string "" no
notify_no_data_cluster_api_operations_error_rate Will raise no data alert if set to true for monitor [cluster_api_operations_error_rate] bool true no
notify_no_data_cluster_capacity_cpu_high Will raise no data alert if set to true for monitor [cluster_capacity_cpu_high] bool true no
notify_no_data_cluster_capacity_ram_high Will raise no data alert if set to true for monitor [cluster_capacity_ram_high] bool true no
notify_no_data_cluster_failedscheduling Will raise no data alert if set to true for monitor [cluster_failedscheduling] bool true no
notify_no_data_cluster_pv_high_usage Will raise no data alert if set to true for monitor [cluster_pv_high_usage] bool true no
notify_no_data_cluster_pvc_pending Will raise no data alert if set to true for monitor [cluster_pvc_pending] bool true no
notify_no_data_daemonset_pods_not_enough Will raise no data alert if set to true for monitor [daemonset_pods_not_enough] bool true no
notify_no_data_deployment_deleted Will raise no data alert if set to true for monitor [deployment_deleted] bool true no
notify_no_data_deployment_no_pod Will raise no data alert if set to true for monitor [deployment_no_pod] bool false no
notify_no_data_deployment_pods_not_enough Will raise no data alert if set to true for monitor [deployment_pods_not_enough] bool false no
notify_no_data_host_cpu_high Will raise no data alert if set to true for monitor [host_cpu_high] bool false no
notify_no_data_host_disk_high Will raise no data alert if set to true for monitor [host_disk_high] bool false no
notify_no_data_host_load_high Will raise no data alert if set to true for monitor [host_load_high] bool false no
notify_no_data_host_node_disk_pressure Will raise no data alert if set to true for monitor [host_node_disk_pressure] bool false no
notify_no_data_host_node_memory_pressure Will raise no data alert if set to true for monitor [host_node_memory_pressure] bool false no
notify_no_data_host_node_not_ready Will raise no data alert if set to true for monitor [host_node_not_ready] bool false no
notify_no_data_host_ntp_sync Will raise no data alert if set to true for monitor [host_ntp_sync] bool false no
notify_no_data_host_pod_scheduled_not_ready Will raise no data alert if set to true for monitor [host_pod_scheduled_not_ready] bool false no
notify_no_data_host_ram_high Will raise no data alert if set to true for monitor [host_ram_high] bool false no
notify_no_data_host_reboot Will raise no data alert if set to true for monitor [host_reboot] bool false no
notify_no_data_services_endpoints_address_not_ready Will raise no data alert if set to true for monitor [services_endpoints_address_not_ready] bool false no
notify_no_data_statefulset_no_pod Will raise no data alert if set to true for monitor [statefulset_no_pod] bool true no
notify_no_data_statefulset_pods_not_enough Will raise no data alert if set to true for monitor [statefulset_pods_not_enough] bool true no
notify_services_endpoints_address_not_ready Define notification hooks for monitor [notify_services_endpoints_address_not_ready] string "" no
notify_statefulset_no_pod Define notification hooks for monitor [notify_statefulset_no_pod] string "" no
notify_statefulset_pods_not_enough Define notification hooks for monitor [notify_statefulset_pods_not_enough] string "" no
notify_to Define notification hooks string "@webhook-SGCProd @monitoring-teams" no
orchestrator_name Name of the orchestrator solution (used as the monitors' prefix): AKS/GKE/EKS… any n/a yes
renotify_interval Delay in minutes before monitor is renotified number 40 no
renotify_interval_cluster_api_operations_error_rate Delay in minutes before monitor [cluster_api_operations_error_rate] is renotified, default: check renotify_interval number -42 no
renotify_interval_cluster_capacity_cpu_high Delay in minutes before monitor [cluster_capacity_cpu_high] is renotified (default: see renotify_interval) number -42 no
renotify_interval_cluster_capacity_ram_high Delay in minutes before monitor [cluster_capacity_ram_high] is renotified (default: see renotify_interval) number -42 no
renotify_interval_cluster_failedscheduling Delay in minutes before monitor [cluster_failedscheduling] is renotified (default: see renotify_interval) number -42 no
renotify_interval_cluster_pv_high_usage Delay in minutes before monitor [cluster_pv_high_usage] is renotified (default: see renotify_interval) number -42 no
renotify_interval_cluster_pvc_pending Delay in minutes before monitor [cluster_pvc_pending] is renotified (default: see renotify_interval) number -42 no
renotify_interval_daemonset_pods_not_enough Delay in minutes before monitor [daemonset_pods_not_enough] is renotified (default: see renotify_interval) number -42 no
renotify_interval_deployment_deleted Delay in minutes before monitor [deployment_deleted] is renotified (default: see renotify_interval) number -42 no
renotify_interval_deployment_no_pod Delay in minutes before monitor [deployment_no_pod] is renotified (default: see renotify_interval) number -42 no
renotify_interval_deployment_pods_not_enough Delay in minutes before monitor [deployment_pods_not_enough] is renotified (default: see renotify_interval) number -42 no
renotify_interval_host_cpu_high Delay in minutes before monitor [host_cpu_high] is renotified (default: see renotify_interval) number -42 no
renotify_interval_host_disk_high Delay in minutes before monitor [host_disk_high] is renotified (default: see renotify_interval) number -42 no
renotify_interval_host_load_high Delay in minutes before monitor [host_load_high] is renotified (default: see renotify_interval) number -42 no
renotify_interval_host_node_disk_pressure Delay in minutes before monitor [host_node_disk_pressure] is renotified (default: see renotify_interval) number -42 no
renotify_interval_host_node_memory_pressure Delay in minutes before monitor [host_node_memory_pressure] is renotified (default: see renotify_interval) number -42 no
renotify_interval_host_node_not_ready Delay in minutes before monitor [host_node_not_ready] is renotified (default: see renotify_interval) number -42 no
renotify_interval_host_ntp_sync Delay in minutes before monitor [host_ntp_sync] is renotified (default: see renotify_interval) number -42 no
renotify_interval_host_pod_scheduled_not_ready Delay in minutes before monitor [host_pod_scheduled_not_ready] is renotified (default: see renotify_interval) number -42 no
renotify_interval_host_ram_high Delay in minutes before monitor [host_ram_high] is renotified (default: see renotify_interval) number -42 no
renotify_interval_host_reboot Delay in minutes before monitor [host_reboot] is renotified (default: see renotify_interval) number -42 no
renotify_interval_services_endpoints_address_not_ready Delay in minutes before monitor [services_endpoints_address_not_ready] is renotified (default: see renotify_interval) number -42 no
renotify_interval_statefulset_no_pod Delay in minutes before monitor [statefulset_no_pod] is renotified (default: see renotify_interval) number -42 no
renotify_interval_statefulset_pods_not_enough Delay in minutes before monitor [statefulset_pods_not_enough] is renotified (default: see renotify_interval) number -42 no
scope_cluster_api_operations_error_rate Define a custom scope for monitor [cluster_api_operations_error_rate] string "" no
scope_cluster_capacity_cpu_high Define a custom scope for monitor [cluster_capacity_cpu_high] string "" no
scope_cluster_capacity_ram_high Define a custom scope for monitor [cluster_capacity_ram_high] string "" no
scope_cluster_failedscheduling Define a custom scope for monitor [cluster_failedscheduling] string "" no
scope_cluster_pv_high_usage Define a custom scope for monitor [cluster_pv_high_usage] string "" no
scope_cluster_pvc_pending Define a custom scope for monitor [cluster_pvc_pending] string "" no
scope_daemonset_pods_not_enough Define a custom scope for monitor [daemonset_pods_not_enough] string "" no
scope_deployment_deleted Define a custom scope for monitor [deployment_deleted] string "" no
scope_deployment_no_pod Define a custom scope for monitor [deployment_no_pod] string "" no
scope_deployment_pods_not_enough Define a custom scope for monitor [deployment_pods_not_enough] string "" no
scope_host_cpu_high Define a custom scope for monitor [host_cpu_high] string "" no
scope_host_disk_high Define a custom scope for monitor [host_disk_high] string "" no
scope_host_load_high Define a custom scope for monitor [host_load_high] string "" no
scope_host_node_disk_pressure Define a custom scope for monitor [host_node_disk_pressure] string "" no
scope_host_node_memory_pressure Define a custom scope for monitor [host_node_memory_pressure] string "" no
scope_host_node_not_ready Define a custom scope for monitor [host_node_not_ready] string "" no
scope_host_ntp_sync Define a custom scope for monitor [host_ntp_sync] string "" no
scope_host_pod_scheduled_not_ready Define a custom scope for monitor [host_pod_scheduled_not_ready] string "" no
scope_host_ram_high Define a custom scope for monitor [host_ram_high] string "" no
scope_host_reboot Define a custom scope for monitor [host_reboot] string "" no
scope_services_endpoints_address_not_ready Define a custom scope for monitor [services_endpoints_address_not_ready] string "" no
scope_statefulset_no_pod Define a custom scope for monitor [statefulset_no_pod] string "" no
scope_statefulset_pods_not_enough Define a custom scope for monitor [statefulset_pods_not_enough] string "" no
severity_cluster_api_operations_error_rate Define severity for monitor [cluster_api_operations_error_rate] string "3" no
severity_cluster_capacity_cpu_high Define severity for monitor [cluster_capacity_cpu_high] string "2" no
severity_cluster_capacity_ram_high Define severity for monitor [cluster_capacity_ram_high] string "4" no
severity_cluster_failedscheduling Define severity for monitor [cluster_failedscheduling] string "1" no
severity_cluster_pv_high_usage Define severity for monitor [cluster_pv_high_usage] string "3" no
severity_cluster_pvc_pending Define severity for monitor [cluster_pvc_pending] string "3" no
severity_daemonset_pods_not_enough Define severity for monitor [daemonset_pods_not_enough] string "2" no
severity_deployment_deleted Define severity for monitor [deployment_deleted] string "3" no
severity_deployment_no_pod Define severity for monitor [deployment_no_pod] string "2" no
severity_deployment_pods_not_enough Define severity for monitor [deployment_pods_not_enough] string "2" no
severity_host_cpu_high Define severity for monitor [host_cpu_high] string "4" no
severity_host_disk_high Define severity for monitor [host_disk_high] string "3" no
severity_host_load_high Define severity for monitor [host_load_high] string "2" no
severity_host_node_disk_pressure Define severity for monitor [host_node_disk_pressure] string "2" no
severity_host_node_memory_pressure Define severity for monitor [host_node_memory_pressure] string "2" no
severity_host_node_not_ready Define severity for monitor [host_node_not_ready] string "3" no
severity_host_ntp_sync Define severity for monitor [host_ntp_sync] string "4" no
severity_host_pod_scheduled_not_ready Define severity for monitor [host_pod_scheduled_not_ready] string "4" no
severity_host_ram_high Define severity for monitor [host_ram_high] string "3" no
severity_host_reboot Define severity for monitor [host_reboot] string "3" no
severity_services_endpoints_address_not_ready Define severity for monitor [services_endpoints_address_not_ready] string "2" no
severity_statefulset_no_pod Define severity for monitor [statefulset_no_pod] string "2" no
severity_statefulset_pods_not_enough Define severity for monitor [statefulset_pods_not_enough] string "2" no
team_in_charge Define the team in charge string "Undefined" no
template Template name string "caas-kubernetes" no
threshold_critical_cluster_api_operations_error_rate Monitor [cluster_api_operations_error_rate] critical threshold number 1 no
threshold_critical_cluster_capacity_cpu_high Monitor [cluster_capacity_cpu_high] critical threshold number 90 no
threshold_critical_cluster_capacity_ram_high Monitor [cluster_capacity_ram_high] critical threshold number 90 no
threshold_critical_cluster_failedscheduling Monitor [cluster_failedscheduling] critical threshold number 1 no
threshold_critical_cluster_pv_high_usage Monitor [cluster_pv_high_usage] critical threshold number 0 no
threshold_critical_cluster_pvc_pending Monitor [cluster_pvc_pending] critical threshold number 0 no
threshold_critical_deployment_deleted Monitor [deployment_deleted] critical threshold number 1 no
threshold_critical_deployment_no_pod Monitor [deployment_no_pod] critical threshold number 1 no
threshold_critical_host_cpu_high Monitor [host_cpu_high] critical threshold number 85 no
threshold_critical_host_disk_high Monitor [host_disk_high] critical threshold number 95 no
threshold_critical_host_load_high Monitor [host_load_high] critical threshold number 5 no
threshold_critical_host_node_disk_pressure Monitor [host_node_disk_pressure] critical threshold number 1 no
threshold_critical_host_node_memory_pressure Monitor [host_node_memory_pressure] critical threshold number 1 no
threshold_critical_host_node_not_ready Monitor [host_node_not_ready] critical threshold number 4 no
threshold_critical_host_ntp_sync Monitor [host_ntp_sync] critical threshold number 1 no
threshold_critical_host_pod_scheduled_not_ready Monitor [host_pod_scheduled_not_ready] critical threshold number 1 no
threshold_critical_host_ram_high Monitor [host_ram_high] critical threshold number 10 no
threshold_critical_host_reboot Monitor [host_reboot] critical threshold number 5 no
threshold_critical_recovery_cluster_api_operations_error_rate Monitor [cluster_api_operations_error_rate] critical_recovery threshold number 0 no
threshold_critical_services_endpoints_address_not_ready Monitor [services_endpoints_address_not_ready] critical threshold number 1 no
threshold_critical_statefulset_no_pod Monitor [statefulset_no_pod] critical threshold number 1 no
threshold_critical_statefulset_pods_not_enough Monitor [statefulset_pods_not_enough] critical threshold number 0.3 no
threshold_ok_host_node_disk_pressure Monitor [host_node_disk_pressure] ok threshold number 1 no
threshold_ok_host_node_memory_pressure Monitor [host_node_memory_pressure] ok threshold number 1 no
threshold_ok_host_node_not_ready Monitor [host_node_not_ready] ok threshold number 2 no
threshold_ok_host_ntp_sync Monitor [host_ntp_sync] ok threshold number 1 no
threshold_warning_cluster_capacity_cpu_high Monitor [cluster_capacity_cpu_high] warning threshold number 80 no
threshold_warning_cluster_capacity_ram_high Monitor [cluster_capacity_ram_high] warning threshold number 80 no
threshold_warning_host_cpu_high Monitor [host_cpu_high] warning threshold number 75 no
threshold_warning_host_disk_high Monitor [host_disk_high] warning threshold number 90 no
threshold_warning_host_load_high Monitor [host_load_high] warning threshold number 3 no
threshold_warning_host_node_disk_pressure Monitor [host_node_disk_pressure] warning threshold number 1 no
threshold_warning_host_node_memory_pressure Monitor [host_node_memory_pressure] warning threshold number 1 no
threshold_warning_host_node_not_ready Monitor [host_node_not_ready] warning threshold number 2 no
threshold_warning_host_ntp_sync Monitor [host_ntp_sync] warning threshold number 1 no
threshold_warning_host_ram_high Monitor [host_ram_high] warning threshold number 20 no
threshold_warning_statefulset_pods_not_enough Monitor [statefulset_pods_not_enough] warning threshold number 0.1 no
timeframe_cluster_api_operations_error_rate Monitor timeframe for monitor [cluster_api_operations_error_rate] [available values: last_#m (1, 5, 10, 15, or 30), last_#h (1, 2, or 4), or last_1d] string "last_4h" no
timeframe_cluster_capacity_cpu_high Monitor timeframe for monitor [cluster_capacity_cpu_high] [available values: last_#m (1, 5, 10, 15, or 30), last_#h (1, 2, or 4), or last_1d] string "last_1d" no
timeframe_cluster_capacity_ram_high Monitor timeframe for monitor [cluster_capacity_ram_high] [available values: last_#m (1, 5, 10, 15, or 30), last_#h (1, 2, or 4), or last_1d] string "last_1d" no
timeframe_cluster_failedscheduling Monitor timeframe for monitor [cluster_failedscheduling] [available values: last_#m (1, 5, 10, 15, or 30), last_#h (1, 2, or 4), or last_1d] string "5m" no
timeframe_cluster_pv_high_usage Monitor timeframe for monitor [cluster_pv_high_usage] [available values: last_#m (1, 5, 10, 15, or 30), last_#h (1, 2, or 4), or last_1d] string "last_15m" no
timeframe_cluster_pvc_pending Monitor timeframe for monitor [cluster_pvc_pending] [available values: last_#m (1, 5, 10, 15, or 30), last_#h (1, 2, or 4), or last_1d] string "last_5m" no
timeframe_daemonset_pods_not_enough Monitor timeframe for monitor [daemonset_pods_not_enough] [available values: last_#m (1, 5, 10, 15, or 30), last_#h (1, 2, or 4), or last_1d] string "last_10m" no
timeframe_deployment_deleted Monitor timeframe for monitor [deployment_deleted] [available values: last_#m (1, 5, 10, 15, or 30), last_#h (1, 2, or 4), or last_1d] string "last_5m" no
timeframe_deployment_no_pod Monitor timeframe for monitor [deployment_no_pod] [available values: last_#m (1, 5, 10, 15, or 30), last_#h (1, 2, or 4), or last_1d] string "last_5m" no
timeframe_deployment_pods_not_enough Monitor timeframe for monitor [deployment_pods_not_enough] [available values: last_#m (1, 5, 10, 15, or 30), last_#h (1, 2, or 4), or last_1d] string "last_10m" no
timeframe_host_cpu_high Monitor timeframe for monitor [host_cpu_high] [available values: last_#m (1, 5, 10, 15, or 30), last_#h (1, 2, or 4), or last_1d] string "last_30m" no
timeframe_host_disk_high Monitor timeframe for monitor [host_disk_high] [available values: last_#m (1, 5, 10, 15, or 30), last_#h (1, 2, or 4), or last_1d] string "last_5m" no
timeframe_host_load_high Monitor timeframe for monitor [host_load_high] [available values: last_#m (1, 5, 10, 15, or 30), last_#h (1, 2, or 4), or last_1d] string "last_30m" no
timeframe_host_node_disk_pressure Monitor timeframe for monitor [host_node_disk_pressure] string 2 no
timeframe_host_node_memory_pressure Monitor timeframe for monitor [host_node_memory_pressure] [available values: last_#m (1, 5, 10, 15, or 30), last_#h (1, 2, or 4), or last_1d] string 2 no
timeframe_host_node_not_ready Monitor occurrence (timeframe) for monitor [host_node_not_ready] string "6" no
timeframe_host_ntp_sync Monitor timeframe for monitor [host_ntp_sync] string 2 no
timeframe_host_pod_scheduled_not_ready Monitor timeframe for monitor [host_pod_scheduled_not_ready] [available values: last_#m (1, 5, 10, 15, or 30), last_#h (1, 2, or 4), or last_1d] string "last_10m" no
timeframe_host_ram_high Monitor timeframe for monitor [host_ram_high] [available values: last_#m (1, 5, 10, 15, or 30), last_#h (1, 2, or 4), or last_1d] string "last_5m" no
timeframe_host_reboot Monitor timeframe for monitor [host_reboot] [available values: last_#m (1, 5, 10, 15, or 30), last_#h (1, 2, or 4), or last_1d] string "last_5m" no
timeframe_services_endpoints_address_not_ready Monitor timeframe for monitor [services_endpoints_address_not_ready] [available values: last_#m (1, 5, 10, 15, or 30), last_#h (1, 2, or 4), or last_1d] string "last_5m" no
timeframe_statefulset_no_pod Monitor timeframe for monitor [statefulset_no_pod] [available values: last_#m (1, 5, 10, 15, or 30), last_#h (1, 2, or 4), or last_1d] string "last_5m" no
timeframe_statefulset_pods_not_enough Monitor timeframe for monitor [statefulset_pods_not_enough] [available values: last_#m (1, 5, 10, 15, or 30), last_#h (1, 2, or 4), or last_1d] string "last_5m" no
type_cluster_api_operations_error_rate Define type for monitor [cluster_api_operations_error_rate] string "information" no
type_cluster_capacity_cpu_high Define type for monitor [cluster_capacity_cpu_high] string "incident" no
type_cluster_capacity_ram_high Define type for monitor [cluster_capacity_ram_high] string "information" no
type_cluster_failedscheduling Define type for monitor [cluster_failedscheduling] string "incident" no
type_cluster_pv_high_usage Define type for monitor [cluster_pv_high_usage] string "incident" no
type_cluster_pvc_pending Define type for monitor [cluster_pvc_pending] string "incident" no
type_daemonset_pods_not_enough Define type for monitor [daemonset_pods_not_enough] string "incident" no
type_deployment_deleted Define type for monitor [deployment_deleted] string "alert" no
type_deployment_no_pod Define type for monitor [deployment_no_pod] string "incident" no
type_deployment_pods_not_enough Define type for monitor [deployment_pods_not_enough] string "incident" no
type_host_cpu_high Define type for monitor [host_cpu_high] string "information" no
type_host_disk_high Define type for monitor [host_disk_high] string "information" no
type_host_load_high Define type for monitor [host_load_high] string "information" no
type_host_node_disk_pressure Define type for monitor [host_node_disk_pressure] string "information" no
type_host_node_memory_pressure Define type for monitor [host_node_memory_pressure] string "information" no
type_host_node_not_ready Define type for monitor [host_node_not_ready] string "incident" no
type_host_ntp_sync Define type for monitor [host_ntp_sync] string "information" no
type_host_pod_scheduled_not_ready Define type for monitor [host_pod_scheduled_not_ready] string "information" no
type_host_ram_high Define type for monitor [host_ram_high] string "information" no
type_host_reboot Define type for monitor [host_reboot] string "information" no
type_services_endpoints_address_not_ready Define type for monitor [services_endpoints_address_not_ready] string "incident" no
type_statefulset_no_pod Define type for monitor [statefulset_no_pod] string "incident" no
type_statefulset_pods_not_enough Define type for monitor [statefulset_pods_not_enough] string "incident" no
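
The per-monitor inputs listed above follow a common naming pattern: a prefix (renotify_interval_, scope_, severity_, threshold_*_, timeframe_, type_) followed by the monitor name. As a rough sketch of how several of them might be overridden together for a single monitor, the block below tunes host_cpu_high. The input names and value formats come from the table; the scope string and the cluster name "mycluster" are assumptions for illustration only, since the exact scope syntax the module expects is not documented here.

```hcl
module "datadog_kubernetes" {
  source            = "linkbynet/kubernetes/datadog"
  orchestrator_name = "AKS"
  environment       = "test"
  customer_name     = "mycustomer"
  customer_id       = 1234
  asset_lbnref      = 1234
  asset_name        = "mykubernetes"

  # Alert earlier than the documented defaults (warning 75, critical 85).
  threshold_warning_host_cpu_high  = 70
  threshold_critical_host_cpu_high = 80

  # Evaluate over a shorter window than the default "last_30m".
  timeframe_host_cpu_high = "last_15m"

  # Severity and type are strings; the defaults are "4" and "information".
  severity_host_cpu_high = "3"
  type_host_cpu_high     = "incident"

  # Renotify every 60 minutes for this monitor only.
  renotify_interval_host_cpu_high = 60

  # ASSUMPTION: a Datadog tag filter; verify the expected format in the module source.
  scope_host_cpu_high = "kube_cluster_name:mycluster"
}
```

Inputs that are not set keep the defaults shown in the table, so only the monitors that need tuning have to appear in the module block.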

Outputs

No outputs.
