Azure Monitor managed service for Prometheus had an issue during regular deployments starting august-13-2023, whereby , Managed Prometheus metrics got auto-disabled on some AKS clusters that satisfies one or more of the below criteria.
- AKS cluster had both Azure Monitor logs & Azure Monitor managed service for Prometheus enabled, and also cluster had any
PUT
operation. - AKS cluster had Azure Monitor managed service for Prometheus enabled, and Azure Monitor logs was enabled after that or vice versa.
Issue has been mitigated on the Azure service side on august-27-2023, but some clusters might still see Prometheus metrics collection disabled and they require customer action as stated below. Please follow the below steps to re-enable Azure Monitor managed service for Prometheus on the impacted AKS cluster(s) you might have.
Step-1
: Dis-associate the Data collection rule for Managed Prometheus with the impacted AKS cluster.
You can dis-associate the DCR (Data Collection Rule) association with the cluster by several ways.
Usually in most cases, DCR is created in the same resource group as AKS cluster, in the same region as Azure Monitor Workspace, and it has MSProm
profix in its name. Once Prometheus DCR is idenfied for your AKS cluster, you would have to dis-associate that DCR with the AKS cluster resource, which can be done in several ways as below.
-
Method-1 : Navigate to
Datacollection Rules
in Azure portal, and then chooseResources
menu, which will show you all the Data collection Rules in the selected Subscriptions and Resource groups, that you would need to filter with AKS cluster's Subscription and Resource group. Once the DCR is identified, choose theResources
menu underConfiguration
group for the DCR, which will show the AKS cluster that you can choose and delete. -
Method-2 : If you had chosen custom DCR names & Resource Group for your Prometheus DCR, you would have to identify the correct DCR to perform the above step.
Step-2
: After step-1 above has been complete, please re-enable Azure Monitor managed service for Prometheus using any of the supported deployment mechanisms as documented here
If you need further help on recovering from this issue, please reach out to us at [email protected]