Skip to content

Latest commit

 

History

History
242 lines (212 loc) · 15.7 KB

RELEASENOTES.md

File metadata and controls

242 lines (212 loc) · 15.7 KB

Azure Monitor Metrics for AKS clusters

Release 11-16-2023

  • Windows image - `mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:6.8.2-main--win
  • Change log -
    • Fix Windows liveness probe for error level evaluation

Release 11-03-2023

  • Linux image - mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:6.8.1-main-11-03-2023-c061beb4
  • Windows image - mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:6.8.1-main-11-03-2023-c061beb4-win
  • TA image - mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:6.8.1-main-11-03-2023-c061beb4-targetallocator
  • cfg sidecar image - mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:6.8.1-main-11-03-2023-c061beb4-cfg
  • Change log -
    • Add new regions for Azure Monitor Workspace - Azure#609
    • Add telemetry for target allocator & config side-car image tags - Azure#661
    • Add more metrics as default metrics (to enable Insights Portal Ux) - Azure#667
      • kube-state-metrics
        • kube_service_info
        • kube_pod_container_status_running
        • kube_pod_container_status_waiting
        • kube_pod_container_status_terminated
        • kube_pod_container_state_started
        • kube_pod_created
        • kube_pod_start_time
        • kube_pod_init_container_info
        • kube_pod_init_container_status_terminated
        • kube_pod_init_container_status_terminated_reason
        • kube_pod_init_container_status_ready
        • kube_pod_init_container_resource_limits
        • kube_pod_init_container_status_running
        • kube_pod_init_container_status_waiting
        • kube_pod_init_container_status_restarts_total
      • node-exporter (Linux)
        • node_boot_time_seconds
    • Adding telemetry for ta and cfg reader img versions - Azure#661

Release 10-20-2023

  • Linux image - mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:6.8.0-main-10-20-2023-182f67d4
  • Windows image - mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:6.8.0-main-10-20-2023-182f67d4-win
  • TA image - mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:6.8.0-main-10-20-2023-182f67d4-targetallocator
  • cfg sidecar image - mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:6.8.0-main-10-20-2023-182f67d4-cfg
  • Change log -
    • Update telegraf jitter & disable exemplar for rs - Azure#634
    • Add Operator support for prometheus-collector - Azure#554

Release 10-05-2023

  • Linux image - mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:6.7.7-main-10-06-2023-b75a076c
  • Windows image - mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:6.7.7-main-10-06-2023-b75a076c-win
  • Change log -
    • Update k8s.io/clientgo in fluentbit plugin from 0.28.0 to 0.28.2 - Azure#595
    • fix: ARC fixes (already released to ARC as hotfix) - Azure#605
      • Update ARC regions (add Sweden South)
      • Fix registry for node exporter
      • Add clusterDistribution override parameter for AKS EE
    • Update CPU requests for Daemonset (linux & windows) - Azure#606
    • Add telemetry for per target scrape interval - Azure#614
    • Update dependencies, Disable exemplars on ME (Linux only), Update CVE exemptions, Stop windows ingestion from replicaset, Fix try scans to fail on CVEs - Azure#616
      • Linux
        • mdsd = azure-mdsd-1.23.5 --> 1.27.4
        • ME = 2.2023.224.2214 --> 2.2023.928.2134
        • telegraf = 1.25.2 --> 1.27.3
        • golang = 1.18 --> 1.20
      • Windows
        • golang = 1.18 --> 1.20
      • Upgrade addon token adapter for back door deployments (Linux only)
        • master.221118.2 --> master.230804.1
    • Fix $ substitution issue in relabel and metric relabel config - Azure#618
    • update github.com/prometheus/client_golang from 1.16.0 to 1.17.0 in fluentbit plugin - Azure#608

Release 9-11-2023

  • Linux image - mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:6.7.5-main-09-12-2023-8fbde9ca
  • Windows image - mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:6.7.5-main-09-12-2023-8fbde9ca-win
  • Change log -
    • Add container image signing - Azure#570
    • fix: windows liveness probe bug fix - Azure#568
    • fix: proxy liveness probe timing fix - Azure#591
    • update : add trouble shooting script - Azure#572
    • Add following metrics from below targets to be collected by default when with ingestion profile - This is for future alerting improvements - Azure#571
      • job=Kubelet - kubelet_certificate_manager_client_ttl_seconds, kubelet_certificate_manager_client_expiration_renew_errors, kubelet_server_expiration_renew_errors, kubelet_certificate_manager_server_ttl_seconds, kubelet_volume_stats_inodes_free, kube_persistentvolumeclaim_access_mode, kube_persistentvolumeclaim_labels, kube_persistentvolume_status_phase
      • job=kube-state-metrics - kube_daemonset_status_current_number_scheduled, kube_daemonset_status_number_misscheduled

Release 08-11-2023

  • Linux image - mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:6.7.4-main-08-11-2023-6de2ec55
  • Windows image - mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:6.7.4-main-08-11-2023-6de2ec55-win
  • Change log -
    • fix: revert commit e7254187 to remove $$-->$ issue in config processing
    • fix: ARC extension : fixes for selectively not mounting for k3s & k8s edge distros (Edge distros)

Release 07-28-2023

  • Linux image - mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:6.7.2-main-07-28-2023-0efd3e4e
  • Windows image - mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:6.7.2-main-07-28-2023-0efd3e4e-win
  • Change log -
    • fix: Add unfair semaphore wait to windows container for better initial CPU performance
    • fix: Upgrades Node exporter - image from: v1.5.0 to:v1.6.0; chart from:4.14.0 to:4.21.0 Kube state - image from: v2.8.1 to:v2.9.2; chart from:4.32.0 to:5.10.1
    • Arc extension: do not mount ubuntu ca-certs if k8s distro is AKS Edge Essentials

Release 06-26-2023

  • Linux image - mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:6.7.2-main-06-26-2023-6ee07896
  • Windows image - mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:6.7.2-main-06-26-2023-6ee07896-win
  • Change log -
    • fix: Bicep template warnings
    • fix: Adding kube_labels and kube_annotations to the default list of metrics
    • fix: Bump github.com/prometheus/client_golang from 1.15.1 to 1.16.0 in /otelcollector/fluent-bit/src
    • fix: Bump k8s.io/apimachinery from 0.27.2 to 0.27.3 in /otelcollector/fluent-bit/src
    • fix: Bump k8s.io/client-go from 0.27.1 to 0.27.3 in /otelcollector/fluent-bit/src

Release 06-02-2023

  • Linux image - mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:6.7.1-main-06-02-2023-d384b035
  • Windows image - mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:6.7.1-main-06-02-2023-d384b035-win
  • Change log -
    • fix: Terraform template fixes for Azure Monitor Metrics addon
    • fix: Reduce image tag length to docker limit of 128 characters
    • fix: Various ARC release script fixes
    • fix: Bicep template fix for adding role assingment for new grafana instance + allow different RGs for Grafana and AMW

Release 05-04-2023

  • Linux image - mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:6.7.0-main-05-04-2023-4450ad10
  • Windows image - mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:6.7.0-main-05-04-2023-4450ad10-win
  • Change log -
    • feat: Add release and CI/CD support for Arc extension
    • fix: Allowlist all metrics used in alerting
    • fix: Update CPU and memory limits for Windows pods

Release 04-25-2023

  • Linux image - mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:6.6.0-main-04-25-2023-2eb2a81c
  • Windows image - mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:6.6.0-main-04-25-2023-2eb2a81c-win
  • Change log -
    • feat: Add priorityclass system node critical for RS, DS & KSM pods
    • fix: Upgrades Fluent bit - from: v1.9.6 to:v2.0.9 Telegraf(windows) - from: v1.23.4 to:v1.24.2 Otelcol - from:v0.66.0 to:v0.73.0
    • fix: pod annotations bug

Release 03-24-2023

  • Linux image - mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:6.5.0-main-03-24-2023-7eb3f5c7
  • Windows image - mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:6.5.0-main-03-24-2023-7eb3f5c7-win
  • Change log -
    • feat: Support for ARC-A
    • fix: Match ME setings between DS & RS
    • feat: Simplify Pod annotation based scraping by adding it as a target in the configmap
    • fix: Add golang race detector during builds (SDL requirement)
    • fix: Reduce telemetry volume
    • feat: Make deployment progress deadline configurable as a param (controbution from @OriYosefiMSFT)
    • feat: Enable workload identity for valur store (1p only) (contribution from @lnr0626)
    • fix: Bump client-go and lumberjack.v2 packages for fluentbit
    • fix: Upgrades Node exporter - image from: v1.3.1 to:v1.5.0; chart from:3.1.1 to:4.14.0 Kube state - image from: v2.6.0 to:v2.8.1; chart from:4.23.0 to 4.32.0 ME - from:2.2022.1201.1140 to:2.2023.224.2214 MDSD - from:1.23.4 to:1.23.5 MA - from:46.2.3 to: 46.4.1 Telegraf(linux) - from 1.23.0 to 1.25.2
    • fix: CVEs (many)

Release 02-22-2023

  • Linux image - mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:6.4.0-main-02-22-2023-3ee44b9e
  • Windows image - mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:6.4.0-main-02-22-2023-3ee44b9e-win
  • Change log -
    • feat: Allow setting a priority class for the daemonset and deployment objects
    • fix: Truncate the tag to 128 characters (docker requirement)
    • fix: Bump github.com/prometheus/client_golang from 1.9.0 to 1.11.1 in /internal/referenceapp/golang
    • feat: HTTP Proxy for distroless and ARC
    • fix: Wait for token adapter to be healthy before starting dependencies
    • feat: Add azure policy templates for metrics addon
    • feat: enable network monitoring metrics (kappie)
    • feat: AKS addon HTTP Proxy Support
    • fix: certificate import for windows ME startup

Release 01-31-2023

  • Linux image - mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:6.2.0-main-01-31-2023-e1e3858b
  • Windows image - mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:6.2.0-main-01-31-2023-e1e3858b-win
  • Change log -
    • Adding Bicep template to monitoring addon
    • Added custom DCR and DCE arm templates for Remote Write
    • Adding monitoring reader role to Azure Monitor Workspace in ARM and Bicep templates
    • Fix fluent-bit daemonset tailing path and mariner docs
    • Liveness probe update for NON-MAC mode (windows)
    • Adds windows daemonset support with MSI (only in deprecated chart mode)

Release 01-11-2023

  • Linux image - mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:6.1.0-main-01-11-2023-5bf41607
  • Windows image - mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:6.1.0-main-01-11-2023-5bf41607-win
  • Change log -
    • Upgrade otel (from 0.58 to 0.66)
    • Upgrade ME (from 2.2022.1021.1309 to 2.2022.1201.1140)
    • Upgrade mdsd (from azure-mdsd_1.19.3-build.master.428 to azure-mdsd_1.23.4-build.master.28)

Release 12-14-2022 (This version is being released only internally due to deployment freeze during holidays)

  • Linux image - mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:6.0.2-main-12-14-2022-e0364da3
  • Windows image - mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:6.0.2-main-12-14-2022-e0364da3-win
  • Change Log -
    • Update addon token adapter (from master.220916.1 to master.221118.2)
    • Enable non-default dashboards & their recording rules (apiserver, kube-proxy, kubedns and kubernetes*)
    • Fix for excluding windows nodes in the node dropdown for k8s computer (nodes) dashboard

Release 11-29-2022

  • Linux image - mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:6.0.1-main-11-29-2022-97e2122e
  • Windows image - mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:6.0.1-main-11-29-2022-97e2122e-win
  • Change Log -
    • Move to mariner base for Linux image
    • Enable ARM64 support (for addon based datacollection only) - Includes both Daemonset & Replicaset
    • Update Kube-state-metrics chart (from 4.18.0 to 4.23.0) [chart only upgrade]
    • Update Prometheus node exporter chart (from 3.1.1 to 4.5.2) and image (from 1.3.1 to 1.4.0) [Remove selector label changes in 1.4.x chart that breaks upgrade]

Release 10-27-2022

  • Linux image - mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:5.4.0-main-10-26-2022-16f02b39
  • Windows image - mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:5.4.0-main-10-26-2022-16f02b39-win
  • Change Log -
    • Release custom prometheus config global settings to apply to the default targets in AKS-Addon
    • Rebuild with latest base image for security patches

Release 10-06-22

  • Linux image - mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:5.3.0-main-10-06-2022-c0c49872
  • Change Log -
    • Add capability for the custom prometheus config global settings to apply to the default targets
    • Bug fix - Rollback from otelcollector version 0.59.0 to 0.58.0 due to external labels bug
    • Bug fix - Fix race condition for internal production build

Release 09-30-2022

  • Linux image - mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:5.2.0-main-09-29-2022-ca064de1
  • Windows image - mcr.microsoft.com/azuremonitor/containerinsights/ciprod/prometheus-collector/images:5.2.0-main-09-29-2022-ca064de1-win
  • Change Log -
    • Public preview release image for Azure Monitor Metrics on AKS clusters
    • Add NOTICE file for OSS code and Component Governance generated notice for container
    • Bug fix - Add missing region dimension for all telemetry collected thru telegraf
    • Bug fix - Fix memory usage alert which hits multiple matches for labels issue
    • Bug fix - Remove virtual node core capacity from telemetry total
    • Bug fix - Update alert group names for default and CI alerts
    • Bug fix - Update prometheus custom config for Azure Monitor Metrics Addon