Staff Pick

๐Ÿ“ก Monitor your APIs โ€” know when they go down before your users do

Better Stack checks uptime every 30 seconds with instant Slack, email & SMS alerts. Free tier available.

Start Free โ†’

Affiliate link โ€” we may earn a commission at no extra cost to you

Blogโ€บAzure Monitoring Guide

Azure Monitoring Guide 2026: Azure Monitor, App Insights & Best Practices

A complete guide to monitoring Azure infrastructure in production โ€” covering Azure Monitor metrics, Log Analytics, Application Insights APM, alerts, and when to add third-party tools.

Updated: May 2026ยท20 min read

Is Azure down right now? Check the Azure Status page or read our Azure outage guide.

Azure is the second-largest cloud provider, powering millions of production workloads. Microsoft gives you a rich monitoring stack โ€” Azure Monitor, Application Insights, Log Analytics, Azure Security Center, and more โ€” but knowing which tools to use for which problems, and how to avoid spending a fortune on telemetry ingestion, requires real production experience. This guide distills Azure monitoring into actionable best practices.

The Azure Monitoring Stack: What's Available

Azure provides several native monitoring services, each covering a different layer of observability:

ServiceWhat It DoesBest For
Azure Monitor MetricsCollect and graph metrics from all Azure servicesCPU, memory, latency, error rates
Log Analytics WorkspaceCentralize and query logs via KQLLog aggregation, complex analysis
Azure Monitor AlertsAlert on metrics, logs, or activity eventsProactive alerting, auto-scaling
Application InsightsFull APM for web apps and APIsRequest tracing, exceptions, dependency tracking
Azure Activity LogAudit log of all Azure control-plane operationsSecurity, compliance, change tracking
Azure Service HealthAzure service health events affecting your subscriptionOutage awareness, incident tracking
Network WatcherNetwork topology, traffic analytics, packet captureNetwork debugging, security
Azure Security CenterSecurity posture and threat detectionCompliance, vulnerability assessment

Azure Monitor: Core Infrastructure Monitoring

Understanding Azure Monitor Metrics

Azure Monitor Metrics is the foundation of Azure infrastructure monitoring. Every Azure resource โ€” VMs, AKS clusters, Functions, SQL databases, App Services, and hundreds more โ€” automatically publishes metrics. Metrics are organized by resource type and metric namespace.

Key Azure Monitor Metrics concepts:

Critical Metrics by Azure Service

Virtual Machines (VMs)

  • Percentage CPU: Alert at >80% sustained for 5+ minutes
  • Network In/Out: Baseline and alert on anomalies
  • Disk Read/Write Bytes: I/O saturation on managed disks
  • OS Disk IOPS Consumed Percentage: Alert at >90%
  • Memory (via Azure Monitor Agent): Not available by default โ€” install AMA extension

Azure Kubernetes Service (AKS)

  • Node CPU usage percentage: Alert at >80% โ€” scale node pool before saturation
  • Node memory working set percentage: Alert at >80%
  • Pod count by phase: Watch Pending/Failed counts โ€” indicates scheduling issues
  • Unschedulable pod count: Immediate alert โ€” pods waiting for capacity
  • Restarting container count: CrashLoopBackOff detection

Azure SQL Database

  • DTU percentage (DTU model): Alert at >80% โ€” scale tier before saturation
  • CPU percentage (vCore model): Alert at >70%
  • Storage percentage: Alert when >80% to prevent autogrow issues
  • Deadlocks: Alert on any deadlocks โ€” indicates application logic issues
  • Connection failures: Alert on sustained failures โ€” connection pool exhaustion

Azure App Service

  • CPU percentage: Alert at >80% โ€” scale out instances
  • Memory percentage: Alert at >85% to prevent OOM kills
  • Http5xx: Error rate โ€” alert on >1% of requests
  • Response time: Alert on P95 > your SLA threshold
  • Requests: Traffic baseline โ€” drops indicate upstream issues

Azure Functions

  • Execution count: Traffic baseline and anomaly detection
  • Function errors: Alert on sustained error rate >1%
  • Function execution duration: Watch P99 โ€” timeout is configurable but default 5 min
  • Throttled function runs: Hitting concurrency limits on Consumption plan
  • Active instance count: Scale monitoring for Premium/Dedicated plans
๐Ÿ“ก
Recommended

Supplement Azure Monitor with Better Stack for on-call alerting

Azure Monitor can alert via Action Groups, but Better Stack gives you escalation policies, on-call schedules, and incident management in one place. Integrates with Azure Monitor in minutes.

Try Better Stack Free โ†’

Azure Monitor Alerts: Getting Effective Alerts

Azure Monitor Alerts fire when a condition is met on a metric, log query, or activity log event. Azure supports three alert types โ€” metric alerts (fastest, near real-time), log search alerts (flexible KQL queries on Log Analytics data), and activity log alerts (service health, resource changes).

Alert Best Practices

Application Insights: APM for Azure Apps

Application Insights is Microsoft's application performance monitoring (APM) service. It instruments your code โ€” web apps, APIs, Functions, and workers โ€” to capture telemetry automatically. Unlike Azure Monitor Metrics (infrastructure layer), Application Insights operates at the application layer: requests, exceptions, dependencies, page views, custom events, and traces.

Auto-Instrumented vs SDK Instrumentation

Application Insights offers two instrumentation approaches:

Auto-Instrumentation (Codeless)

Available for Azure App Service (.NET, Java, Node.js, Python), Azure Functions, and AKS via the Application Insights Kubernetes agent. No code changes required โ€” enable via the Azure Portal.

Captures: HTTP requests/responses, SQL queries, HTTP dependencies, exceptions, performance counters

SDK Instrumentation (Code-Based)

Add the Application Insights SDK to your code for full control. Available for .NET, Java, Node.js, Python, JavaScript, and more.

Adds: Custom events, custom metrics, custom traces, user journey tracking, custom dimensions on all telemetry

Key Application Insights Features

Log Analytics: Querying Logs with KQL

Log Analytics Workspace is where Azure routes all diagnostic logs โ€” VM syslog, AKS container logs, App Service application logs, Activity Logs, and Application Insights telemetry. You query it using Kusto Query Language (KQL).

Essential KQL Queries

// Top failing requests in App Insights

requests
| where success == false
| summarize count() by name, resultCode
| order by count_ desc
| take 20

// P95 response time by operation in last 1 hour

requests
| where timestamp > ago(1h)
| summarize percentile(duration, 95) by name
| order by percentile_duration_95 desc

// AKS pod restart count (last 24h)

KubePodInventory
| where TimeGenerated > ago(24h)
| where ContainerRestartCount > 0
| summarize sum(ContainerRestartCount) by PodName, Namespace
| order by sum_ContainerRestartCount desc

// Exception volume by type (last 6h)

exceptions
| where timestamp > ago(6h)
| summarize count() by type, outerMessage
| order by count_ desc

Log Ingestion Cost Optimization

Log Analytics can become expensive quickly. Strategies to control costs:

Alert Pro

14-day free trial

Stop checking โ€” get alerted instantly

Next time Azure goes down, you'll know in under 60 seconds โ€” not when your users start complaining.

  • Email alerts for Azure + 9 more APIs
  • $0 due today for trial
  • Cancel anytime โ€” $9/mo after trial

Azure Workbooks & Dashboards

Azure Monitor Workbooks are interactive reports that combine metrics, logs, parameters, and visualizations. They're the preferred way to build custom Azure dashboards beyond the basic Azure Portal charts.

Third-Party Azure Monitoring Tools

Native Azure Monitor is powerful but has limitations: KQL has a steep learning curve, the alert UI is complex, and cross-cloud visibility is limited. Third-party tools are common in Azure production environments:

ToolAzure IntegrationBest For
DatadogNative Azure integration โ€” 800+ metrics, log forwardingFull-stack observability, multi-cloud
New RelicAzure polling + Log Forwarding integrationAPM + infrastructure in one platform
Grafana CloudAzure Monitor data source pluginTeams already using Grafana/Prometheus
Better StackWebhook integration with Azure Monitor AlertsUptime monitoring + on-call management
DynatraceAuto-discovery via Azure APIEnterprise APM with AI-powered root cause
ElasticAzure Metricbeat + Filebeat + FunctionbeatLog search + APM on self-managed stack

Azure Monitoring Best Practices

1. Enable Diagnostic Settings on Every Resource

By default, only platform metrics are collected. Enable diagnostic settings on each resource to route logs and additional metrics to Log Analytics. Use Azure Policy to enforce this at scale across subscriptions.

2. Use Tags for Cost Allocation and Alert Routing

Tag resources with environment, team, and service. Use tags in Action Groups to route alerts to the right team and in Cost Management to allocate monitoring spend.

3. Set Up Azure Service Health Alerts

Configure Service Health alerts (free) to notify your team when Azure has incidents, planned maintenance, or health advisories in your regions. Integrate these with your on-call tool via webhook or Logic App.

4. Define SLOs and Use Error Budgets

Define SLOs in Log Analytics Workbooks (e.g., 99.9% uptime = 43 minutes downtime/month budget). Track error budget burn rate and alert when burning too fast. See our SLO guide for implementation details.

5. Implement Distributed Tracing for Microservices

If you run multiple services on Azure (AKS + Functions + App Service + Service Bus), enable Application Insights distributed tracing or OpenTelemetry to correlate requests across service boundaries. Without it, debugging multi-hop failures is guesswork.

6. Monitor Your Monitoring: Alert on Data Gaps

Create a "Heartbeat absent" alert in Log Analytics to detect when an agent stops reporting. If your VM's Log Analytics agent crashes, you want to know before a real incident happens without visibility.

๐Ÿ“ก
Recommended

Better Stack + Azure Monitor: on-call alerting done right

Connect Azure Monitor alerts to Better Stack for smart escalations, on-call scheduling, and incident management. Works with any Azure alert action group.

Try Better Stack Free โ†’

Azure vs AWS Monitoring: Key Differences

If you're coming from AWS, here's how Azure's monitoring stack maps:

AWS ServiceAzure EquivalentKey Difference
CloudWatch MetricsAzure Monitor MetricsAzure uses diagnostic settings instead of automatic log routing
CloudWatch LogsLog Analytics WorkspaceAzure uses KQL; AWS uses CloudWatch Logs Insights (SQL-like)
CloudWatch AlarmsAzure Monitor AlertsAzure has dynamic thresholds; AWS requires manual static thresholds
AWS X-RayApplication Insights (distributed tracing)App Insights also does full APM; X-Ray is tracing only
CloudTrailAzure Activity LogVery similar; both audit all API calls
AWS HealthAzure Service HealthBoth offer subscription-level health events
CloudWatch DashboardsAzure Workbooks / DashboardsWorkbooks are more powerful; Azure Portal Dashboards are simpler

Getting Started: Azure Monitoring Quickstart Checklist

Day 1 Setup

  • โœ“Create a central Log Analytics Workspace in each region
  • โœ“Enable diagnostic settings on all critical resources (route to Log Analytics)
  • โœ“Enable Application Insights on all web apps and APIs
  • โœ“Configure Azure Service Health alerts (free, high value)
  • โœ“Set Action Groups for email/SMS/webhook routing
  • โœ“Create metric alerts for CPU, memory, error rates on production resources
  • โœ“Enable Container Insights on AKS clusters
  • โœ“Set a daily ingestion cap on Log Analytics to avoid bill shock

Related Azure Monitoring Resources

๐Ÿ›  Tools We Use & Recommend

Tested across our own infrastructure monitoring 200+ APIs daily

SEMrushBest for SEO

SEO & Site Performance Monitoring

Used by 10M+ marketers

Track your site health, uptime, search rankings, and competitor movements from one dashboard.

โ€œWe use SEMrush to track how our API status pages rank and catch site health issues early.โ€

From $129.95/moTry SEMrush Free
View full comparison & more tools โ†’Affiliate links โ€” we earn a commission at no extra cost to you