Grafana Cloud Outage History

50 incidents reported. Data sourced from the official Grafana Cloud status page.

50
Total Incidents
20
Major/Critical
22
Minor
49
Resolved

May 2026

Elevated Error Rate of Browser Checks in PoP Oregon

minor
May 5, 04:11 PMinvestigating
May 5, 04:11 PM
investigatingWe’re currently investigating an issue affecting browser checks in the PoP Oregon region. Our team is actively working to identify the cause. Thank you for your patience.

k6 Partial Outage

major
May 4, 10:58 PMMay 5, 02:09 AMresolved
May 5, 02:09 AM
resolvedThis incident has been resolved. Thank you for your patience.
May 5, 12:04 AM
monitoringWe’ve implemented a fix and are monitoring the results to confirm the issue is fully resolved. Services may start to recover during this time.
May 4, 11:23 PM
investigatingAfter further investigation, this issue may also be affecting Synthetic Monitoring. We continue to identify the cause and will update as soon as we have more information.
+1 more updates

Ingestion Errors for AWS Cloud Provider Observability Metric Streams in prod-us-central-7

major
May 1, 09:14 AMMay 1, 10:27 AMresolved
May 1, 10:27 AM
resolvedThis incident has been resolved.
May 1, 09:43 AM
monitoringA fix has been implemented and we are monitoring the results.
May 1, 09:42 AM
investigatingWe are continuing to investigate this issue.
+1 more updates

April 2026

Gateway Slowness Detected in Prod (US-East-1)

minor
Apr 28, 09:20 AMApr 30, 03:11 PMresolved
Apr 30, 03:11 PM
resolvedAfter further review, this was a false alarm and should not have affected any users. This incident has been resolved. Thank you for your patience.
Apr 28, 09:20 AM
investigatingSuccessful requests have dropped, users may not be able to access their instances.. The issue is under investigation.

Investigating Issues Saving SQL Datasource Credentials

minor
Apr 28, 06:46 PMApr 29, 01:37 PMresolved
Apr 29, 01:37 PM
resolvedThis incident has been resolved. Thank you for your patience.
Apr 28, 06:59 PM
monitoringWe’ve identified the cause of the issue impacting SQL datasources. Our team is currently implementing a fix and are monitoring the results to confirm the issue is fully resolved. Services may start to...
Apr 28, 06:46 PM
investigatingWe are currently investigating reports of issues affecting SQL-based data sources where users are unable to save credentials. This appears to impact a subset of customers and may be occurring across ...

Performance Testing – Degraded Service (Resolved)

none
Apr 29, 12:00 PMApr 29, 12:00 PMresolved
Apr 29, 01:51 PM
resolvedWe experienced degraded performance affecting Performance Testing from 13:10 UTC to 13:20 UTC. During this time, users may not have been able to start new test runs. The issue has been resolved, and ...

Elevated write latency for AWS Metrics Streaming integration in us-east-3 region.

minor
Apr 29, 10:30 AMApr 29, 10:30 AMresolved
Apr 29, 12:57 PM
resolvedWe were facing an incident with AWS Metrics Streaming integration in us-east-3 region manifesting in elevated ingestion latency. The incident started at around 10:45 UTC and was resolved at around 12:...

InfluxDB Datasource - Intermittent Failures

major
Apr 27, 05:08 PMApr 27, 11:24 PMresolved
Apr 27, 11:24 PM
resolvedThis incident has been resolved. Thank you for your patience.
Apr 27, 11:13 PM
monitoringWe’ve implemented a fix and are monitoring the results to confirm the issue is fully resolved. Services may start to recover during this time.
Apr 27, 06:01 PM
identifiedWe’ve identified the cause of the issue impacting the InfluxDB datasource. Our team is currently implementing a fix.
+1 more updates

Restrictions on Alerts & Reports for Grafana Cloud Free/Trial Users

minor
Apr 20, 09:12 PMApr 24, 03:04 PMresolved
Apr 24, 03:04 PM
resolvedGrafana Labs has taken steps to safeguard the Grafana Cloud platform against the distribution of unauthorized emails. We have implemented the following changes to new Grafana Cloud Free and Trial acco...
Apr 22, 03:03 PM
monitoringGrafana Labs is implementing measures to safeguard the Grafana Cloud platform against ongoing unauthorized use while preserving the capabilities relied upon by our community. Effective immediately, we...
Apr 20, 10:07 PM
monitoringWe are continuing to monitor for any further issues.
+1 more updates

Cloudwatch Datasource Outage

major
Apr 23, 02:26 PMApr 23, 08:01 PMresolved
Apr 23, 08:01 PM
resolvedThis incident has been resolved. Thank you for your patience.
Apr 23, 02:39 PM
monitoringWe’ve implemented a fix and are monitoring the results to confirm the issue is fully resolved. Services may start to recover during this time.
Apr 23, 02:26 PM
investigatingWe’re currently investigating an issue affecting Cloudwatch datasources. Our team is actively working to identify the cause. Thank you for your patience.

Elevated 429 Errors Impacting Metrics Querying Across Multiple Regions

critical
Apr 20, 02:09 PMApr 20, 02:30 PMresolved
Apr 20, 02:30 PM
resolvedThis incident has been resolved. Thank you for your patience.
Apr 20, 02:21 PM
investigatingThe issue is now confirmed to be widespread, affecting Prometheus across all regions. Customers may continue to experience elevated 429 (rate limit) errors, particularly when querying metrics, with f...
Apr 20, 02:09 PM
investigatingWe are currently experiencing a major incident causing elevated 429 (rate limit) errors across multiple regions, primarily impacting metrics querying. This is a high-priority issue, and our engineeri...

Query Caching - Degraded Performance

minor
Apr 17, 09:23 PMApr 17, 10:58 PMresolved
Apr 17, 10:58 PM
resolvedThis incident has been resolved
Apr 17, 10:09 PM
monitoringCurrently prod-us-east-0 and prod-eu-west-3 have recovered, and we are continuing to monitor prod-us-central-0 which is in the process of recovery.
Apr 17, 09:23 PM
investigatingAs of 20:52 UTC, we are currently investigating degraded Query Caching performance in multiple regions. For datasources where query caching is configured, some queries may take longer than usual. Our...

Issues on Stack creation

minor
Apr 16, 12:52 PMApr 16, 02:02 PMresolved
Apr 16, 02:02 PM
resolvedThis incident has been resolved.
Apr 16, 01:19 PM
monitoringThe issue is fixed and we are currently monitoring the service.
Apr 16, 12:52 PM
identifiedSince today 16th at ~12:11UTC we are seeing issues on stack creation across all our regions. Customers will experience error message when attempting to create a stack. Our engineering team has identif...

Degraded Ticket Visibility in Support System

minor
Apr 15, 04:07 PMApr 15, 04:25 PMresolved
Apr 15, 04:25 PM
resolvedThis incident has been resolved and our ticketing system is fully operational. Thank you for your patience.
Apr 15, 04:07 PM
monitoringWe are currently experiencing an issue with our ticketing system provider that is affecting how tickets appear within our internal support views. We are continuing to receive all new tickets successf...

K6 Sporadic DNS Issues

minor
Apr 14, 09:22 AMApr 15, 12:59 PMresolved
Apr 15, 12:59 PM
resolvedThis incident is now resolved. We had intermediary issues with a flaky DNS server that caused random tests to not start properly. Since the DNS server was fixed, we haven't been seeing the issue anymo...
Apr 14, 02:29 PM
monitoringOur engineering team has deployed a fix and we are currently monitoring the behaviour of the system until full resolution.
Apr 14, 02:29 PM
monitoringWe’ve implemented a fix and are monitoring the results to confirm the issue is fully resolved. Services may start to recover during this time.
+1 more updates

k6 Cloud Service Disruption

none
Apr 14, 11:30 AMApr 14, 11:30 AMresolved
Apr 14, 01:44 PM
resolvedBetween approximately 12:30 UTC and 13:15 UTC, k6 Cloud experienced a service disruption due to issues introduced in a recent API release. During this time, users were unable to access the k6 Cloud ap...

Loki write instability in prod-eu-west-2.loki-prod-012

none
Apr 13, 11:30 AMApr 13, 11:30 AMresolved
Apr 14, 12:02 PM
resolvedThere was a period of write instability yesterday. It was between ~1330 -1730 UTC yesterday.  This was due to a scheduled maintenance.

Grafana Cloud Logs - Write degradation in us-east-3

major
Apr 10, 11:53 PMApr 11, 12:36 AMresolved
Apr 11, 12:36 AM
resolvedThis incident has been resolved.
Apr 11, 12:10 AM
monitoringA fix has been implemented and we are monitoring the results.
Apr 10, 11:53 PM
investigatingWe are seeing issues on the write path for Loki in cluster in us-east-3, and we are actively investigating this issue.

Tempo Write Outage

major
Apr 10, 07:42 PMApr 10, 09:02 PMresolved
Apr 10, 09:02 PM
resolvedThis incident has been resolved. Thank you for your patience.
Apr 10, 07:53 PM
monitoringWe’ve implemented a fix and are monitoring the results to confirm the issue is fully resolved. Services may start to recover during this time. We’ll update again within an hour.
Apr 10, 07:42 PM
investigatingWe are currently investigating a write outage affecting prod-us-east-3. The issue began at 18:50 UTC. Users may experience errors, timeouts, or unavailability while we work to identify the cause and r...

K6 Browser Testing/Timeline Not Available

minor
Apr 9, 05:34 PMApr 9, 06:50 PMresolved
Apr 9, 06:50 PM
resolvedThis incident has been resolved. Thank you for your patience.
Apr 9, 06:39 PM
identifiedWe’ve identified the cause of the issue impacting k6 browser testing/timeline. Our team is currently implementing a fix. We’ll provide another update in two hours or sooner if the situation changes.
Apr 9, 05:34 PM
investigatingWe’re currently investigating an issue affecting browser testing. Users running browser tests will not be able to see the browser timeline. Our team is actively working to identify the cause and wi...

Stability Issues for Some Customers in the prod-gb-south-1 Region.

minor
Apr 8, 05:00 PMApr 8, 05:00 PMresolved
Apr 8, 05:00 PM
resolvedWe had a stability issue for a subset of customers in the prod-gb-south-1 region. The impact was between UTC 15:20-16:30 which impacted roughly 30% of queries and rules evaluations. We've applied miti...

Unable to Edit Notification Policies

minor
Apr 7, 03:17 PMApr 7, 08:17 PMresolved
Apr 7, 08:17 PM
resolvedThis incident has been resolved. Thank you for your patience.
Apr 7, 06:03 PM
identifiedWe’ve identified the cause of the issue impacting notification policies. Our team is currently implementing a fix. We’ll provide another update in 2 hours or sooner if the situation changes.
Apr 7, 04:52 PM
identifiedWe’ve identified the cause of the issue impacting notification policies. Our team is currently implementing a fix. We’ll provide another update in 2 hours or sooner if the situation changes.
+1 more updates

Notification Policies and Contact Points Missing in UI on the Slow Release Channel

minor
Apr 6, 02:48 PMApr 7, 12:26 PMresolved
Apr 7, 12:26 PM
resolvedThis incident has been resolved.
Apr 6, 11:58 PM
monitoringWe’ve implemented a fix and are monitoring the results to confirm the issue is fully resolved. Services may start to recover during this time. We’ll update again within 2 hours.
Apr 6, 09:04 PM
identifiedWe’ve identified the cause of the issue impacting the Notification Policy and Contact Point UI. Our team is currently implementing a fix. We’ll provide another update when the fix is deployed and we...
+2 more updates

Partial K6 Test Run Outage

major
Apr 3, 03:29 PMApr 3, 05:38 PMresolved
Apr 3, 05:38 PM
resolvedThis incident has been resolved. Thank you for your patience.
Apr 3, 03:29 PM
investigatingWe're experiencing an outage affecting test runs that use k6 extensions. The issue prevents users from executing these types of test runs both locally and in Grafana Cloud. Test runs that do not use ...

Query degradation and possible rule evaluation failure on prod-eu-west-0.cortex-prod-01

minor
Apr 1, 09:56 AMApr 1, 09:13 PMresolved
Apr 1, 09:13 PM
resolvedThis incident has been resolved.
Apr 1, 10:12 AM
monitoringA fix has been implemented and we are monitoring the results.
Apr 1, 10:11 AM
investigatingWe are continuing to investigate this issue.
+1 more updates

AWS integration Degraded Performance

minor
Apr 1, 08:17 PMApr 1, 09:03 PMresolved
Apr 1, 09:03 PM
resolvedThis incident has been resolved. Thank you for your patience.
Apr 1, 08:17 PM
investigatingWe are investigating a noticeable drop in active series for the AWS integration that began around 18:15 UTC. This issue may cause scrapes to hit rate limits, which can result in individual data point...

March 2026

Prometheus writes in prod-eu-west-3 are degraded

critical
Mar 25, 02:11 PMApr 23, 08:07 PMresolved
Apr 23, 08:07 PM
resolvedThis incident has been resolved. Thank you for your patience.
Apr 20, 03:08 PM
monitoringWe are continuing to monitor for any further issues.
Apr 14, 08:11 PM
monitoringWe have deployed mitigation and seen improvement in write failures over the past week. We are still seeing intermittent spikes in latency and continue to monitor.
+7 more updates

k6 Cloud Degradation

none
Mar 31, 02:59 PMMar 31, 02:59 PMresolved
Mar 31, 02:59 PM
resolvedFrom approximately 11:00 UTC - 15:00 UTC we had a degradation that caused test start errors for a large percentage of Cloud runs managed as scripts in the GCK6 app. This has since been resolved.

Synthetic Monitoring: Some Check Creations & Updates Might be Blocked.

none
Mar 31, 02:32 PMMar 31, 02:32 PMresolved
Mar 31, 02:32 PM
resolvedThis is a retroactive status page linked to the following incident: https://status.grafana.com/incidents/38wwbz50ggrp This retroactive status page is meant to clarify the time of impact. This issue f...

Synthetic Monitoring: Some Check Creations & Updates Might be Blocked.

major
Mar 31, 02:01 PMMar 31, 02:25 PMresolved
Mar 31, 02:25 PM
resolvedThis incident has been resolved.
Mar 31, 02:01 PM
identifiedSynthetic Monitoring check creation/update for scripted and browser checks might be blocked in the plugin app for some probes. The issue only impacts creating/updating checks from the plugin app. It d...

Some of the CloudWatch queries are failing

major
Mar 31, 09:48 AMMar 31, 10:24 AMresolved
Mar 31, 10:24 AM
resolvedThis incident has been resolved.
Mar 31, 09:49 AM
monitoringWe are continuing to monitor for any further issues.
Mar 31, 09:48 AM
monitoringSome of the CloudWatch queries were failing. Started at 08:37 UTC Monitoring from 09:21 UTC

Tempo Reads Outage for Small Subset of Customers

none
Mar 30, 04:30 PMMar 30, 04:30 PMresolved
Mar 30, 06:34 PM
resolvedWe encountered an issue impacting only a small subset of customers in the prod-us-central-0 region. The incident occurred between 16:20 and 17:50 UTC on 3/30/26. This incident is now resolved.

Some Grafana Instances Unavailable

major
Mar 27, 01:36 PMMar 27, 08:48 PMresolved
Mar 27, 08:48 PM
resolvedThis incident has been resolved. Thank you for your patience.
Mar 27, 08:16 PM
monitoringWe’ve implemented a fix and are monitoring the results to confirm the issue is fully resolved. Services may start to recover during this time. We’ll update again in 1 hour.
Mar 27, 06:10 PM
identifiedWe’ve identified the cause of the issue impacting the instances. Our team is currently implementing a fix. We’ll provide another update in 1–2 hours, or sooner, if the situation changes.
+3 more updates

Prometheus writes, Logs, and Synthetic Monitoring in prod-eu-west-3 are degraded

minor
Mar 24, 09:08 AMMar 25, 12:52 PMresolved
Mar 25, 12:52 PM
resolvedThis incident has been resolved.
Mar 25, 07:43 AM
investigatingThis is also now impacting Logs and Synthetic Monitoring in prod-eu-west-3. For Synthetic Monitoring, users might observe errors pushing check execution metrics, and this can eventually lead to miss...
Mar 25, 07:04 AM
investigatingWe are moving this back to 'Investigating' as we are now observing a substantial drop in successful ingestion and increase in write path errors, and elevated rule evaluation latency and error. Reads a...
+3 more updates

Service degradation on Dashboard loading in several clusters.

none
Mar 24, 02:00 PMMar 24, 02:00 PMresolved
Mar 25, 10:30 AM
resolvedAn issue affecting Grafana Cloud instances was diagnosed yesterday 24th of March that avoided Dashboards to be loaded correctly. The incident impacted the following clusters: - GCP US Central (us-cen...

Grafana Assistant Unavailable in prod-us-east-0

major
Mar 23, 05:03 PMMar 23, 06:48 PMresolved
Mar 23, 06:48 PM
resolvedThis incident has been resolved.
Mar 23, 06:25 PM
identifiedThe issue has been identified, and we are implementing a fix.
Mar 23, 06:07 PM
investigatingThe impact extends beyond the TOS check. Assistant is completely unavailable in the impacted region.
+2 more updates

Authentication API Database Down in prod-eu-west-2 and prod-eu-west-4

major
Mar 20, 03:00 PMMar 20, 03:41 PMresolved
Mar 20, 03:41 PM
resolvedThis incident has been resolved.
Mar 20, 03:08 PM
investigatingWe have observed impact in prod-eu-west-4 as well.
Mar 20, 03:00 PM
investigatingWe are currently investigating an issue impacting the main database for Authentication API's in the prod-eu-west-2 region. Writes are currently failing, but reads are operational.

Various Datasource Issues

major
Mar 19, 04:46 PMMar 19, 06:44 PMresolved
Mar 19, 06:44 PM
resolvedThis incident has been resolved.
Mar 19, 05:56 PM
monitoringWe are continuing to monitor for any further issues.
Mar 19, 05:56 PM
monitoringWe have observed recovery for the Cloudwatch Datasource. We are now seeing failures for the following Datasources: Aurora Opensearch X-Ray Timestream Redshift Sitewise A fix for the above is being...
+2 more updates

Degraded performance of Grafana Cloud k6 test runs

major
Mar 19, 11:17 AMMar 19, 06:11 PMresolved
Mar 19, 06:11 PM
resolvedOur engineering team has deployed a fix and we continue to observe a continued period of recovery. At this time, we are considering this issue resolved. No further updates.
Mar 19, 11:17 AM
investigatingSome customers are seeing degraded performance and errors from certain v6 API endpoints. We are investigating the issue.

Grafana Cloud Logs - Write degradation in Azure Netherlands (eu-west-3)

minor
Mar 13, 10:28 AMMar 18, 07:13 AMresolved
Mar 18, 07:13 AM
resolvedWe have been observing stability for a period of time and will mark the incident as resolved at this time.
Mar 13, 09:22 PM
investigatingWe are continuing to investigate this issue with our CSP, and will provide updates as they become available.
Mar 13, 10:28 AM
investigatingWe are seeing issues on the write path for Loki in cluster Azure Netherlands (eu-west-3). Impact will reflect in degradation of logs ingestion on that cluster. Our engineering team is already working ...

Rule Evaluation Outage in prod-us-west-0

major
Mar 11, 05:10 PMMar 13, 06:15 PMresolved
Mar 13, 06:15 PM
resolvedThis incident has been resolved.
Mar 11, 06:02 PM
monitoringA fix has been implemented and we are monitoring the results.
Mar 11, 05:10 PM
investigatingWe are currently investigating an issue impacting rule evaluation for a subset of customers in the prod-us-west-0 region. We will provide updates as they become available.

Increased number of Aborted-by-Systems with a k6 binary building errors

major
Mar 13, 07:41 AMMar 13, 06:11 PMresolved
Mar 13, 06:11 PM
resolvedThis incident has been resolved.
Mar 13, 12:49 PM
monitoringA fix has been implemented and we are monitoring the results.
Mar 13, 08:45 AM
identifiedThe issue has been identified and a fix is being implemented.
+1 more updates

Grafana Cloud Logs - Write degradation in Azure Netherlands (eu-west-3)

minor
Mar 11, 08:31 AMMar 12, 01:18 PMresolved
Mar 12, 01:18 PM
resolvedThis incident has been resolved.
Mar 11, 09:13 AM
investigatingWe are also reporting impact to Faro performance in the same region. We are continuing to investigate this issue.
Mar 11, 08:31 AM
investigatingWe are seeing issues on the write path for Loki in cluster Azure Netherlands (eu-west-3). Impact will reflect in degradation of logs ingestion on that cluster. Our engineering team is already working ...

Some Write Failures in prod-eu-west-3.

major
Mar 10, 06:00 PMMar 11, 09:48 PMresolved
Mar 11, 09:48 PM
resolvedThis incident has been resolved.
Mar 11, 03:51 PM
monitoringThings have been stable, and we have a potential mitigation should this issue arise again. We are monitoring the issue in the meantime.
Mar 11, 01:35 AM
identifiedThere are ongoing intermittent elevated transient write failures. We will continue to provide additional updates as more information becomes available.
+2 more updates

Metrics write path outage in prod-us-central-0 and prod-us-central-5

minor
Mar 9, 06:03 PMMar 10, 09:17 PMresolved
Mar 10, 09:17 PM
resolvedThis incident has been resolved.
Mar 9, 06:03 PM
monitoringFrom 15:30 to 15:45 UTC and from 16:53 to 17:03 UTC, the prod-us-central-0 and prod-us-central-5 regions saw elevated latency and error rates on the write path. We're monitoring now.

Fleet Managment Elevated Rate of Errors

minor
Mar 9, 02:20 PMMar 10, 08:54 PMresolved
Mar 10, 08:54 PM
resolvedThis incident has been resolved.
Mar 10, 06:11 PM
investigatingOur engineering team continues to work towards a resolution for this issue.
Mar 9, 02:20 PM
investigatingSome users in prod-us-central-0 may be seeing elevated rate of errors when fetching configurations. Our engineers are currently investigating this issue.

Service degradation on Logs Read path in AWS US West (us-west-0)

minor
Mar 10, 03:26 PMMar 10, 08:39 PMresolved
Mar 10, 08:39 PM
resolvedThis incident has been resolved.
Mar 10, 03:26 PM
identifiedThere has been a reoccurrence o the issues on the Read path of Loki services on AWS US West since yesterday 9th around ~17:15UTC. The issue has been identified, and resolutions steps has been taken t...

Various Issues with HG Pages

major
Mar 10, 06:06 PMMar 10, 07:17 PMresolved
Mar 10, 07:17 PM
resolvedThis incident has been resolved.
Mar 10, 06:06 PM
investigatingWe are noticing issues with various HG pages. Our engineering team is actively looking into it.

Outage for prod-eu-central-0 due to AWS S3 outage.

none
Mar 7, 08:07 PMMar 9, 08:59 AMresolved
Mar 9, 08:59 AM
resolvedThis incident has been resolved.
Mar 8, 11:30 AM
monitoringSince about 20:03 UTC we have seen AWS S3 recover and also our services are recovering, we are monitoring.
Mar 7, 08:10 PM
investigatingSince about 20:03 UTC we have seen AWS S3 recover and also our services are recovering, we are monitoring.
+2 more updates

February 2026

Grafana Cloud Metrics - Intermittent Write Latency in prod-us-central, prod-us-central-5, and prod-eu-west-0

minor
Feb 25, 07:54 PMMar 17, 06:22 PMresolved
Mar 17, 06:22 PM
resolvedThis incident is now resolved. During the incident the Cloud Metrics platform experienced intermittent latency spikes communicating with a backend cloud service in the prod-us-central-0 and prod-us-c...
Mar 6, 09:44 PM
monitoringWe are rolling out a mitigation across the environments in these regions, and preemptively where possible to ensure it doesn’t spread elsewhere.
Mar 6, 08:53 PM
monitoringWe have seen an increase in latency in our cloud providers services, and are rolling out a change to mitigate the issue. We are monitoring.
+5 more updates

Get Grafana Cloud Outage Alerts

Be the first to know when Grafana Cloud go down.