Grafana Cloud Outage History
50 incidents reported. Data sourced from the official Grafana Cloud status page.
50
Total Incidents
20
Major/Critical
22
Minor
49
Resolved
May 2026
Elevated Error Rate of Browser Checks in PoP Oregon
minorMay 5, 04:11 PMinvestigating
May 5, 04:11 PM
investigating — We’re currently investigating an issue affecting browser checks in the PoP Oregon region. Our team is actively working to identify the cause. Thank you for your patience.
k6 Partial Outage
majorMay 4, 10:58 PM→May 5, 02:09 AMresolved
May 5, 02:09 AM
resolved — This incident has been resolved. Thank you for your patience.
May 5, 12:04 AM
monitoring — We’ve implemented a fix and are monitoring the results to confirm the issue is fully resolved. Services may start to recover during this time.
May 4, 11:23 PM
investigating — After further investigation, this issue may also be affecting Synthetic Monitoring.
We continue to identify the cause and will update as soon as we have more information.
+1 more updates
Ingestion Errors for AWS Cloud Provider Observability Metric Streams in prod-us-central-7
majorMay 1, 09:14 AM→May 1, 10:27 AMresolved
May 1, 10:27 AM
resolved — This incident has been resolved.
May 1, 09:43 AM
monitoring — A fix has been implemented and we are monitoring the results.
May 1, 09:42 AM
investigating — We are continuing to investigate this issue.
+1 more updates
April 2026
Gateway Slowness Detected in Prod (US-East-1)
minorApr 28, 09:20 AM→Apr 30, 03:11 PMresolved
Apr 30, 03:11 PM
resolved — After further review, this was a false alarm and should not have affected any users.
This incident has been resolved. Thank you for your patience.
Apr 28, 09:20 AM
investigating — Successful requests have dropped, users may not be able to access their instances.. The issue is under investigation.
Investigating Issues Saving SQL Datasource Credentials
minorApr 28, 06:46 PM→Apr 29, 01:37 PMresolved
Apr 29, 01:37 PM
resolved — This incident has been resolved. Thank you for your patience.
Apr 28, 06:59 PM
monitoring — We’ve identified the cause of the issue impacting SQL datasources. Our team is currently implementing a fix and are monitoring the results to confirm the issue is fully resolved. Services may start to...
Apr 28, 06:46 PM
investigating — We are currently investigating reports of issues affecting SQL-based data sources where users are unable to save credentials.
This appears to impact a subset of customers and may be occurring across ...
Performance Testing – Degraded Service (Resolved)
noneApr 29, 12:00 PM→Apr 29, 12:00 PMresolved
Apr 29, 01:51 PM
resolved — We experienced degraded performance affecting Performance Testing from 13:10 UTC to 13:20 UTC. During this time, users may not have been able to start new test runs.
The issue has been resolved, and ...
Elevated write latency for AWS Metrics Streaming integration in us-east-3 region.
minorApr 29, 10:30 AM→Apr 29, 10:30 AMresolved
Apr 29, 12:57 PM
resolved — We were facing an incident with AWS Metrics Streaming integration in us-east-3 region manifesting in elevated ingestion latency. The incident started at around 10:45 UTC and was resolved at around 12:...
InfluxDB Datasource - Intermittent Failures
majorApr 27, 05:08 PM→Apr 27, 11:24 PMresolved
Apr 27, 11:24 PM
resolved — This incident has been resolved. Thank you for your patience.
Apr 27, 11:13 PM
monitoring — We’ve implemented a fix and are monitoring the results to confirm the issue is fully resolved. Services may start to recover during this time.
Apr 27, 06:01 PM
identified — We’ve identified the cause of the issue impacting the InfluxDB datasource. Our team is currently implementing a fix.
+1 more updates
Restrictions on Alerts & Reports for Grafana Cloud Free/Trial Users
minorApr 20, 09:12 PM→Apr 24, 03:04 PMresolved
Apr 24, 03:04 PM
resolved — Grafana Labs has taken steps to safeguard the Grafana Cloud platform against the distribution of unauthorized emails. We have implemented the following changes to new Grafana Cloud Free and Trial acco...
Apr 22, 03:03 PM
monitoring — Grafana Labs is implementing measures to safeguard the Grafana Cloud platform against ongoing unauthorized use while preserving the capabilities relied upon by our community. Effective immediately, we...
Apr 20, 10:07 PM
monitoring — We are continuing to monitor for any further issues.
+1 more updates
Cloudwatch Datasource Outage
majorApr 23, 02:26 PM→Apr 23, 08:01 PMresolved
Apr 23, 08:01 PM
resolved — This incident has been resolved. Thank you for your patience.
Apr 23, 02:39 PM
monitoring — We’ve implemented a fix and are monitoring the results to confirm the issue is fully resolved. Services may start to recover during this time.
Apr 23, 02:26 PM
investigating — We’re currently investigating an issue affecting Cloudwatch datasources. Our team is actively working to identify the cause. Thank you for your patience.
Elevated 429 Errors Impacting Metrics Querying Across Multiple Regions
criticalApr 20, 02:09 PM→Apr 20, 02:30 PMresolved
Apr 20, 02:30 PM
resolved — This incident has been resolved. Thank you for your patience.
Apr 20, 02:21 PM
investigating — The issue is now confirmed to be widespread, affecting Prometheus across all regions.
Customers may continue to experience elevated 429 (rate limit) errors, particularly when querying metrics, with f...
Apr 20, 02:09 PM
investigating — We are currently experiencing a major incident causing elevated 429 (rate limit) errors across multiple regions, primarily impacting metrics querying.
This is a high-priority issue, and our engineeri...
Query Caching - Degraded Performance
minorApr 17, 09:23 PM→Apr 17, 10:58 PMresolved
Apr 17, 10:58 PM
resolved — This incident has been resolved
Apr 17, 10:09 PM
monitoring — Currently prod-us-east-0 and prod-eu-west-3 have recovered, and we are continuing to monitor prod-us-central-0 which is in the process of recovery.
Apr 17, 09:23 PM
investigating — As of 20:52 UTC, we are currently investigating degraded Query Caching performance in multiple regions. For datasources where query caching is configured, some queries may take longer than usual.
Our...
Issues on Stack creation
minorApr 16, 12:52 PM→Apr 16, 02:02 PMresolved
Apr 16, 02:02 PM
resolved — This incident has been resolved.
Apr 16, 01:19 PM
monitoring — The issue is fixed and we are currently monitoring the service.
Apr 16, 12:52 PM
identified — Since today 16th at ~12:11UTC we are seeing issues on stack creation across all our regions. Customers will experience error message when attempting to create a stack.
Our engineering team has identif...
Degraded Ticket Visibility in Support System
minorApr 15, 04:07 PM→Apr 15, 04:25 PMresolved
Apr 15, 04:25 PM
resolved — This incident has been resolved and our ticketing system is fully operational. Thank you for your patience.
Apr 15, 04:07 PM
monitoring — We are currently experiencing an issue with our ticketing system provider that is affecting how tickets appear within our internal support views.
We are continuing to receive all new tickets successf...
K6 Sporadic DNS Issues
minorApr 14, 09:22 AM→Apr 15, 12:59 PMresolved
Apr 15, 12:59 PM
resolved — This incident is now resolved. We had intermediary issues with a flaky DNS server that caused random tests to not start properly. Since the DNS server was fixed, we haven't been seeing the issue anymo...
Apr 14, 02:29 PM
monitoring — Our engineering team has deployed a fix and we are currently monitoring the behaviour of the system until full resolution.
Apr 14, 02:29 PM
monitoring — We’ve implemented a fix and are monitoring the results to confirm the issue is fully resolved. Services may start to recover during this time.
+1 more updates
k6 Cloud Service Disruption
noneApr 14, 11:30 AM→Apr 14, 11:30 AMresolved
Apr 14, 01:44 PM
resolved — Between approximately 12:30 UTC and 13:15 UTC, k6 Cloud experienced a service disruption due to issues introduced in a recent API release. During this time, users were unable to access the k6 Cloud ap...
Loki write instability in prod-eu-west-2.loki-prod-012
noneApr 13, 11:30 AM→Apr 13, 11:30 AMresolved
Apr 14, 12:02 PM
resolved — There was a period of write instability yesterday. It was between ~1330 -1730 UTC yesterday. This was due to a scheduled maintenance.
Grafana Cloud Logs - Write degradation in us-east-3
majorApr 10, 11:53 PM→Apr 11, 12:36 AMresolved
Apr 11, 12:36 AM
resolved — This incident has been resolved.
Apr 11, 12:10 AM
monitoring — A fix has been implemented and we are monitoring the results.
Apr 10, 11:53 PM
investigating — We are seeing issues on the write path for Loki in cluster in us-east-3, and we are actively investigating this issue.
Tempo Write Outage
majorApr 10, 07:42 PM→Apr 10, 09:02 PMresolved
Apr 10, 09:02 PM
resolved — This incident has been resolved. Thank you for your patience.
Apr 10, 07:53 PM
monitoring — We’ve implemented a fix and are monitoring the results to confirm the issue is fully resolved. Services may start to recover during this time. We’ll update again within an hour.
Apr 10, 07:42 PM
investigating — We are currently investigating a write outage affecting prod-us-east-3. The issue began at 18:50 UTC. Users may experience errors, timeouts, or unavailability while we work to identify the cause and r...
K6 Browser Testing/Timeline Not Available
minorApr 9, 05:34 PM→Apr 9, 06:50 PMresolved
Apr 9, 06:50 PM
resolved — This incident has been resolved. Thank you for your patience.
Apr 9, 06:39 PM
identified — We’ve identified the cause of the issue impacting k6 browser testing/timeline. Our team is currently implementing a fix. We’ll provide another update in two hours or sooner if the situation changes.
Apr 9, 05:34 PM
investigating — We’re currently investigating an issue affecting browser testing.
Users running browser tests will not be able to see the browser timeline.
Our team is actively working to identify the cause and wi...
Stability Issues for Some Customers in the prod-gb-south-1 Region.
minorApr 8, 05:00 PM→Apr 8, 05:00 PMresolved
Apr 8, 05:00 PM
resolved — We had a stability issue for a subset of customers in the prod-gb-south-1 region. The impact was between UTC 15:20-16:30 which impacted roughly 30% of queries and rules evaluations. We've applied miti...
Unable to Edit Notification Policies
minorApr 7, 03:17 PM→Apr 7, 08:17 PMresolved
Apr 7, 08:17 PM
resolved — This incident has been resolved. Thank you for your patience.
Apr 7, 06:03 PM
identified — We’ve identified the cause of the issue impacting notification policies. Our team is currently implementing a fix. We’ll provide another update in 2 hours or sooner if the situation changes.
Apr 7, 04:52 PM
identified — We’ve identified the cause of the issue impacting notification policies. Our team is currently implementing a fix. We’ll provide another update in 2 hours or sooner if the situation changes.
+1 more updates
Notification Policies and Contact Points Missing in UI on the Slow Release Channel
minorApr 6, 02:48 PM→Apr 7, 12:26 PMresolved
Apr 7, 12:26 PM
resolved — This incident has been resolved.
Apr 6, 11:58 PM
monitoring — We’ve implemented a fix and are monitoring the results to confirm the issue is fully resolved. Services may start to recover during this time. We’ll update again within 2 hours.
Apr 6, 09:04 PM
identified — We’ve identified the cause of the issue impacting the Notification Policy and Contact Point UI. Our team is currently implementing a fix.
We’ll provide another update when the fix is deployed and we...
+2 more updates
Partial K6 Test Run Outage
majorApr 3, 03:29 PM→Apr 3, 05:38 PMresolved
Apr 3, 05:38 PM
resolved — This incident has been resolved. Thank you for your patience.
Apr 3, 03:29 PM
investigating — We're experiencing an outage affecting test runs that use k6 extensions. The issue prevents users from executing these types of test runs both locally and in Grafana Cloud.
Test runs that do not use ...
Query degradation and possible rule evaluation failure on prod-eu-west-0.cortex-prod-01
minorApr 1, 09:56 AM→Apr 1, 09:13 PMresolved
Apr 1, 09:13 PM
resolved — This incident has been resolved.
Apr 1, 10:12 AM
monitoring — A fix has been implemented and we are monitoring the results.
Apr 1, 10:11 AM
investigating — We are continuing to investigate this issue.
+1 more updates
AWS integration Degraded Performance
minorApr 1, 08:17 PM→Apr 1, 09:03 PMresolved
Apr 1, 09:03 PM
resolved — This incident has been resolved. Thank you for your patience.
Apr 1, 08:17 PM
investigating — We are investigating a noticeable drop in active series for the AWS integration that began around 18:15 UTC.
This issue may cause scrapes to hit rate limits, which can result in individual data point...
March 2026
Prometheus writes in prod-eu-west-3 are degraded
criticalMar 25, 02:11 PM→Apr 23, 08:07 PMresolved
Apr 23, 08:07 PM
resolved — This incident has been resolved. Thank you for your patience.
Apr 20, 03:08 PM
monitoring — We are continuing to monitor for any further issues.
Apr 14, 08:11 PM
monitoring — We have deployed mitigation and seen improvement in write failures over the past week. We are still seeing intermittent spikes in latency and continue to monitor.
+7 more updates
k6 Cloud Degradation
noneMar 31, 02:59 PM→Mar 31, 02:59 PMresolved
Mar 31, 02:59 PM
resolved — From approximately 11:00 UTC - 15:00 UTC we had a degradation that caused test start errors for a large percentage of Cloud runs managed as scripts in the GCK6 app. This has since been resolved.
Synthetic Monitoring: Some Check Creations & Updates Might be Blocked.
noneMar 31, 02:32 PM→Mar 31, 02:32 PMresolved
Mar 31, 02:32 PM
resolved — This is a retroactive status page linked to the following incident: https://status.grafana.com/incidents/38wwbz50ggrp
This retroactive status page is meant to clarify the time of impact. This issue f...
Synthetic Monitoring: Some Check Creations & Updates Might be Blocked.
majorMar 31, 02:01 PM→Mar 31, 02:25 PMresolved
Mar 31, 02:25 PM
resolved — This incident has been resolved.
Mar 31, 02:01 PM
identified — Synthetic Monitoring check creation/update for scripted and browser checks might be blocked in the plugin app for some probes. The issue only impacts creating/updating checks from the plugin app. It d...
Some of the CloudWatch queries are failing
majorMar 31, 09:48 AM→Mar 31, 10:24 AMresolved
Mar 31, 10:24 AM
resolved — This incident has been resolved.
Mar 31, 09:49 AM
monitoring — We are continuing to monitor for any further issues.
Mar 31, 09:48 AM
monitoring — Some of the CloudWatch queries were failing.
Started at 08:37 UTC
Monitoring from 09:21 UTC
Tempo Reads Outage for Small Subset of Customers
noneMar 30, 04:30 PM→Mar 30, 04:30 PMresolved
Mar 30, 06:34 PM
resolved — We encountered an issue impacting only a small subset of customers in the prod-us-central-0 region. The incident occurred between 16:20 and 17:50 UTC on 3/30/26. This incident is now resolved.
Some Grafana Instances Unavailable
majorMar 27, 01:36 PM→Mar 27, 08:48 PMresolved
Mar 27, 08:48 PM
resolved — This incident has been resolved. Thank you for your patience.
Mar 27, 08:16 PM
monitoring — We’ve implemented a fix and are monitoring the results to confirm the issue is fully resolved. Services may start to recover during this time. We’ll update again in 1 hour.
Mar 27, 06:10 PM
identified — We’ve identified the cause of the issue impacting the instances. Our team is currently implementing a fix. We’ll provide another update in 1–2 hours, or sooner, if the situation changes.
+3 more updates
Prometheus writes, Logs, and Synthetic Monitoring in prod-eu-west-3 are degraded
minorMar 24, 09:08 AM→Mar 25, 12:52 PMresolved
Mar 25, 12:52 PM
resolved — This incident has been resolved.
Mar 25, 07:43 AM
investigating — This is also now impacting Logs and Synthetic Monitoring in prod-eu-west-3.
For Synthetic Monitoring, users might observe errors pushing check execution metrics, and this can eventually lead to miss...
Mar 25, 07:04 AM
investigating — We are moving this back to 'Investigating' as we are now observing a substantial drop in successful ingestion and increase in write path errors, and elevated rule evaluation latency and error. Reads a...
+3 more updates
Service degradation on Dashboard loading in several clusters.
noneMar 24, 02:00 PM→Mar 24, 02:00 PMresolved
Mar 25, 10:30 AM
resolved — An issue affecting Grafana Cloud instances was diagnosed yesterday 24th of March that avoided Dashboards to be loaded correctly.
The incident impacted the following clusters:
- GCP US Central (us-cen...
Grafana Assistant Unavailable in prod-us-east-0
majorMar 23, 05:03 PM→Mar 23, 06:48 PMresolved
Mar 23, 06:48 PM
resolved — This incident has been resolved.
Mar 23, 06:25 PM
identified — The issue has been identified, and we are implementing a fix.
Mar 23, 06:07 PM
investigating — The impact extends beyond the TOS check. Assistant is completely unavailable in the impacted region.
+2 more updates
Authentication API Database Down in prod-eu-west-2 and prod-eu-west-4
majorMar 20, 03:00 PM→Mar 20, 03:41 PMresolved
Mar 20, 03:41 PM
resolved — This incident has been resolved.
Mar 20, 03:08 PM
investigating — We have observed impact in prod-eu-west-4 as well.
Mar 20, 03:00 PM
investigating — We are currently investigating an issue impacting the main database for Authentication API's in the prod-eu-west-2 region. Writes are currently failing, but reads are operational.
Various Datasource Issues
majorMar 19, 04:46 PM→Mar 19, 06:44 PMresolved
Mar 19, 06:44 PM
resolved — This incident has been resolved.
Mar 19, 05:56 PM
monitoring — We are continuing to monitor for any further issues.
Mar 19, 05:56 PM
monitoring — We have observed recovery for the Cloudwatch Datasource.
We are now seeing failures for the following Datasources:
Aurora
Opensearch
X-Ray
Timestream
Redshift
Sitewise
A fix for the above is being...
+2 more updates
Degraded performance of Grafana Cloud k6 test runs
majorMar 19, 11:17 AM→Mar 19, 06:11 PMresolved
Mar 19, 06:11 PM
resolved — Our engineering team has deployed a fix and we continue to observe a continued period of recovery.
At this time, we are considering this issue resolved. No further updates.
Mar 19, 11:17 AM
investigating — Some customers are seeing degraded performance and errors from certain v6 API endpoints. We are investigating the issue.
Grafana Cloud Logs - Write degradation in Azure Netherlands (eu-west-3)
minorMar 13, 10:28 AM→Mar 18, 07:13 AMresolved
Mar 18, 07:13 AM
resolved — We have been observing stability for a period of time and will mark the incident as resolved at this time.
Mar 13, 09:22 PM
investigating — We are continuing to investigate this issue with our CSP, and will provide updates as they become available.
Mar 13, 10:28 AM
investigating — We are seeing issues on the write path for Loki in cluster Azure Netherlands (eu-west-3). Impact will reflect in degradation of logs ingestion on that cluster. Our engineering team is already working ...
Rule Evaluation Outage in prod-us-west-0
majorMar 11, 05:10 PM→Mar 13, 06:15 PMresolved
Mar 13, 06:15 PM
resolved — This incident has been resolved.
Mar 11, 06:02 PM
monitoring — A fix has been implemented and we are monitoring the results.
Mar 11, 05:10 PM
investigating — We are currently investigating an issue impacting rule evaluation for a subset of customers in the prod-us-west-0 region. We will provide updates as they become available.
Increased number of Aborted-by-Systems with a k6 binary building errors
majorMar 13, 07:41 AM→Mar 13, 06:11 PMresolved
Mar 13, 06:11 PM
resolved — This incident has been resolved.
Mar 13, 12:49 PM
monitoring — A fix has been implemented and we are monitoring the results.
Mar 13, 08:45 AM
identified — The issue has been identified and a fix is being implemented.
+1 more updates
Grafana Cloud Logs - Write degradation in Azure Netherlands (eu-west-3)
minorMar 11, 08:31 AM→Mar 12, 01:18 PMresolved
Mar 12, 01:18 PM
resolved — This incident has been resolved.
Mar 11, 09:13 AM
investigating — We are also reporting impact to Faro performance in the same region. We are continuing to investigate this issue.
Mar 11, 08:31 AM
investigating — We are seeing issues on the write path for Loki in cluster Azure Netherlands (eu-west-3). Impact will reflect in degradation of logs ingestion on that cluster. Our engineering team is already working ...
Some Write Failures in prod-eu-west-3.
majorMar 10, 06:00 PM→Mar 11, 09:48 PMresolved
Mar 11, 09:48 PM
resolved — This incident has been resolved.
Mar 11, 03:51 PM
monitoring — Things have been stable, and we have a potential mitigation should this issue arise again. We are monitoring the issue in the meantime.
Mar 11, 01:35 AM
identified — There are ongoing intermittent elevated transient write failures. We will continue to provide additional updates as more information becomes available.
+2 more updates
Metrics write path outage in prod-us-central-0 and prod-us-central-5
minorMar 9, 06:03 PM→Mar 10, 09:17 PMresolved
Mar 10, 09:17 PM
resolved — This incident has been resolved.
Mar 9, 06:03 PM
monitoring — From 15:30 to 15:45 UTC and from 16:53 to 17:03 UTC, the prod-us-central-0 and prod-us-central-5 regions saw elevated latency and error rates on the write path.
We're monitoring now.
Fleet Managment Elevated Rate of Errors
minorMar 9, 02:20 PM→Mar 10, 08:54 PMresolved
Mar 10, 08:54 PM
resolved — This incident has been resolved.
Mar 10, 06:11 PM
investigating — Our engineering team continues to work towards a resolution for this issue.
Mar 9, 02:20 PM
investigating — Some users in prod-us-central-0 may be seeing elevated rate of errors when fetching configurations. Our engineers are currently investigating this issue.
Service degradation on Logs Read path in AWS US West (us-west-0)
minorMar 10, 03:26 PM→Mar 10, 08:39 PMresolved
Mar 10, 08:39 PM
resolved — This incident has been resolved.
Mar 10, 03:26 PM
identified — There has been a reoccurrence o the issues on the Read path of Loki services on AWS US West since yesterday 9th around ~17:15UTC.
The issue has been identified, and resolutions steps has been taken t...
Various Issues with HG Pages
majorMar 10, 06:06 PM→Mar 10, 07:17 PMresolved
Mar 10, 07:17 PM
resolved — This incident has been resolved.
Mar 10, 06:06 PM
investigating — We are noticing issues with various HG pages. Our engineering team is actively looking into it.
Outage for prod-eu-central-0 due to AWS S3 outage.
noneMar 7, 08:07 PM→Mar 9, 08:59 AMresolved
Mar 9, 08:59 AM
resolved — This incident has been resolved.
Mar 8, 11:30 AM
monitoring — Since about 20:03 UTC we have seen AWS S3 recover and also our services are recovering, we are monitoring.
Mar 7, 08:10 PM
investigating — Since about 20:03 UTC we have seen AWS S3 recover and also our services are recovering, we are monitoring.
+2 more updates
February 2026
Grafana Cloud Metrics - Intermittent Write Latency in prod-us-central, prod-us-central-5, and prod-eu-west-0
minorFeb 25, 07:54 PM→Mar 17, 06:22 PMresolved
Mar 17, 06:22 PM
resolved — This incident is now resolved.
During the incident the Cloud Metrics platform experienced intermittent latency spikes communicating with a backend cloud service in the prod-us-central-0 and prod-us-c...
Mar 6, 09:44 PM
monitoring — We are rolling out a mitigation across the environments in these regions, and preemptively where possible to ensure it doesn’t spread elsewhere.
Mar 6, 08:53 PM
monitoring — We have seen an increase in latency in our cloud providers services, and are rolling out a change to mitigate the issue. We are monitoring.
+5 more updates
Related Incident Histories
Get Grafana Cloud Outage Alerts
Be the first to know when Grafana Cloud go down.