Fly.io Outage History

50 incidents reported. Data sourced from the official Fly.io status page.

50
Total Incidents
20
Major/Critical
23
Minor
49
Resolved

June 2026

Log search unavailable

minor
Jun 19, 06:05 PMmonitoring
Jun 19, 06:19 PM
monitoringWe've applied a fix for this issue. Historical logs are currently backfilling. We will post an update once logs have finished backfilling and current logs are being ingested normally.
Jun 19, 06:10 PM
investigatingLog search is available; however, new app logs since ~1 hour ago are missing and new logs are not being ingested. We are continuing to investigate.
Jun 19, 06:05 PM
investigatingWe are investigating an issue causing application log search to be unavailable. This is affecting the Fly Metrics log search panels, and historical application logs initially returned from the `fly lo...

Network Issues in SIN

major
Jun 17, 03:21 AMJun 17, 04:55 AMresolved
Jun 17, 04:55 AM
resolvedThis incident has been resolved.
Jun 17, 04:38 AM
monitoringNetwork connectivity in SIN has been fully restored. We're continuing to monitor.
Jun 17, 04:22 AM
identifiedWe are seeing recovery of network connectivity between SIN and most destinations. We're continuing to work with our upstream provider to resolve the remaining issues.
+2 more updates

Macaroon Auth + Machines API Issues

critical
Jun 15, 03:03 PMJun 15, 04:41 PMresolved
Jun 15, 04:41 PM
resolvedThis incident has been resolved and we are seeing all platform functions operate normally.
Jun 15, 04:15 PM
monitoringA fix has been implemented and we are monitoring the results.
Jun 15, 04:15 PM
identifiedWe have deployed another change and are seeing wider improvements in platform stability across all regions. Performance is trending to normal, though users may still see some degradation at this time....
+6 more updates

MPG cluster provisioning is broken

none
Jun 15, 06:20 AMJun 15, 06:39 AMresolved
Jun 15, 06:39 AM
resolvedThis incident has been resolved.
Jun 15, 06:20 AM
investigatingNew MPG cluster provisioning is broken. Existing MPG clusters are not affected. Newly created organizations may see errors while SSHing into their machines. We are investigating the issue.

Elevated Sprites error rates in SIN

major
Jun 12, 02:35 AMJun 12, 04:30 AMresolved
Jun 12, 04:30 AM
resolvedThis incident has been resolved.
Jun 12, 03:24 AM
monitoringA fix has been implemented and we are seeing error rates for sprites in SIN normalize. We are continuing to monitor to ensure full recovery.
Jun 12, 02:35 AM
investigatingWe are investigating elevated 500 / internal server error rates with Sprites in the SIN region. Users may see increased errors when accessing sprites located in this region, or for requests to the Spr...

Increased network latency in North America

minor
Jun 11, 01:16 AMJun 11, 11:12 AMresolved
Jun 11, 02:12 PM
resolvedThis incident has been resolved.
Jun 11, 10:09 AM
monitoringOur upstream paths have been fixed. We are monitoring the results.
Jun 11, 09:35 AM
identifiedWe are working with our upstream network provider to address periodic loss of connectivity over transit in ord
+5 more updates

Ingress Traffic issues in GRU

major
Jun 10, 07:39 PMJun 10, 07:52 PMresolved
Jun 10, 07:52 PM
resolvedThis incident has been resolved.
Jun 10, 07:39 PM
identifiedSome of our edge nodes in GRU has suffered an error that crashed some of the critical services. We're currently working to bring them back online. Some traffic entering through GRU (i.e. users connect...

Emergency maintenance of Petsem causing some control plane errors

major
Jun 9, 06:20 PMJun 9, 06:42 PMresolved
Jun 9, 06:42 PM
resolvedThis incident has been resolved.
Jun 9, 06:28 PM
monitoringThe maintenance has been completed and control plane functions should recover to normal. We're monitoring for any further complications.
Jun 9, 06:20 PM
identifiedWe're performing an emergency maintenance on Petsem, our secrets management service. Some control plane write operations may temporarily fail, for example, creating new apps or secrets. Existing apps ...

Managed Postgres Control Plane Issues in IAD

major
Jun 9, 02:57 PMJun 9, 04:15 PMresolved
Jun 9, 09:31 PM
resolvedThis incident has been resolved.
Jun 9, 03:19 PM
monitoringAn initial fix has been implemented and connectivity to all impacted clusters has been restored. We are continuing to monitor to ensure stable recovery.
Jun 9, 03:06 PM
identifiedWe are continuing to address this issue. Some clusters in IAD are unavailable at this time, some users may have seen unexpected cluter restarts. We are working on restoring normal performance for all ...
+1 more updates

egress ips are broken in ORD

minor
Jun 9, 09:16 AMJun 9, 10:01 AMresolved
Jun 9, 10:01 AM
resolvedThis incident has been resolved.
Jun 9, 09:37 AM
monitoringA fix has been implemented and we are monitoring the results.
Jun 9, 09:16 AM
investigatingEgress ips are broken in most of ORD, we are currently investigating this issue

Capacity issues in ARN region

minor
Jun 8, 10:31 AMJun 8, 12:30 PMresolved
Jun 8, 12:30 PM
resolvedThis incident has been resolved.
Jun 8, 10:31 AM
investigatingThe ARN region is low on available host capacity. Creating new machines, or starting currently stopped/suspended machines, may fail at this time. We are working on provisioning new host capacity in t...

Consul cluster degradation

none
Jun 4, 01:44 PMJun 4, 05:03 PMresolved
Jun 4, 05:03 PM
resolvedWe have restored the degraded Consul cluster. All affected functionality is now working correctly: Unmanaged Postgres and LiteFS with dynamic leases.
Jun 4, 04:02 PM
monitoringWe have restored the degraded Consul cluster and are monitoring for stability. All affected functionality should now be working correctly: Unmanaged Postgres and LiteFS with dynamic leases.
Jun 4, 01:44 PM
identifiedOne of our Consul clusters is in degraded state due to a failed node. This can cause issues with LiteFS primary node selection, Unmanaged Postgres (14.x and older *only*), and creation of new Unmanage...

Issues with flyctl ssh console and Machines OIDC

minor
Jun 1, 06:33 PMJun 1, 06:43 PMresolved
Jun 1, 06:43 PM
resolvedThis incident has been resolved.
Jun 1, 06:35 PM
monitoringA fix has been implemented and we are monitoring the results.
Jun 1, 06:33 PM
investigatingWe're currently investigating an issue affecting flyctl ssh console functionality and machines' OIDC tokens.

May 2026

IPv6 outage for some machines in ORD

major
May 31, 06:34 AMMay 31, 10:52 AMresolved
May 31, 10:52 AM
resolvedThis incident has been resolved.
May 31, 07:03 AM
monitoringA fix has been implemented and we are monitoring the results.
May 31, 06:34 AM
investigatingWe're working with our upstream providers to investigate an IPv6 networking failure in ORD.

Private networking issues in SYD

minor
May 30, 01:29 PMMay 30, 01:37 PMresolved
May 30, 01:37 PM
resolvedThis incident has been resolved.
May 30, 01:32 PM
monitoringA fix has been implemented and we are monitoring the results.
May 30, 01:29 PM
identifiedDue to an upstream provider issue, Private Networking (6PN) is currently degraded in SYD region. Communication between Machines in SYD region and Machines in other regions may fail at this time. Newly...

Elevated deployment errors

minor
May 30, 02:42 AMMay 30, 04:21 AMresolved
May 30, 04:21 AM
resolvedThis incident has been resolved.
May 30, 03:37 AM
monitoringA fix has been implemented and we are monitoring the results.
May 30, 03:13 AM
identifiedWe identified the issue and are working on a fix.
+1 more updates

Networking issues in ORD

minor
May 29, 07:09 PMMay 29, 07:45 PMresolved
May 29, 07:45 PM
resolvedThis incident has been resolved.
May 29, 07:21 PM
monitoringA fix has been implemented and we are monitoring the results.
May 29, 07:09 PM
identifiedWe are aware of increased latency and connection drops for clients located near Chicago (ORD) and are currently working on a fix.

Networking issues in ORD

minor
May 29, 06:59 AMMay 29, 04:50 PMresolved
May 29, 04:50 PM
resolvedThis incident has been resolved.
May 29, 10:22 AM
monitoringA fix has been implemented and we are monitoring the results.
May 29, 06:59 AM
investigatingWe are currently investigating increased latency and dropped connections in ORD (Chicago).

Networking issues in ORD

minor
May 29, 12:42 AMMay 29, 01:27 AMresolved
May 29, 01:27 AM
resolvedThis incident has been resolved.
May 29, 01:00 AM
monitoringA fix has been implemented and we are monitoring the results.
May 29, 12:42 AM
investigatingWe are currently investigating increased latency and dropped connections in ORD (Chicago).

Increased latency in SJC

minor
May 28, 09:08 PMMay 28, 11:07 PMresolved
May 28, 11:07 PM
resolvedThis incident has been resolved.
May 28, 10:34 PM
monitoringWe are continuing to monitor for any further issues.
May 28, 10:33 PM
monitoringA fix has been implemented and we are monitoring the results.
+3 more updates

Private networking issues in SYD

major
May 27, 11:54 AMMay 27, 12:52 PMresolved
May 27, 12:52 PM
resolvedThis incident has been resolved.
May 27, 12:13 PM
monitoringA fix has been implemented and we are monitoring the results.
May 27, 11:54 AM
identifiedDue to an upstream provider issue, Private Networking (6PN) is currently degraded in SYD region. Communication between Machines in SYD region and Machines in other regions may fail at this time. Newly...

Elevated GraphQL API Latency

minor
May 27, 03:50 AMMay 27, 04:36 AMresolved
May 27, 04:36 AM
resolvedThis incident has been resolved.
May 27, 04:07 AM
monitoringA fix has been implemented and we are monitoring the results.
May 27, 03:50 AM
investigatingWe are investigating elevated API Latency. Users may see delays or errors creating apps, as well as on some dashboard pages.

Capacity issues in EWR region

none
May 26, 05:34 PMMay 26, 07:06 PMresolved
May 26, 07:06 PM
resolvedThis incident has been resolved.
May 26, 05:34 PM
identifiedThe EWR region is low on available host capacity. Creating new machines, or starting currently stopped/suspended machines, may fail at this time. We are working on provisioning new host capacity in th...

Networking performance degraded in BOM and SJC

minor
May 23, 01:00 AMMay 24, 11:07 PMresolved
May 24, 11:07 PM
resolvedThis incident has been resolved.
May 23, 02:14 AM
monitoringNetwork performance has been restored and we're continuing to monitor.
May 23, 01:01 AM
investigatingWe're currently looking into this issue.

IPv6 outage for some machines in SIN

minor
May 22, 01:32 PMMay 22, 03:44 PMresolved
May 22, 03:44 PM
resolvedThis incident has been resolved.
May 22, 01:57 PM
monitoringA fix has been implemented and we are monitoring the results.
May 22, 01:34 PM
investigatingWe are continuing to investigate this issue.
+1 more updates

Network issues in SIN region

major
May 21, 06:30 PMMay 21, 07:59 PMresolved
May 21, 07:59 PM
resolvedThis incident has been resolved.
May 21, 06:45 PM
monitoringOur upstream provider has implemented a fix and all apps and Managed Postgres clusters are now reachable. For the time being, ingress traffic is being re-routed to other regions, so users around the ...
May 21, 06:30 PM
investigatingWe are investigating network issues in the Singapore region. Apps may experience higher latency or be unreachable at this time. Some Managed Postgres clusters may be unreachable.

IPv6 outage for some machines in ORD

none
May 21, 12:02 PMMay 21, 02:43 PMresolved
May 21, 02:43 PM
resolvedThis incident has been resolved.
May 21, 12:02 PM
investigatingWe're working with our upstream providers to investigate an IPv6 networking failure in ORD. Impacted apps may wish to temporarily provision additional capacity in nearby regions.

Networking issues in SIN

minor
May 20, 08:06 AMMay 21, 12:15 AMresolved
May 21, 12:15 AM
resolvedThis incident has been resolved.
May 20, 08:59 AM
identifiedAll MPGs in SIN are reachable again. We are seeing high latency to the affected provider. This may affect a subset of machines hosted in SIN.
May 20, 08:23 AM
identifiedOne of our upstreams is experiencing high packet loss and latency. We are actively working with them.
+1 more updates

Networking issues with egress IP addresses in SYD

minor
May 20, 05:05 AMMay 20, 08:05 AMresolved
May 20, 08:05 AM
resolvedThis incident has been resolved.
May 20, 07:11 AM
monitoringA fix has been implemented and we are monitoring the results.
May 20, 05:57 AM
identifiedWe've identified the issue and are working on a fix.
+1 more updates

Issues with the Fly.io dashboard

major
May 19, 11:19 PMMay 20, 12:36 AMresolved
May 20, 12:36 AM
resolvedThis incident has been resolved.
May 20, 12:26 AM
monitoringA fix has been implemented and we are monitoring the results.
May 19, 11:50 PM
identifiedWe are continuing to work on a fix for this issue.
+2 more updates

Logs issues in IAD

major
May 19, 07:16 PMMay 19, 08:56 PMresolved
May 19, 08:56 PM
resolvedThis incident has been resolved.
May 19, 07:16 PM
investigatingWe are investigating an issue with logs and metrics in IAD region. New logs and metrics from machines in IAD region may be missing, but past logs/metrics are still accessible. Apps continue to run.

Proxy issues in SIN region

major
May 19, 11:15 AMMay 19, 12:00 PMresolved
May 19, 12:00 PM
resolvedThis incident has been resolved.
May 19, 11:22 AM
monitoringA fix has been implemented and we are seeing proxy performance in SIN return to normal. All Managed Postgres clusters in the region are reachable. We are continuing to monitor to ensure stable recover...
May 19, 11:15 AM
investigatingWe are investigating issues with fly-proxy on a subset of hosts in the Singapore region. Apps are still running, but requests to/from some apps may fail, and some Managed Postgres clusters may be inac...

Some Managed Postgres clusters in FRA are unreachable

major
May 16, 12:45 PMMay 16, 03:10 PMresolved
May 16, 03:10 PM
resolvedThis incident has been resolved.
May 16, 02:03 PM
monitoringAll affected clusters have recovered
May 16, 01:40 PM
identifiedSome of unreachable clusters are showing recovery. We are still fixing the root cause.
+3 more updates

fly ssh console returns error 500

none
May 15, 04:29 PMMay 15, 05:24 PMresolved
May 15, 05:24 PM
resolvedThis issue has been resolved, fly ssh console invocations are now working correctly.
May 15, 04:40 PM
monitoringWe've deployed a fix and are monitoring as error rates normalize. `fly ssh console` should be working now.
May 15, 04:29 PM
identifiedA problem with our vault used to issue temporary certificates for SSH sessions is causing calls to `fly ssh console` and `fly console` to return error 500. Our team has identified the cause and is d...

log search unavailable

major
May 11, 08:02 PMMay 11, 08:48 PMresolved
May 11, 08:48 PM
resolvedThis incident has been resolved.
May 11, 08:32 PM
monitoringA fix has been implemented and we are monitoring the results. Logs should again be available through Log search in Grafana.
May 11, 08:02 PM
investigatingLog search in Grafana is currently unavailable. You may see `failed to make http request: 503` errors when accessing logs from fly-metrics.net at this time. App logs are still available using the `fly...

Upstash Redis Outage

major
May 11, 02:36 PMMay 11, 06:41 PMresolved
May 11, 06:41 PM
resolvedThis incident has been resolved.
May 11, 06:26 PM
monitoringA fix has been implemented and we are seeing Upstash Redis connectivity return to normal across all regions. We continuing to monitor to ensure stable recovery.
May 11, 05:31 PM
investigatingWe are continuing to work with Upstash on this issue. We have received reports of partial recovery for some users, however we are still seeing higher levels of degraded or failing connections connecti...
+2 more updates

Certificate Issuance failing due to LetsEncrypt Outage

critical
May 8, 06:45 PMMay 8, 09:09 PMresolved
May 8, 09:09 PM
resolvedThis incident has been resolved.
May 8, 09:02 PM
monitoringWe are seeing recovery and certificates are now issuing normally. We are continuing to monitor to ensure full recovery.
May 8, 08:25 PM
identifiedDue to a service outage at LetsEncrypt, creating new certificates with `fly certs add` is failing. Existing certificates and `*.fly.dev` preview certificates are not impacted. For additional details...

Connectivity issues in SJC

major
May 7, 05:49 PMMay 7, 06:28 PMresolved
May 7, 06:28 PM
resolvedThis incident has been resolved.
May 7, 06:18 PM
monitoringA fix has been implemented and we are monitoring the results.
May 7, 05:49 PM
investigatingSome hosts in SJC are currently experiencing an upstream network issue. Apps running on these hosts may be temporarily unavailable.

Intermittent machines issues in BOM

minor
May 7, 10:16 AMMay 7, 12:08 PMresolved
May 7, 12:08 PM
resolvedThis incident has been resolved.
May 7, 10:16 AM
identifiedCreation and updating of machines in BOM are affected. Some metrics and logs for resources in BOM may be delayed.

Elevated error rates on List Machines endpoint

minor
May 6, 12:06 AMMay 6, 12:41 AMresolved
May 6, 12:41 AM
resolvedWe mitigated the problem. The issue only affected apps having machines in the sin region.
May 6, 12:06 AM
investigatingWe are currently investigating this issue.

Errors Setting/Updating Secrets

major
May 5, 01:18 PMMay 5, 01:53 PMresolved
May 5, 01:53 PM
resolvedThis incident has been resolved.
May 5, 01:22 PM
investigatingCreation of new apps or changing secrets on existing apps fails
May 5, 01:22 PM
investigatingCreation of new apps or changing secrets on existing apps fails
+2 more updates

Log search unavailable

minor
May 4, 06:57 PMMay 4, 08:42 PMresolved
May 4, 08:42 PM
resolvedThis incident has been resolved.
May 4, 08:00 PM
monitoringWe have a mitigation in place and are monitoring results.
May 4, 06:57 PM
investigatingLog search in Grafana is currently unavailable. You may see `failed to make http request: 502` errors when accessing logs from fly-metrics.net at this time. App logs continue to be available using the...

April 2026

flyctl deploy creating new app instances

minor
Apr 28, 11:50 PMApr 29, 12:40 AMresolved
Apr 29, 12:40 AM
resolvedThis incident has been resolved.
Apr 29, 12:31 AM
monitoringA fix has been implemented and we are monitoring the results.
Apr 29, 12:07 AM
identifiedThe issue has been identified and a fix is being implemented.
+1 more updates

Slow machines operations in IAD region

minor
Apr 24, 10:45 PMApr 24, 11:31 PMresolved
Apr 24, 11:31 PM
resolvedThis incident has been resolved.
Apr 24, 11:19 PM
monitoringNetwork packet loss has returned to normal levels. We are monitoring the Machines API for stability.
Apr 24, 11:18 PM
investigatingWe are continuing to investigate this issue.
+2 more updates

Errors when adding or editing Github integrations for deployments

minor
Apr 23, 03:05 PMApr 23, 04:26 PMresolved
Apr 23, 04:26 PM
resolvedThis incident has been resolved.
Apr 23, 03:39 PM
monitoringA fix has been implemented and we are monitoring the results.
Apr 23, 03:22 PM
identifiedWe are continuing to work on a fix for this issue.
+2 more updates

Errors (5xx, timeouts) in Fly.io dashboard

major
Apr 23, 11:17 AMApr 23, 11:50 AMresolved
Apr 23, 11:50 AM
resolvedThis incident has been resolved.
Apr 23, 11:45 AM
monitoringA fix has been implemented and we are monitoring the results.
Apr 23, 11:35 AM
identifiedThe issue has been identified and a fix is being implemented.
+1 more updates

Increased latency in SIN

minor
Apr 20, 02:29 PMApr 20, 05:38 PMresolved
Apr 20, 05:38 PM
resolvedThis incident has been resolved.
Apr 20, 03:29 PM
identifiedWe are currently working on resolving increased latencies in our Singapore region.

TLS certificate issues

major
Apr 17, 01:06 PMApr 18, 08:42 PMresolved
Apr 18, 08:42 PM
resolvedThis incident has been resolved.
Apr 17, 03:34 PM
monitoringA fix has been implemented and we are monitoring the results.
Apr 17, 01:06 PM
investigatingWe are investigating an issue with the Vault server that stores TLS certificates. Provisioning new TLS certificates may fail, and connecting to domains whose existing certificate has not yet been cach...

Network issues in SYD

none
Apr 15, 11:08 AMApr 16, 10:59 AMresolved
Apr 16, 10:59 AM
resolvedThis incident has been resolved.
Apr 15, 11:40 AM
monitoringWe've identified the issue and applied a fix. All services should be working as normal.
Apr 15, 11:08 AM
investigatingWe're currently investigating some networking issues in SYD. This is affecting a number of our central services.

Heightened latency in ORD

none
Apr 12, 06:50 PMApr 12, 11:03 PMresolved
Apr 12, 11:03 PM
resolvedThis incident has been resolved.
Apr 12, 07:26 PM
monitoringA fix has been implemented and we are monitoring the results.
Apr 12, 06:50 PM
investigatingWe are currently investigating heightened network latency in ORD.

Get Fly.io Outage Alerts

Be the first to know when Fly.io go down.