Fly.io Outage History

Past incidents and downtime events

Complete history of Fly.io outages, incidents, and service disruptions. Showing the 50 most recent incidents.

June 2026 (12 incidents)

major | resolved | Jun 17, 03:21 AM - Resolved Jun 17, 04:55 AM

Network Issues in SIN

5 updates
resolved | Jun 17, 04:55 AM

This incident has been resolved.

monitoring | Jun 17, 04:38 AM

Network connectivity in SIN has been fully restored. We're continuing to monitor.

identified | Jun 17, 04:22 AM

We are seeing recovery of network connectivity between SIN and most destinations. We're continuing to work with our upstream provider to resolve the remaining issues.

identified | Jun 17, 03:47 AM

Some machines in SIN are unreachable. A few Managed Postgres clusters may fail to fail over or update. We are working with our upstream provider to fix this.

investigating | Jun 17, 03:21 AM

We are currently investigating network connectivity issues in the SIN region. Hosted apps may be unavailable.

critical | resolved | Jun 15, 03:03 PM - Resolved Jun 15, 04:41 PM

Macaroon Auth + Machines API Issues

9 updates
resolved | Jun 15, 04:41 PM

This incident has been resolved and we are seeing all platform functions operate normally.

monitoring | Jun 15, 04:15 PM

A fix has been implemented and we are monitoring the results.

identified | Jun 15, 04:15 PM

We have deployed another change and are seeing wider improvements in platform stability across all regions. Performance is trending to normal, though users may still see some degradation at this time. We are continuing to closely monitor to ensure full, stable recovery. We will provide another update in 15m.

identified | Jun 15, 04:00 PM

We are seeing elevated cluster errors with Managed Postgres clusters as the MPG control plane recovers from the API outage. MPG users may see elevated rates of failing or slow connections, as well as increased primary/replica failovers. The Managed Postgres team is addressing any degraded clusters. We will provide a further update within 15 minutes.

identified | Jun 15, 03:57 PM

We continue to see degraded performance and increased errors with the Machines API and other platform features at this time. We are continuing to work on fully restoring service.

identified | Jun 15, 03:41 PM

An initial fix has been deployed and we are starting to see platform features recover. Users may still see degraded performance and intermittent failures at this time. We are continuing to address the issue to ensure a full, stable recovery.

identified | Jun 15, 03:30 PM

We have identified the cause of the issue and are working on deploying a fix. Impacted features remain unavailable or degraded at this time. Already-running customer applications/machines remain available. MPG clusters remain generally reachable and healthy; however, new clusters cannot be provisioned and failovers may not complete. We will provide another update within 15 minutes.

identified | Jun 15, 03:13 PM

We are continuing to address this issue. Platform authentication with macaroon-based tokens is currently failing. Platform features that authenticate with macaroons, including Machines API operations, Dashboard logins, some flyctl commands, fly-metrics.net Grafana, and deployments, are failing at this time. Existing, running customer applications and machines remain reachable and running. We will provide another update within 15 minutes.

investigating | Jun 15, 03:03 PM

We are investigating issues with macaroon-based authentication. This is impacting parts of the Machines API, the Fly.io Dashboard, some flyctl operations, and other platform features that rely on it.

none | resolved | Jun 15, 06:20 AM - Resolved Jun 15, 06:39 AM

MPG cluster provisioning is broken

2 updates
resolved | Jun 15, 06:39 AM

This incident has been resolved.

investigating | Jun 15, 06:20 AM

New MPG cluster provisioning is broken. Existing MPG clusters are not affected. Newly created organizations may see errors while SSHing into their machines. We are investigating the issue.

major | resolved | Jun 12, 02:35 AM - Resolved Jun 12, 04:30 AM

Elevated Sprites error rates in SIN

3 updates
resolved | Jun 12, 04:30 AM

This incident has been resolved.

monitoring | Jun 12, 03:24 AM

A fix has been implemented and we are seeing error rates for Sprites in SIN normalize. We are continuing to monitor to ensure full recovery.

investigating | Jun 12, 02:35 AM

We are investigating elevated 500 / internal server error rates with Sprites in the SIN region. Users may see increased errors when accessing Sprites located in this region, or for requests to the Sprites API originating from SIN.

minor | resolved | Jun 11, 01:16 AM - Resolved Jun 11, 11:12 AM

Increased network latency in North America

8 updates
resolved | Jun 11, 02:12 PM

This incident has been resolved.

monitoring | Jun 11, 10:09 AM

Our upstream paths have been fixed. We are monitoring the results.

identified | Jun 11, 09:35 AM

We are working with our upstream network provider to address periodic loss of connectivity over transit in ORD.

identified | Jun 11, 07:58 AM

We are seeing a recurrence of elevated latency across some hosts in ORD impacting the MPG control plane and a subset of clusters there. We are working to address this.

monitoring | Jun 11, 05:24 AM

The network between our regions has been performing well for the majority of traffic. We're still continuing to monitor a few impacted routes in North America that may be seeing elevated latency and packet loss.

monitoring | Jun 11, 03:30 AM

Impacted NA backbones have been sidestepped where possible, and we're continuing to monitor network health.

monitoring | Jun 11, 01:59 AM

Managed Postgres in ORD has returned to normal operation. We continue to see slightly elevated latencies and loss over transits in North America. We are working with our upstream network providers to improve performance.

investigating | Jun 11, 01:16 AM

We are investigating elevated network instability across some hosts in ORD. Apps and Managed Postgres clusters on impacted hosts may see elevated latency or networking errors at this time.

major | resolved | Jun 10, 07:39 PM - Resolved Jun 10, 07:52 PM

Ingress Traffic issues in GRU

2 updates
resolved | Jun 10, 07:52 PM

This incident has been resolved.

identified | Jun 10, 07:39 PM

Some of our edge nodes in GRU have suffered an error that crashed some critical services. We're currently working to bring them back online. Some traffic entering through GRU (i.e. users connecting from around GRU) may be temporarily affected: connections may see increased latency or be occasionally dropped.

major | resolved | Jun 9, 06:20 PM - Resolved Jun 9, 06:42 PM

Emergency maintenance of Petsem causing some control plane errors

3 updates
resolved | Jun 9, 06:42 PM

This incident has been resolved.

monitoring | Jun 9, 06:28 PM

The maintenance has been completed and control plane functions should recover to normal. We're monitoring for any further complications.

identified | Jun 9, 06:20 PM

We're performing emergency maintenance on Petsem, our secrets management service. Some control plane write operations, such as creating new apps or secrets, may temporarily fail. Existing apps and machines should keep functioning without issues.

major | resolved | Jun 9, 02:57 PM - Resolved Jun 9, 04:15 PM

Managed Postgres Control Plane Issues in IAD

4 updates
resolved | Jun 9, 09:31 PM

This incident has been resolved.

monitoring | Jun 9, 03:19 PM

An initial fix has been implemented and connectivity to all impacted clusters has been restored. We are continuing to monitor to ensure stable recovery.

identified | Jun 9, 03:06 PM

We are continuing to address this issue. Some clusters in IAD are unavailable at this time, and some users may have seen unexpected cluster restarts. We are working on restoring normal performance for all clusters in IAD.

investigating | Jun 9, 02:57 PM

We are investigating MPG control plane instability in a subset of the IAD region. A small number of clusters in the region may have seen unexpected failovers or connection issues over the past 30m.

minor | resolved | Jun 9, 09:16 AM - Resolved Jun 9, 10:01 AM

Egress IPs are broken in ORD

3 updates
resolved | Jun 9, 10:01 AM

This incident has been resolved.

monitoring | Jun 9, 09:37 AM

A fix has been implemented and we are monitoring the results.

investigating | Jun 9, 09:16 AM

Egress IPs are broken in most of ORD. We are currently investigating this issue.

minor | resolved | Jun 8, 10:31 AM - Resolved Jun 8, 12:30 PM

Capacity issues in ARN region

2 updates
resolved | Jun 8, 12:30 PM

This incident has been resolved.

investigating | Jun 8, 10:31 AM

The ARN region is low on available host capacity. Creating new machines, or starting currently stopped/suspended machines, may fail at this time. We are working on provisioning new host capacity in the region. Please consider using nearby regions if possible.

none | resolved | Jun 4, 01:44 PM - Resolved Jun 4, 05:03 PM

Consul cluster degradation

3 updates
resolved | Jun 4, 05:03 PM

We have restored the degraded Consul cluster. All affected functionality is now working correctly: Unmanaged Postgres and LiteFS with dynamic leases.

monitoring | Jun 4, 04:02 PM

We have restored the degraded Consul cluster and are monitoring for stability. All affected functionality should now be working correctly: Unmanaged Postgres and LiteFS with dynamic leases.

identified | Jun 4, 01:44 PM

One of our Consul clusters is in a degraded state due to a failed node. This can cause issues with LiteFS primary node selection, Unmanaged Postgres (14.x and older *only*), and creation of new Unmanaged Postgres clusters. Impact is limited to these legacy products and does not affect deployments, running Fly applications in general, or Managed Postgres clusters.

minor | resolved | Jun 1, 06:33 PM - Resolved Jun 1, 06:43 PM

Issues with flyctl ssh console and Machines OIDC

3 updates
resolved | Jun 1, 06:43 PM

This incident has been resolved.

monitoring | Jun 1, 06:35 PM

A fix has been implemented and we are monitoring the results.

investigating | Jun 1, 06:33 PM

We're currently investigating an issue affecting flyctl ssh console functionality and machines' OIDC tokens.

May 2026 (29 incidents)

major | resolved | May 31, 06:34 AM - Resolved May 31, 10:52 AM

IPv6 outage for some machines in ORD

3 updates
resolved | May 31, 10:52 AM

This incident has been resolved.

monitoring | May 31, 07:03 AM

A fix has been implemented and we are monitoring the results.

investigating | May 31, 06:34 AM

We're working with our upstream providers to investigate an IPv6 networking failure in ORD.

minor | resolved | May 30, 01:29 PM - Resolved May 30, 01:37 PM

Private networking issues in SYD

3 updates
resolved | May 30, 01:37 PM

This incident has been resolved.

monitoring | May 30, 01:32 PM

A fix has been implemented and we are monitoring the results.

identified | May 30, 01:29 PM

Due to an upstream provider issue, Private Networking (6PN) is currently degraded in the SYD region. Communication between Machines in the SYD region and Machines in other regions may fail at this time. Newly created Machines in SYD may fail to sync to other regions (they may not show up in the Machines API List endpoint, or their state may be incorrect). Additionally, TLS certificate resolution and Machines API authentication may currently be degraded in the SYD region. We are working with our upstream providers to resolve this issue.

minor | resolved | May 30, 02:42 AM - Resolved May 30, 04:21 AM

Elevated deployment errors

4 updates
resolved | May 30, 04:21 AM

This incident has been resolved.

monitoring | May 30, 03:37 AM

A fix has been implemented and we are monitoring the results.

identified | May 30, 03:13 AM

We identified the issue and are working on a fix.

investigating | May 30, 02:42 AM

We're investigating an increase in deployment errors affecting some users. At this time, creating or updating Machines may erroneously fail with the message: "We require your billing information, please add it at https://fly.io/dashboard//billing".

minor | resolved | May 29, 07:09 PM - Resolved May 29, 07:45 PM

Networking issues in ORD

3 updates
resolved | May 29, 07:45 PM

This incident has been resolved.

monitoring | May 29, 07:21 PM

A fix has been implemented and we are monitoring the results.

identified | May 29, 07:09 PM

We are aware of increased latency and connection drops for clients located near Chicago (ORD) and are currently working on a fix.

minor | resolved | May 29, 06:59 AM - Resolved May 29, 04:50 PM

Networking issues in ORD

3 updates
resolved | May 29, 04:50 PM

This incident has been resolved.

monitoring | May 29, 10:22 AM

A fix has been implemented and we are monitoring the results.

investigating | May 29, 06:59 AM

We are currently investigating increased latency and dropped connections in ORD (Chicago).

minor | resolved | May 29, 12:42 AM - Resolved May 29, 01:27 AM

Networking issues in ORD

3 updates
resolved | May 29, 01:27 AM

This incident has been resolved.

monitoring | May 29, 01:00 AM

A fix has been implemented and we are monitoring the results.

investigating | May 29, 12:42 AM

We are currently investigating increased latency and dropped connections in ORD (Chicago).

minor | resolved | May 28, 09:08 PM - Resolved May 28, 11:07 PM

Increased latency in SJC

6 updates
resolved | May 28, 11:07 PM

This incident has been resolved.

monitoring | May 28, 10:34 PM

We are continuing to monitor for any further issues.

monitoring | May 28, 10:33 PM

A fix has been implemented and we are monitoring the results.

identified | May 28, 10:25 PM

We're still seeing connection issues originating from SJC/LAX/West Coast US and are still investigating.

monitoring | May 28, 09:56 PM

We have implemented a mitigation for the issue and are monitoring for stability. This issue happened on the edge hosts in SJC, which means that any traffic near/around SJC would have been affected during the incident. We will provide a more detailed write-up on our Infra Log (https://fly.io/infra-log/) later once we have a better picture of the entire incident.

investigating | May 28, 09:08 PM

We are currently investigating increased latency in SJC.

major | resolved | May 27, 11:54 AM - Resolved May 27, 12:52 PM

Private networking issues in SYD

3 updates
resolved | May 27, 12:52 PM

This incident has been resolved.

monitoring | May 27, 12:13 PM

A fix has been implemented and we are monitoring the results.

identified | May 27, 11:54 AM

Due to an upstream provider issue, Private Networking (6PN) is currently degraded in the SYD region. Communication between Machines in the SYD region and Machines in other regions may fail at this time. Newly created Machines in SYD may fail to sync to other regions (they may not show up in the Machines API List endpoint, or their state may be incorrect). Additionally, TLS certificate resolution and Machines API authentication may currently be degraded in the SYD region. We are working with our upstream providers to resolve this issue.

minor | resolved | May 27, 03:50 AM - Resolved May 27, 04:36 AM

Elevated GraphQL API Latency

3 updates
resolved | May 27, 04:36 AM

This incident has been resolved.

monitoring | May 27, 04:07 AM

A fix has been implemented and we are monitoring the results.

investigating | May 27, 03:50 AM

We are investigating elevated API latency. Users may see delays or errors when creating apps, as well as on some dashboard pages.

none | resolved | May 26, 05:34 PM - Resolved May 26, 07:06 PM

Capacity issues in EWR region

2 updates
resolved | May 26, 07:06 PM

This incident has been resolved.

identified | May 26, 05:34 PM

The EWR region is low on available host capacity. Creating new machines, or starting currently stopped/suspended machines, may fail at this time. We are working on provisioning new host capacity in the region. Please consider using nearby regions such as ORD if possible.

minor | resolved | May 23, 01:00 AM - Resolved May 24, 11:07 PM

Networking performance degraded in BOM and SJC

3 updates
resolved | May 24, 11:07 PM

This incident has been resolved.

monitoring | May 23, 02:14 AM

Network performance has been restored and we're continuing to monitor.

investigating | May 23, 01:01 AM

We're currently looking into this issue.

minor | resolved | May 22, 01:32 PM - Resolved May 22, 03:44 PM

IPv6 outage for some machines in SIN

4 updates
resolved | May 22, 03:44 PM

This incident has been resolved.

monitoring | May 22, 01:57 PM

A fix has been implemented and we are monitoring the results.

investigating | May 22, 01:34 PM

We are continuing to investigate this issue.

investigating | May 22, 01:32 PM

We are currently investigating this issue.

major | resolved | May 21, 06:30 PM - Resolved May 21, 07:59 PM

Network issues in SIN region

3 updates
resolved | May 21, 07:59 PM

This incident has been resolved.

monitoring | May 21, 06:45 PM

Our upstream provider has implemented a fix and all apps and Managed Postgres clusters are now reachable. For the time being, ingress traffic is being re-routed to other regions, so users around the Singapore area may experience higher latency.

investigating | May 21, 06:30 PM

We are investigating network issues in the Singapore region. Apps may experience higher latency or be unreachable at this time. Some Managed Postgres clusters may be unreachable.

none | resolved | May 21, 12:02 PM - Resolved May 21, 02:43 PM

IPv6 outage for some machines in ORD

2 updates
resolved | May 21, 02:43 PM

This incident has been resolved.

investigating | May 21, 12:02 PM

We're working with our upstream providers to investigate an IPv6 networking failure in ORD. Impacted apps may wish to temporarily provision additional capacity in nearby regions.

minor | resolved | May 20, 08:06 AM - Resolved May 21, 12:15 AM

Networking issues in SIN

4 updates
resolved | May 21, 12:15 AM

This incident has been resolved.

identified | May 20, 08:59 AM

All MPGs in SIN are reachable again. We are seeing high latency to the affected provider. This may affect a subset of machines hosted in SIN.

identified | May 20, 08:23 AM

One of our upstreams is experiencing high packet loss and latency. We are actively working with them.

investigating | May 20, 08:06 AM

Machines may see high packet loss. Some MPGs are unable to connect to their config store and may be unreachable right now.

minor | resolved | May 20, 05:05 AM - Resolved May 20, 08:05 AM

Networking issues with egress IP addresses in SYD

4 updates
resolved | May 20, 08:05 AM

This incident has been resolved.

monitoring | May 20, 07:11 AM

A fix has been implemented and we are monitoring the results.

identified | May 20, 05:57 AM

We've identified the issue and are working on a fix.

investigating | May 20, 05:05 AM

We are currently investigating an issue affecting networking for new machines whose apps have assigned egress IP addresses in our SYD region.

major | resolved | May 19, 11:19 PM - Resolved May 20, 12:36 AM

Issues with the Fly.io dashboard

5 updates
resolved | May 20, 12:36 AM

This incident has been resolved.

monitoring | May 20, 12:26 AM

A fix has been implemented and we are monitoring the results.

identified | May 19, 11:50 PM

We are continuing to work on a fix for this issue.

identified | May 19, 11:34 PM

The issue has been identified and a fix is being implemented.

investigating | May 19, 11:19 PM

We're currently investigating an issue where the Fly.io dashboard is failing to load in some cases.

major | resolved | May 19, 07:16 PM - Resolved May 19, 08:56 PM

Logs issues in IAD

2 updates
resolved | May 19, 08:56 PM

This incident has been resolved.

investigating | May 19, 07:16 PM

We are investigating an issue with logs and metrics in the IAD region. New logs and metrics from machines in the IAD region may be missing, but past logs/metrics are still accessible. Apps continue to run.

major | resolved | May 19, 11:15 AM - Resolved May 19, 12:00 PM

Proxy issues in SIN region

3 updates
resolved | May 19, 12:00 PM

This incident has been resolved.

monitoring | May 19, 11:22 AM

A fix has been implemented and we are seeing proxy performance in SIN return to normal. All Managed Postgres clusters in the region are reachable. We are continuing to monitor to ensure stable recovery.

investigating | May 19, 11:15 AM

We are investigating issues with fly-proxy on a subset of hosts in the Singapore region. Apps are still running, but requests to/from some apps may fail, and some Managed Postgres clusters may be inaccessible.

major | resolved | May 16, 12:45 PM - Resolved May 16, 03:10 PM

Some Managed Postgres clusters in FRA are unreachable

6 updates
resolved | May 16, 03:10 PM

This incident has been resolved.

monitoring | May 16, 02:03 PM

All affected clusters have recovered.

identified | May 16, 01:40 PM

Some of the unreachable clusters are showing recovery. We are still fixing the root cause.

identified | May 16, 01:22 PM

The issue has been identified and a fix is being implemented.

investigating | May 16, 12:45 PM

We are continuing to investigate this issue.

investigating | May 16, 12:45 PM

We are currently investigating this issue.

none | resolved | May 15, 04:29 PM - Resolved May 15, 05:24 PM

fly ssh console returns error 500

3 updates
resolved | May 15, 05:24 PM

This issue has been resolved; `fly ssh console` invocations are now working correctly.

monitoring | May 15, 04:40 PM

We've deployed a fix and are monitoring as error rates normalize. `fly ssh console` should be working now.

identified | May 15, 04:29 PM

A problem with our vault used to issue temporary certificates for SSH sessions is causing calls to `fly ssh console` and `fly console` to return error 500. Our team has identified the cause and is deploying a fix.

major | resolved | May 11, 08:02 PM - Resolved May 11, 08:48 PM

Log search unavailable

3 updates
resolved | May 11, 08:48 PM

This incident has been resolved.

monitoring | May 11, 08:32 PM

A fix has been implemented and we are monitoring the results. Logs should again be available through Log search in Grafana.

investigating | May 11, 08:02 PM

Log search in Grafana is currently unavailable. You may see `failed to make http request: 503` errors when accessing logs from fly-metrics.net at this time. App logs are still available using the `fly logs` command and in the Fly.io dashboard.

major | resolved | May 11, 02:36 PM - Resolved May 11, 06:41 PM

Upstash Redis Outage

5 updates
resolved | May 11, 06:41 PM

This incident has been resolved.

monitoring | May 11, 06:26 PM

A fix has been implemented and we are seeing Upstash Redis connectivity return to normal across all regions. We are continuing to monitor to ensure stable recovery.

investigating | May 11, 05:31 PM

We are continuing to work with Upstash on this issue. We have received reports of partial recovery for some users; however, we are still seeing higher levels of degraded or failing connections to Upstash Redis databases at this time.

investigating | May 11, 03:04 PM

We are continuing to work with Upstash on this issue.

investigating | May 11, 02:36 PM

We are working with Upstash to investigate issues with their Fly-hosted Redis service. Users may see degraded or failing connections to their Upstash Redis databases at this time.

critical | resolved | May 8, 06:45 PM - Resolved May 8, 09:09 PM

Certificate Issuance failing due to LetsEncrypt Outage

3 updates
resolved | May 8, 09:09 PM

This incident has been resolved.

monitoring | May 8, 09:02 PM

We are seeing recovery and certificates are now issuing normally. We are continuing to monitor to ensure full recovery.

identified | May 8, 08:25 PM

Due to a service outage at LetsEncrypt, creating new certificates with `fly certs add` is failing. Existing certificates and `*.fly.dev` preview certificates are not impacted. For additional details, please see LetsEncrypt's status page: https://letsencrypt.status.io/pages/incident/55957a99e800baa4470002da/69fe2d6698ca07050eb4b1b3

major | resolved | May 7, 05:49 PM - Resolved May 7, 06:28 PM

Connectivity issues in SJC

3 updates
resolved | May 7, 06:28 PM

This incident has been resolved.

monitoring | May 7, 06:18 PM

A fix has been implemented and we are monitoring the results.

investigating | May 7, 05:49 PM

Some hosts in SJC are currently experiencing an upstream network issue. Apps running on these hosts may be temporarily unavailable.

minor | resolved | May 7, 10:16 AM - Resolved May 7, 12:08 PM

Intermittent machines issues in BOM

2 updates
resolved | May 7, 12:08 PM

This incident has been resolved.

identified | May 7, 10:16 AM

Creation and updating of machines in BOM are affected. Some metrics and logs for resources in BOM may be delayed.

minor | resolved | May 6, 12:06 AM - Resolved May 6, 12:41 AM

Elevated error rates on List Machines endpoint

2 updates
resolved | May 6, 12:41 AM

We mitigated the problem. The issue only affected apps with machines in the SIN region.

investigating | May 6, 12:06 AM

We are currently investigating this issue.

major | resolved | May 5, 01:18 PM - Resolved May 5, 01:53 PM

Errors Setting/Updating Secrets

5 updates
resolved | May 5, 01:53 PM

This incident has been resolved.

investigating | May 5, 01:22 PM

Creating new apps or changing secrets on existing apps fails.

investigating | May 5, 01:22 PM

Creating new apps or changing secrets on existing apps fails.

investigating | May 5, 01:19 PM

We are continuing to investigate this issue.

investigating | May 5, 01:18 PM

We are currently investigating this issue.

minor | resolved | May 4, 06:57 PM - Resolved May 4, 08:42 PM

Log search unavailable

3 updates
resolved | May 4, 08:42 PM

This incident has been resolved.

monitoring | May 4, 08:00 PM

We have a mitigation in place and are monitoring results.

investigating | May 4, 06:57 PM

Log search in Grafana is currently unavailable. You may see `failed to make http request: 502` errors when accessing logs from fly-metrics.net at this time. App logs continue to be available using the `fly logs` command and in the Fly.io dashboard.

April 2026 (9 incidents)

minor | resolved | Apr 28, 11:50 PM - Resolved Apr 29, 12:40 AM

flyctl deploy creating new app instances

4 updates
resolved | Apr 29, 12:40 AM

This incident has been resolved.

monitoring | Apr 29, 12:31 AM

A fix has been implemented and we are monitoring the results.

identified | Apr 29, 12:07 AM

The issue has been identified and a fix is being implemented.

investigating | Apr 28, 11:50 PM

We're investigating an issue where `fly deploy` is creating new Fly machine instances rather than updating existing ones, leaving apps in a mixed state. As a workaround, please try removing the "processes = [ "app" ]" line from your fly.toml configuration file and redeploying. Another workaround is to downgrade flyctl to 0.4.40, which should resolve the issue in the meantime.

minor | resolved | Apr 24, 10:45 PM - Resolved Apr 24, 11:31 PM

Slow machines operations in IAD region

5 updates
resolved | Apr 24, 11:31 PM

This incident has been resolved.

monitoring | Apr 24, 11:19 PM

Network packet loss has returned to normal levels. We are monitoring the Machines API for stability.

investigating | Apr 24, 11:18 PM

We are continuing to investigate this issue.

investigating | Apr 24, 10:58 PM

We are deploying a partial mitigation while we continue investigating.

investigating | Apr 24, 10:45 PM

We are currently investigating the issue. Only a portion of machines within the region are impacted.

minor | resolved | Apr 23, 03:05 PM - Resolved Apr 23, 04:26 PM

Errors when adding or editing GitHub integrations for deployments

5 updates
resolved | Apr 23, 04:26 PM

This incident has been resolved.

monitoring | Apr 23, 03:39 PM

A fix has been implemented and we are monitoring the results.

identified | Apr 23, 03:22 PM

We are continuing to work on a fix for this issue.

identified | Apr 23, 03:22 PM

The issue has been identified and a fix is being implemented.

investigating | Apr 23, 03:05 PM

We're investigating reports of "500" errors when trying to add a new GitHub integration or edit an existing GitHub integration in Fly.io/dashboard. This only affects "Launch an app from GitHub" or trying to change settings for an app set up this way. Existing integrations continue to work normally. It does not affect deploys done with `flyctl` or existing, running apps.

major | resolved | Apr 23, 11:17 AM - Resolved Apr 23, 11:50 AM

Errors (5xx, timeouts) in Fly.io dashboard

4 updates
resolved | Apr 23, 11:50 AM

This incident has been resolved.

monitoring | Apr 23, 11:45 AM

A fix has been implemented and we are monitoring the results.

identified | Apr 23, 11:35 AM

The issue has been identified and a fix is being implemented.

investigating | Apr 23, 11:17 AM

We are investigating issues with the web dashboard.

minor | resolved | Apr 20, 02:29 PM - Resolved Apr 20, 05:38 PM

Increased latency in SIN

2 updates
resolved | Apr 20, 05:38 PM

This incident has been resolved.

identified | Apr 20, 03:29 PM

We are currently working on resolving increased latencies in our Singapore region.

major | resolved | Apr 17, 01:06 PM - Resolved Apr 18, 08:42 PM

TLS certificate issues

3 updates
resolved | Apr 18, 08:42 PM

This incident has been resolved.

monitoring | Apr 17, 03:34 PM

A fix has been implemented and we are monitoring the results.

investigating | Apr 17, 01:06 PM

We are investigating an issue with the Vault server that stores TLS certificates. Provisioning new TLS certificates may fail, and connecting to domains whose existing certificate has not yet been cached may fail.

none | resolved | Apr 15, 11:08 AM - Resolved Apr 16, 10:59 AM

Network issues in SYD

3 updates
resolved | Apr 16, 10:59 AM

This incident has been resolved.

monitoring | Apr 15, 11:40 AM

We've identified the issue and applied a fix. All services should be working as normal.

investigating | Apr 15, 11:08 AM

We're currently investigating some networking issues in SYD. This is affecting a number of our central services.

none | resolved | Apr 12, 06:50 PM - Resolved Apr 12, 11:03 PM

Heightened latency in ORD

3 updates
resolved | Apr 12, 11:03 PM

This incident has been resolved.

monitoring | Apr 12, 07:26 PM

A fix has been implemented and we are monitoring the results.

investigating | Apr 12, 06:50 PM

We are currently investigating heightened network latency in ORD.

minor | resolved | Apr 10, 06:42 PM - Resolved Apr 10, 09:48 PM

Managed Postgres control plane instability in NRT (Tokyo)

4 updates
resolved | Apr 10, 09:48 PM

This incident has been resolved.

monitoring | Apr 10, 08:32 PM

A fix has been implemented and we are seeing MPG performance in NRT normalize. We are continuing to monitor to ensure a stable recovery.

identified | Apr 10, 08:13 PM

The issue has been identified and a fix is being implemented. Users with clusters in NRT may continue to see instability at this time.

investigating | Apr 10, 06:42 PM

We are investigating instability in the MPG control plane in the NRT (Tokyo, Japan) region causing unexpected cluster failovers. Clusters return to health shortly after, but some users with clusters in NRT may see dropped connections or degraded performance at this time.
