Resolved -
Between Dec 1 12:20 UTC and Dec 2 1:05 UTC, availability of large hosted runners for Actions was degraded due to failures in background VM provisioning jobs. Users would see workflows queued, waiting for a runner. On average, 8% of workflows requiring large runners were affected during the incident window, peaking at 37.5% of requests. There were also lower levels of intermittent queuing on Dec 1 beginning around 3:00 UTC. Standard and Mac runners were not affected.
The job failures were caused by timeouts calling a dependent service in the VM provisioning flow and by gaps in the jobs' resilience to those timeouts. The incident was mitigated by bypassing the dependency, as it was not in the critical path of VM provisioning.
We are making a few immediate improvements in response to this incident. First, we are addressing the causes of the failed calls to improve the availability of that backend service. Second, even with those failures, the critical flow of large VM provisioning should not have been impacted, so we are improving the client behavior to fail fast and circuit-break non-critical calls. Finally, the alerting for this service was not adequate in this scenario to ensure a fast response by our team, so we are improving our automated detection to reduce our time to detection and mitigation of issues like this one in the future.
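For illustration, a minimal sketch of the fail-fast and circuit-breaking behavior described above is shown below, written in Go. This is not GitHub's actual provisioning code: annotateVM, the failure threshold, the cooldown, and the timeout are hypothetical stand-ins for a non-critical dependency call that should not block VM provisioning when it times out.

package main

import (
	"context"
	"errors"
	"fmt"
	"sync"
	"time"
)

// circuitBreaker skips calls to a non-critical dependency after a run of
// consecutive failures, until a cooldown period elapses.
type circuitBreaker struct {
	mu        sync.Mutex
	failures  int
	threshold int
	cooldown  time.Duration
	openUntil time.Time
}

var errCircuitOpen = errors.New("circuit open: skipping non-critical call")

func (cb *circuitBreaker) call(ctx context.Context, fn func(context.Context) error) error {
	cb.mu.Lock()
	if time.Now().Before(cb.openUntil) {
		cb.mu.Unlock()
		return errCircuitOpen // fail fast without touching the dependency
	}
	cb.mu.Unlock()

	err := fn(ctx)

	cb.mu.Lock()
	defer cb.mu.Unlock()
	if err != nil {
		cb.failures++
		if cb.failures >= cb.threshold {
			cb.openUntil = time.Now().Add(cb.cooldown)
			cb.failures = 0
		}
		return err
	}
	cb.failures = 0
	return nil
}

func main() {
	cb := &circuitBreaker{threshold: 3, cooldown: 30 * time.Second}

	// annotateVM stands in for the hypothetical non-critical dependency that
	// was timing out; a short client-side timeout keeps each attempt bounded.
	annotateVM := func(ctx context.Context) error {
		ctx, cancel := context.WithTimeout(ctx, 2*time.Second)
		defer cancel()
		<-ctx.Done() // simulate the dependency hanging until the timeout fires
		return ctx.Err()
	}

	for i := 0; i < 5; i++ {
		if err := cb.call(context.Background(), annotateVM); err != nil {
			fmt.Println("non-critical call skipped or failed:", err)
		}
		// The critical provisioning work would continue here regardless of
		// whether the non-critical call succeeded.
	}
}

After three consecutive timeouts the breaker opens, so later attempts return immediately instead of adding latency or failures to the provisioning path.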
Dec 2, 01:05 UTC
Update -
We've applied a mitigation to fix the issues with large runner jobs processing. We are seeing improvements in telemetry and are monitoring for full recovery.
Dec 2, 00:57 UTC
Update -
We continue to investigate large hosted runners not picking up jobs.
Dec 2, 00:14 UTC
Update -
We continue to investigate issues with large runners.
Dec 1, 23:43 UTC
Update -
We're seeing issues related to large runners not picking up jobs and are investigating.
Dec 1, 23:24 UTC
Investigating -
We are currently investigating this issue.
Dec 1, 23:18 UTC