Observability

Observability Domain Inspector Origin Inspector
 
15 April 2024, 21:42 UTC

We are investigating elevated errors for evaluating Fastly Alerts configured on Domain Inspector and Origin Inspector metrics.

 
15 April 2024, 22:00 UTC

Our engineers have identified the contributing factor and are applying a fix our Domain Inspector, Origin Inspector.

 
15 April 2024, 22:46 UTC

Engineering has confirmed the impact to Domain Inspector, Origin Inspector has been mitigated.

 
15 April 2024, 23:05 UTC

Engineering has confirmed that impact to the Domain Inspector and Origin Inspector metrics has been fully restored. Customers may have experienced degraded service for Fastly Alerts from 21:42 to 23:05 UTC.

This incident is resolved.

Observability Real-time Log Streaming
 
14 February 2024, 17:28 UTC

Fastly Engineering detected a performance impact event affecting Streaming Logs in various POPs throughout our network. Customers may have experienced a delay or discarded log messages from 16:19 to 16:38 UTC.

This incident is resolved.

Platform Observability North America Historical Stats Real-time Analytics Columbus (CMH)
 
29 December 2023, 03:00 UTC

We're currently investigating performance issues with our Real Time Analytics and Historical Stats services at our Columbus (CMH) Point of Presence (POP).

All other services are unaffected.

 
29 December 2023, 04:03 UTC

Our engineers have identified the contributing factor and are applying a fix to Real Time Analytics and Historical Stats services at our Columbus (CMH) Point of Presence (POP).

All other locations and services are unaffected.

 
29 December 2023, 04:28 UTC

Engineering has confirmed the impact to Real Time Analytics and Historical Stats services at our Columbus (CMH) Point of Presence (POP) has been mitigated.

 
29 December 2023, 04:40 UTC

Engineering has confirmed that Real Time Analytics and Historical Stats services at our Columbus (CMH) Point of Presence (POP) has been fully restored. Customers may have experienced a delay or dropped observability product data from. 01:15 to 04:05 UTC.

This incident is resolved.

Affected customers may have experienced impact to varying degrees and to a shorter duration than as set forth above.

19 December 2023, 18:51 UTC
Observability Origin Inspector
 
19 December 2023, 18:51 UTC

We're currently investigating performance issues with our Origin Inspector service.

All other services are unaffected.

 
19 December 2023, 20:47 UTC

Our engineers have identified the contributing factor and are applying a fix to Origin Inspector

 
19 December 2023, 23:11 UTC

Engineering has deployed a fix and have confirmed a gradual recovery to Origin Inspector. We will continue to monitor until we’ve confirmed that customer experience has been fully restored.

 
20 December 2023, 03:50 UTC

Engineering has confirmed that our Origin Inspector service has been fully restored. Customers may have experienced missing historical Origin Inspector metrics in both the APIs and web interface from the 14th December 2023 at 00:00 to the 20th of December 2023 at 03:18 UTC.

This incident is resolved.

Platform Observability Europe Historical Stats Real-time Analytics Lisbon (LIS)
 
03 December 2023, 17:53 UTC

We're currently investigating performance issues with our Real Time Analytics and Historical Stats services only in our Lisbon (LIS) data center.

All other services and data centers are unaffected.

 
03 December 2023, 22:51 UTC

This event has been resolved.

14 November 2023, 21:49 UTC
Observability Real-time Log Streaming
 
14 November 2023, 21:49 UTC

We have identified the cause of elevated errors in our Streaming Logs service and are deploying a fix. 

Our network availability and all other services are unaffected by this incident 

 
14 November 2023, 22:14 UTC

A fix has been implemented and we are monitoring the results.

 
14 November 2023, 22:53 UTC

Engineering has confirmed that the degraded performance for streaming log services has been fully restored. Customers may have experienced varying degrees of log loss or delays in log delivery from 20:50 UTC to 21:50 UTC as a result of this incident.

This incident has been resolved.

Observability Real-time Log Streaming
 
14 November 2023, 13:41 UTC

Fastly has identified an issue in which customers may see error messages in the Fastly UI for S3 and Kinesis endpoints indicating that a token is expired. However, this is isolated to the Fastly logging system is intermittently hitting a rate with the AWS Security Token Service (STS) API. This intermittent error does not appear to be causing log loss, but results in an error messaging in the UI. This only affects endpoints S3 and Kinesis endpoints that are using role-based authentication. 

Fastly is currently working to resolve this intermittent error. All other locations and services are unaffected. 

 
15 November 2023, 00:16 UTC

Engineering has deployed a fix to mitigate rate limiting errors and have observed a gradual recovery for streaming log services. We will continue to monitor the effects of the change and will post an update once services have been fully restored.

 
15 November 2023, 16:52 UTC

Our investigations into previously deployed mitigation measures has verified that our customers should no longer experience log loss as a result of this incident.

We investigated into the continued reports of error messages observed within the Fastly App and identified an error in the timing when reacquiring temporary credentials. We have confirmed that the impact to streaming log services has been resolved, and we do not see log loss in connection to this error message.

We are deploying an additional fix to resolve this Fastly App UI error message for our customers. We will post an update once all remaining error messages have been fully corrected.

 
15 November 2023, 22:49 UTC

A fix was deployed and we have observed role-based S3 and Kinesis logging endpoints returning to normal in the Fastly UI. Services that handle little to no traffic may see the error remaining until the logging system has successfully sent a batch of logs.

13 November 2023, 03:23 UTC
Observability Historical Stats
 
13 November 2023, 03:23 UTC

We're investigating elevated errors in Historical Stats.

All other locations and services are unaffected

 
13 November 2023, 04:23 UTC

We're still investigating elevated errors in Historical Stats.

During this time, customers may experience 5xx errors when accessing the Historical Stats endpoint.

All other locations and services are unaffected

 
13 November 2023, 06:36 UTC

This issue has been identified and a fix is being implemented. 

 
13 November 2023, 06:49 UTC

A fix has been implemented and we are monitoring the results.

We've observed a significant decrease in 5xx errors, but are currently monitoring for logs that may have failed to deliver during this incident.

 
13 November 2023, 07:06 UTC

This incident has been resolved and no further impacts have been observed. 

09 November 2023, 15:37 UTC
Observability Real-time Log Streaming
 
09 November 2023, 15:37 UTC

We're investigating elevated errors in Streaming Logs. 

 
09 November 2023, 15:41 UTC

This issue has been identified and a fix is being implemented. 

 
09 November 2023, 15:55 UTC

A fix has been implemented and we are monitoring the results.

 
09 November 2023, 16:29 UTC

Engineering has confirmed that Streaming Logs has been fully restored. Customers sending log messages from the affected POPs would experience a similar proportion of log messages discarded from 15:20 to 15:53 UTC.

This incident is resolved.

Affected customers may have experienced impact to varying degrees and to a shorter duration than as set forth above.

08 November 2023, 17:30 UTC
Observability Real-time Log Streaming
 
08 November 2023, 17:30 UTC

We're investigating elevated errors in Streaming Logs. 

All other locations and services are unaffected

 
08 November 2023, 17:33 UTC

This issue has been identified and a fix is being implemented. 

 
08 November 2023, 17:55 UTC

A fix has been implemented and we are monitoring the results.

 
08 November 2023, 19:05 UTC

A fix has been implemented and we are still monitoring the results. We have seen partial improvement for the Streaming Logs. Our Dublin (DUB) POP is still experiencing elevated errors at this time.

 
08 November 2023, 19:45 UTC

Engineering has confirmed that Streaming Logs has been fully restored. Customers sending log messages from the affected POPs would experience a similar proportion of log messages discarded from 16:50 to 19:35 UTC.

This incident is resolved.

Affected customers may have experienced impact to varying degrees and to a shorter duration than as set forth above.