Step Function has a high error percentage

A check that detects high error rates in your Step Functions state machines.

Dashbird continuously monitors and analyses your serverless applications to ensure reliability, cost and performance optimisation, and alignment with the AWS Well-Architected Framework.


Severity: CRITICAL
Interval: 30 minutes
Time slot: 60 minutes
Threshold: 5% error rate

Metrics:
METRICS.STEPFUNCTIONS.FAILED
METRICS.STEPFUNCTIONS.SUCCEEDED
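
The two metrics above suggest how the error rate could be derived. The sketch below is an assumption about that calculation, not Dashbird's documented formula: it treats the error rate as failed executions divided by the sum of failed and succeeded executions in the time slot, and flags a breach only when the rate is strictly above the 5% threshold. The function names are hypothetical.

```python
THRESHOLD_PERCENT = 5.0  # threshold taken from the rule definition above


def error_rate_percent(failed: int, succeeded: int) -> float:
    """Return failed executions as a percentage of finished executions.

    Assumption: the rate is failed / (failed + succeeded).
    """
    total = failed + succeeded
    if total == 0:
        # No finished executions in the time slot, so there is nothing to flag.
        return 0.0
    return 100.0 * failed / total


def breaches_threshold(failed: int, succeeded: int,
                       threshold: float = THRESHOLD_PERCENT) -> bool:
    """Assumption: the check fires only when the rate is strictly above the threshold."""
    return error_rate_percent(failed, succeeded) > threshold
```

For example, 6 failed and 94 succeeded executions in the 60-minute time slot would give a 6% error rate, which is above the threshold.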

Why do I see this?

One of your state machines exceeded the 5% error rate threshold within the 60-minute time slot.

What does this mean?

Some errors are inevitable and should be handled in the application itself. A significant increase in the error rate, however, is a cause for concern and needs further investigation.

How do I fix "Step Function has a high error percentage"?

Check the input payloads of the state machine's executions for the time period in which the errors started rising.
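
One way to do this is to list the failed executions in the affected window and read their inputs. The following is a minimal sketch, assuming a Standard workflow and a boto3 Step Functions client passed in by the caller. The function name `failed_execution_inputs` and the ARN in the usage note are hypothetical. `list_executions` and `describe_execution` are existing Step Functions API operations.

```python
from datetime import datetime, timezone


def failed_execution_inputs(sfn_client, state_machine_arn, start, end):
    """Yield (name, start time, input) for FAILED executions started in [start, end).

    sfn_client is expected to behave like boto3.client("stepfunctions").
    """
    kwargs = {"stateMachineArn": state_machine_arn, "statusFilter": "FAILED"}
    while True:
        page = sfn_client.list_executions(**kwargs)
        for item in page["executions"]:
            # Keep only executions that started inside the window of interest.
            if not (start <= item["startDate"] < end):
                continue
            detail = sfn_client.describe_execution(
                executionArn=item["executionArn"]
            )
            # "input" is the JSON payload the execution was started with.
            yield item["name"], item["startDate"], detail.get("input")
        token = page.get("nextToken")
        if not token:
            break
        kwargs["nextToken"] = token
```

With real credentials, this could be called as `failed_execution_inputs(boto3.client("stepfunctions"), "<your state machine ARN>", start, end)`, with `start` and `end` as timezone-aware datetimes. Comparing the returned payloads with those of successful executions from the same period may show which inputs trigger the failures.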


This rule resolution is part of the Dashbird Serverless Well Architected Reports tool for AWS. Dashbird features a collection of rules and checks continuously applied to your infrastructure, surfacing ways to improve it.
