Previous incidents

May 2022
May 17, 2022
1 incident

Occasional API timeouts

Degraded

Resolved May 17 at 03:00pm PDT

We have scaled services to address the issue.

1 previous update

May 15, 2022
1 incident

Test run scaling maintenance

Maintenance

Resolved May 15 at 07:45am PDT

We are changing the way scaling works for test runs. This requires us to migrate our database and have a short window of downtime for test runs. Test runs will resume after maintenance.

May 11, 2022
1 incident

Emergency downtime to restore proper database backups

Maintenance

Resolved May 11 at 06:00am PDT

Due to problems with database backups, we are taking a short downtime to switch to newer database infrastructure. Runs that were started before the downtime may fail but should succeed when rerun.

May 10, 2022
2 incidents

Database issues

Downtime

Resolved May 10 at 01:30pm PDT

We will provide a post-mortem by the end of the week.

2 previous updates

Infrastructure upgrade

Maintenance

Resolved May 10 at 07:29am PDT

This maintenance is complete.

1 previous update

May 06, 2022
1 incident

New test runs are not scheduling

Degraded

Resolved May 06 at 10:25am PDT

Test runs are scheduling again. We are still actively monitoring to make sure there are no additional issues.

2 previous updates

April 2022
Apr 29, 2022
1 incident

Planned maintenance

Maintenance

Resolved Apr 29 at 06:00am PDT

app.qawolf.com and all test runs will be down for planned maintenance for approximately 30-60 minutes. Scheduled runs will run after the downtime. Deployment notifications may receive error responses and will need to be resent after the downtime.

March 2022
Mar 09, 2022
1 incident

Test runs are slow to start or stalled

Degraded

Resolved Mar 09 at 11:20am PST

We identified the cause, and runs are proceeding normally now.

1 previous update

Mar 02, 2022
1 incident

Test runs slow to start or canceled

Degraded

Resolved Mar 02 at 01:33pm PST

This partial run outage is now resolved. For approximately 2 hours, some test runner nodes were unable to pull images from DockerHub, which caused scheduled runs to fail to start. We took several steps to mitigate the issue, but we believe it was a temporary issue with the DockerHub API that has since been resolved. At this point, all runs seem to be succeeding normally.

1 previous update