Previous incidents
Occasional API timeouts
Resolved May 17 at 03:00pm PDT
We have scaled services to address the issue.
1 previous update
Test run scaling maintenance
Resolved May 15 at 07:45am PDT
We are changing the way scaling works for test runs. This requires us to migrate our database and have a short window of downtime for test runs. Test runs will resume after maintenance.
Emergency downtime to restore proper database backups
Resolved May 11 at 06:00am PDT
Due to problems with database backups, we are taking a short downtime to switch to some newer database infrastructure. Runs that were started before the downtime may fail but should succeed when rerun.
Database issues
Resolved May 10 at 01:30pm PDT
We will be providing a post mortem by the end of the week.
2 previous updates
Infrastructure upgrade
Resolved May 10 at 07:29am PDT
This is completed
1 previous update
New test runs are not scheduling
Resolved May 06 at 10:25am PDT
Test runs are scheduling again, we are still actively monitoring to make sure there are not additional issues.
2 previous updates
Planned maintenance
Resolved Apr 29 at 06:00am PDT
app.qawolf.com and all test runs will be down for planned maintenance for approximately 30-60 minutes. Scheduled runs will run after the downtime. Deployment notifications may receive error responses and will need to be resent after the downtime.
Test runs are slow to start or stalled
Resolved Mar 09 at 11:20am PST
We identified the cause and runs are happening normally now
1 previous update
Test runs slow to start or canceled
Resolved Mar 02 at 01:33pm PST
This partial run outage is now resolved. For approximately 2 hours, some test runner nodes were unable to pull images from DockerHub, leading to scheduled runs failing to start. We took several steps to mitigate the issue but we believe it to be a temporarily issue with the DockerHub API that is resolved. At this point, all runs seem to be succeeding normally.
1 previous update