(All times in UTC)
11:30 Our main API keeps stopping to respond. We are investigating it.
This impacts the following, in an irregular fashion:
-
clever ssh
may not succeed
- Some deployments may not go through
Applications should keep running, but some monitoring deployments may fail.
12:55 The API seems to have stabilized. The database seems to have had a huge load. We are investigating the queries responsible for that load and try to improve them.