A misconfiguration on new Metrics storage nodes caused a bad cluster state which in turn causes issues with the ingestion. We are working on fixing this issue.
14:35 UTC: We found the cause of the issue and are working on fixing it.
14:47 UTC: The root cause is fixed and the ingestion is now running at full speed. The misconfiguration issue was just half the story, what caused this issue was a partial network split.
14:56 UTC: Ingestion is all caught up. Incident is over.