There were two problems. First, the monitoring system did not send an alert, so we did not know anything was wrong with the host server until around 3 PM.
Second, host-level failures are nothing new to us: we ran into hardware problems even back in our AWS cloud computing days. In this case, DNS resolution on the host server was broken, so the server could not resolve the domain name of the internal monitoring system. As a result, the processing of plurks on the timeline failed to execute.
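To make this failure mode concrete, here is a minimal sketch of a DNS resolution check that a host could run before depending on an internal service. It is an illustration only, not the check we actually run: the function names `resolve_host` and `check_dns` and the hostname in the usage note are hypothetical, and the resolver is passed in as a parameter so the check can be exercised without touching real DNS.

```python
import socket
from typing import Callable, Iterable, List, Optional


def resolve_host(hostname: str, resolver: Callable = socket.getaddrinfo) -> Optional[str]:
    """Return the first resolved IP address for hostname, or None if resolution fails."""
    try:
        results = resolver(hostname, None)
    except socket.gaierror:
        # Resolution failed, e.g. because the host's DNS setup is broken.
        return None
    if not results:
        return None
    # Each entry is (family, type, proto, canonname, sockaddr); sockaddr[0] is the IP.
    return results[0][4][0]


def check_dns(hostnames: Iterable[str], resolver: Callable = socket.getaddrinfo) -> List[str]:
    """Return the hostnames that could not be resolved (hypothetical helper)."""
    return [name for name in hostnames if resolve_host(name, resolver) is None]
```

A job could call `check_dns(["monitoring.internal.example"])` (a placeholder name) at startup and raise an alert through a separate channel if the returned list is not empty, rather than letting the failure surface later as stalled timeline processing.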