What is Wrong with Facebook today 2019
By
fardhan alief
—
Saturday, December 28, 2019
—
The key flaw that caused this outage to be so severe was an unfortunate handling of an error condition. An automated system for verifying configuration values ended up causing much more damage than it fixed.
The intent of the automated system is to check for configuration values that are invalid in the cache and replace them with updated values from the persistent store. This works well for a transient problem with the cache, but it does not work when the value in the persistent store is itself invalid.
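To make the difference between the two cases concrete, here is a minimal sketch of a check-and-repair read path. It is not Facebook's code: the `CountingStore` class, the `is_valid` rule (a positive integer), and the `timeout` key are all hypothetical, chosen only to illustrate the behavior described above.

```python
class CountingStore:
    """Stand-in for the persistent store; counts how often it is queried."""

    def __init__(self, value):
        self.value = value
        self.queries = 0

    def read(self):
        self.queries += 1
        return self.value


def is_valid(value):
    # Hypothetical validity rule for this sketch: a positive integer.
    return isinstance(value, int) and value > 0


def get_config(cache, store, key):
    """Return the cached value, refreshing it from the store if it looks invalid."""
    value = cache.get(key)
    if not is_valid(value):
        value = store.read()
        cache[key] = value
    return value


# Transient cache problem: a single store query repairs the cache.
good_store = CountingStore(30)
cache = {"timeout": -1}
for _ in range(1000):
    get_config(cache, good_store, "timeout")
transient_queries = good_store.queries

# Invalid persistent value: the repair never sticks, so every read hits the store.
bad_store = CountingStore(-1)
cache = {"timeout": -1}
for _ in range(1000):
    get_config(cache, bad_store, "timeout")
persistent_queries = bad_store.queries
```

In the first run the store is queried once and the cache then serves every later read. In the second run the cache can never hold a valid value, so the number of store queries grows with the number of reads.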
Today we made a change to the persistent copy of a configuration value that was interpreted as invalid. This meant that every single client saw the invalid value and attempted to fix it. Because the fix involves making a query to a cluster of databases, that cluster was quickly overwhelmed by hundreds of thousands of queries a second.
To make matters worse, every time a client got an error while attempting to query one of the databases, it interpreted the error as an invalid value and deleted the corresponding cache key. This meant that even after the original problem had been fixed, the stream of queries continued. As long as the databases failed to service some of the requests, they were causing even more requests to themselves. We had entered a feedback loop that did not allow the databases to recover.
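The loop hinges on how a client reacts to a failed query. The sketch below contrasts the behavior described above with one possible alternative that leaves the cache entry alone on error. Both functions, the `StoreError` exception, and the `timeout` key are hypothetical names for illustration; the alternative is an assumption about what a gentler design could look like, not the fix Facebook adopted.

```python
class StoreError(Exception):
    """Raised when the (simulated) database cannot service a query."""


def refresh_deleting_on_error(cache, key, query):
    """Behavior described above: a query error is treated like an invalid value."""
    try:
        cache[key] = query()
    except StoreError:
        # The key is dropped, so the next reader misses the cache
        # and sends yet another query to the struggling database.
        cache.pop(key, None)


def refresh_keeping_on_error(cache, key, query):
    """Assumed alternative: on error, keep whatever the cache already holds."""
    try:
        cache[key] = query()
    except StoreError:
        # Leave the entry in place; the failure generates no follow-up load.
        pass


def overloaded_query():
    raise StoreError("database overloaded")


cache_a = {"timeout": 30}
refresh_deleting_on_error(cache_a, "timeout", overloaded_query)

cache_b = {"timeout": 30}
refresh_keeping_on_error(cache_b, "timeout", overloaded_query)
```

After the failed refresh, `cache_a` no longer has the key, which forces further database queries, while `cache_b` still serves the previous value.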
The way to stop the feedback cycle was quite painful: we had to stop all traffic to this database cluster, which meant turning off the site. Once the databases had recovered and the root cause had been fixed, we slowly allowed more people back onto the site.
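The post does not say how traffic was let back in. As one illustrative possibility, a gradual reopening can be sketched as a percentage ramp over stable user buckets; the `bucket` and `is_admitted` functions and the user ids below are hypothetical.

```python
import hashlib


def bucket(user_id: str) -> int:
    """Map a user id to a stable bucket in the range 0..99."""
    digest = hashlib.sha256(user_id.encode("utf-8")).digest()
    return int.from_bytes(digest[:4], "big") % 100


def is_admitted(user_id: str, percent_open: int) -> bool:
    """Admit the user if their bucket falls below the current ramp level."""
    return bucket(user_id) < percent_open


# Raise the ramp level in steps and count how many users get through.
users = [f"user{i}" for i in range(1000)]
admitted_by_step = [
    sum(is_admitted(u, pct) for u in users) for pct in (0, 10, 50, 100)
]
```

Because each user always lands in the same bucket, anyone admitted at one step stays admitted at every later step, so load on the databases only grows as fast as the operator raises the percentage.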
This got the site back up and running today, and for now we have turned off the system that attempts to correct configuration values. We are exploring new designs for this configuration system, following the design patterns of other systems at Facebook that deal more gracefully with feedback loops and transient spikes.
We apologize again for the site outage, and we want you to know that we take the performance and reliability of Facebook very seriously.