Fault injectionFault injection is the practice of injecting artificial faults or failures in the system and watching how the system behaves. The core…3d ago3d ago
What does it mean to have a fail-safe distributed systemIf something is fail-safe, it has been designed so that if one part of it does not work, the whole thing does not become dangerous…Mar 4, 2024Mar 4, 2024
Sorry, You don’t have enough motivation to join our company or teamIn 2013 google news had a main feature: it would give you different coverages from different online news sources to the same story or…Jan 29, 20241Jan 29, 20241
Uninformed opinion in a meeting is just plain noiseOne feedback that I got more than once from different people in different companies is that I do my homework extensively before meetings…Jul 3, 2023Jul 3, 2023
Soft skills are not enough to be an engineering managerIn your opinion, What is the starting point for someone to get a position as a junior or a graduate developer in any company? My answer is…Jun 26, 2023Jun 26, 2023
Components of a good postmortemPostmortem is a written record of an incident. Postmortem after an incident is how to ensure that this incident will not happen again in…Aug 1, 2022Aug 1, 2022
When a postmortem is not enoughAs SREs what can we learn from Richard Feynman’s approach in inspecting the challenger space shuttle’s explosionMar 6, 2021Mar 6, 2021
Service quotas and spike patterns as part of the SLAs of multi tenant servicesIn his famous book, “High output management”, Andrew Grove (Intel’s co founder and ex-CEO) explained that sometimes quality checks can…Feb 11, 20211Feb 11, 20211