News: An outage that lasted almost a day came as a result of using the wrong case.
Google found out that getting even the smallest detail wrong can result in severe problems for its Cloud Platform.
An outage that lasted 21 hours and 38 minutes on December the 8th and 9th has been attributed to incorrectly typing the name of a case-sensitive variable.
In an incident report on the company’s site, it has been revealed that: "a minor update to the Compute Engine API inadvertently changed the case-sensitivity of the "sessionAffinity" enum variable in the target pool definition, and this variation was not covered by testing."
"Google Container Engine was not aware of this change and sent requests with incompatible case, causing the Compute Engine API to return an error status."
The problem was resolved through a rollback, with Google engineers committing a fix to the Kubernetes public open source repository.
Google, said: "This is not the level of quality and reliability we strive to offer you, and we have taken and are taking immediate steps to improve the platform’s performance and availability."