and see them coming so that you can proactively solve them. Logging is for when you get an unanticipated failure. It allows you to go back and see what happened. You can do an RCA, a Root Cause Analysis, on it and figure out how to
What is Site Reliability Engineering (SRE)?