Logging is when you get an unanticipated failure. It allows you to go back see what happened. You can do a an RCA, a Root Cause Analysis , on it and figure out how to solve it, not just for now, but for the future. That gets back into the automation again, right?
What is Site Reliability Engineering (SRE)?