1

How does one get better at responding to issues that occur in production?? I feel the only way I learn is by having the issue occur and then fixing it...any more proactive ways available??

Comments
  • 1
    Monitoring (I like new relic for webapps but there's loads out there)
    Logging (centralised)
    Alerts
    Thorough testing
    Proper validation...
    It's a bit hard to anticipate the unknown. And once it's known? You fix it...
  • 1
    Netflix runs a so called chaos monkey that randomly kills services then if anything breaks that gets fixed.
  • 0
    @atheist I like this idea
    Gonna unleash some chaos monkeys at work tomorrow (helped by the fact that one of Android's test tools is called monkey...)
Add Comment