Automating Analysis of z/OS Alerts

By Jerry Street

You are nowhere near your workstation, and you receive an urgent text that z/OS has an unexpected increase in CPU utilization. While the alert is beneficial, it would be more helpful to have all the information needed to diagnose the issue already in hand. We discussed the pains of bad alerts in part 1 of this blog. The remainder of this blog discusses how you can integrate your z/OS alerting with the rich artificial intelligence of IntelliMagic Vision.

You should be able to expect more from your alerts than just another alarm in a sea of alarms. Especially for something as important as z/OS, alerts should provide actionable recommendations for the problems they flag. IntelliMagic Vision does just that. It can be configured to send detailed root cause analysis reports based on pre-defined z/OS alerts, allowing you to understand the impact and urgency of the alert and guiding subsequent investigation.

Do Not Settle for Bad z/OS Alerts

By Jerry Street

When I was growing up, long car rides were a bit challenging due to our car’s alerting system: smoke, steam, horrible clunking noises, or dead silence. Everything was great until Betsy (my mom always named our car Betsy) did not move anymore. Then we had to get the car to a mechanic who was an expert at making us feel ignorant and who took a lot of our money to fix something that was usually simple.

Then cars started getting better at alerting the operator about simple problems, but you still had to take the car to a mechanic to fix the problem. Today, between YouTube, Google, and Internet forums, you can often find the steps to resolve many of these alerts for a whole lot less money; however, there is still a gap between getting an alert from your car and fixing the issue.

What if your car could alert you to an issue, do an Internet search for you, and send a fix-it video to your smartphone before you could even get to a safe place to check it? That kind of intelligence would be convenient. The same principle applies to alerts you get from your z/OS operating system.

When I started working in Operations, back when we still called it “MVS”, an Operator would see an alert and call me (usually at night). I would sometimes have to drive into the office or call another Systems Programmer, analyze the alert, and act upon it. What if the alert could now automatically perform root cause analysis and send supporting reports to your smartphone?

One of the major problems with alerts in IT is that digitally oriented machines are generating so many that the Operators become desensitized to them. Projects to “clean up” alerts may end up filtering out necessary ones. I know of one project that was started to reduce alerts and thereby improve the alerting, and it created more problems than it solved. Many customers even ask for a single pane of glass to contain alerts and want them to be smarter. This can wind up being a single glass of pain that adds no value if alerts don’t lead to actionable solutions.
