There are good news and bad news in the world of NonStop Operation support.
First the good news: NonStop operation support teams have always been known to be one of the most technical and versatile groups in any enterprise environment. The team embodies a wealth of technical knowledge that is just as critical in ensuring the fault tolerance of the applications as the underlying NonStop hardware and Guardian operating system. They can diagnose and fix issues ranging from hardware, system software, application scheduling, database, network and others.
Now the bad news: Many members of that superb NonStop technical team are either retiring or looking to retire soon. What are you going to do to replace that talent pool? Is there a way to bring in new and younger resources to support your NonStop systems and applications to ensure it will continue to run smoothly?
I believe the answer is: "Yes." But it requires a new set of considerations.
"Set the right expectations"
Let's face it: Any new, young person you are hiring to learn the NonStop will never acquire the same level of knowledge as the retiring members. Why? Because most of the existing team members have accumulated their expertise through years (actually decades) of work experience. In fact, they have acquired knowledge for them to do work that usually require multiple people on other platforms like Windows or Unix to perform, e.g. Safeguard security (Security Department), Database and TMF (DBA), Web applications (eCommerce and Network team), Operating Systems (Architects), etc.
So, don't expect any incoming new hire to be able to step into those big shoes any time soon, if ever. This is an important point because that it clarifies the scope of what you are trying to accomplish with this new generation of team members. But that doesn't mean that the new team member couldn't be just as productive in many areas. It just requires some proper planning and commitments. First step is...
"Define the tasks that need to be done"
While some NonStop organizations have "Run Books" for operation, quite often, they are not maintained up to date. Even if they are, they represent only a subset of activities that NonStop Support team performs on a regular basis. Most NonStop shops will tell you that many NonStop operation activities revolve around dealing with things that come up unexpectedly. Ad hoc events like:
"This application seems to be not responding..."
"We got errors `out on EMS that we don't understand..."
"That disk is very busy..."
These tasks actually require a lot of technical experience to address, and usually we take it for granted that the team knows from experience what steps to follow to analyze the problem. If we are to bring in new members to support Operations, it is important that the tasks be defined and procedures be "codified" properly.
I am not suggest that we go back and create volumes and volumes of run time documentation. But, I believe we should at least categorize the level of operation support work that needs to be performed, so that a new comer can learn to grow into that certain technical level incrementally.
As an example of a "Technical Ladder" could be:
Level 1 - Basic operations to be functional on the system. Start and Stop jobs. Execute job streams.
Level 2 - What to monitor on the system. Look for error messages. Follow standard recovery procedure.
Level 3 - Handling problems. How to trouble shoot. Analyze performance issues.
There is no one size fits all, and every organization needs to define what fits their environment.
"Have a training plan"
Hint: Sending the new hire to standard NonStop Education class does not automatically fulfill your commitment to address this need.
In fact, I advocate that training the next generation of support team members on using the NonStop requires a whole new approach to training beyond standard HPE classes. I propose a new training paradigm that includes: Just-in-time training, Learning through analogies and Modern GUI tools. I hope to discuss these in a future blog installment.
What do you think? Please share your feedback. Thank you.
Phil Ly
TIC Software
I would also like to thank Mr. Jonathan Deveaux of comForte for reviewing and editing this blog.
Hello. Thank you for your comment to my post, and for sharing the syllabus of the new classes. The structure of the new Admin classes looks good, but the scope of topics could be overwhelming to a new, young newcomer to the Nonstop world. To reach a younger audience more effectively, I think the content, the length and the delivery method may need to be adjusted accordingly.
I plan to discuss more about "Just In Time" (JIT) training in my next post. But in general, the JIT approach is to provide just enough training content that the person needs to learn right now to apply in his or her job immediately. Training will be iterative with different "sprints" (to borrow a term from Agile development), according to a predefined technical ladder for the person to follow.
This goes back to the point that it requires involvement, planning and commitment from Management in the first place to make it work. Like the good question you posed in your comment: How realistic is that? Yet, we all know that management support is probably the most crucial requirement for success.
I applaud the inclusion of GUI tools in HPE training. But I would like to take it to the next level with this challenging question: Why not replace everything in Operation Training with GUI tools and eliminate the use of "green screen" (TACL) altogether? Do we think it is doable? Why or why not? Perhaps we can discuss that in another future post. Thanks.
Phil
Last year I was involved on re-writing the HPE NonStop Administration classes,
with the structure described in the Technical Ladder,
and including examples of the modern GUI tools.
H1SC3S HPE NonStop Admin I
H1SC4S HPE NonStop Admin II
H1SC5S HPE NonStop Admin III
I am the first to admit there cannot be a "one size fits all" solution.
The classes have to be generic to some extent.
They are designed to form a starting point for the training and development of the Operations Team. This must be reinforced by "in house" and "on the job" for the your particular environment.
I am not sure what is meant by "Just in Time Training".
I like the idea, but wonder how many organisations are pro-active enough for this to be a reality?
Please take a look at the areas covered in the new classes and let me have your feedback