Traditionally, most Enterprise Monitoring Teams (EMTs) took the top-down approach of consolidating company-wide monitoring in as few central platforms as possible. The EMT, often referred to as the NOC (Network Operations Center), focused primarily on availability monitoring of networks and infrastructure. Application teams, as the subject matter experts (SMEs), were mostly expected to monitor their own applications and relied on the EMT only for infrastructure monitoring.
However, with the rise of Apps (everything from back-office to customer-facing eCommerce), coupled with multi-tier and multi-vendor environments extending to virtualization and cloud frameworks, the lines between Apps and Infrastructure have blurred. SAP customers are in the thick of this change. EMTs need to evolve to meet the new demands of enterprise apps in the "Age of Agile", as BigPanda's 2017 annual State of Monitoring report calls it. While SAP teams are focused on delivering business value to internal and external customers, they continue to look for opportunities to offload application monitoring to EMTs. EMTs risk losing the battle to monitor their critical IT operations if they don't equip themselves to monitor SAP effectively, both today and in the future.
We'll explore the key challenges we see customers face every day and highlight some of the important findings in the BigPanda State of Monitoring report. Some are generic to the IT operations management (ITOM) arena, while others are specific to SAP and based on our experience with customers.
Problems
- SAP landscapes and systems keep growing, despite promises of consolidation from SAP 10 years ago such as landscape consolidation & harmonization and Run Simple
- SAP has grown too complex, with many modules and with issues that can occur at both the application level and the infrastructure level (see the Cause section)
- Decentralized teams, both in roles and logistics, and tool proliferation: the average IT team member uses 6-7 tools on a regular basis
- IT teams are mandated to "do more with less": think more Apps but fewer SAPs (Systems Administration Professionals)
- Modern IT stack is complex and diverse (e.g. on-premise, cloud, SaaS, ABAP, Java, integration systems/API servers, front-end servers)
- Age of agile: increased adoption of DevOps practices and monitoring as code causes a flood of 'things' that need to be monitored, and we're not even referring to IoT
- Higher frequency of code deployments, driven by growing business needs to stay competitive and data-driven
- Higher frequency of infrastructure changes, primarily due to the availability of platform technologies such as IaaS, PaaS, and containers to meet application demands
- Additional challenges from enterprise cybersecurity threats, SIEM (Security Information and Event Management) requirements
- Incident response and audit requirements are becoming critical for security breaches, availability issues, performance degradation, and regulatory compliance
Cause
Business disruptions can come from any of these SAP-related problems, some documented from our SAP Monitoring Basis checklist:
- Job failures (e.g. Batch job aborts, excess job runtimes, BW process chain failures)
- Connectivity issues between business partner systems (e.g. RFC availability, communication latency)
- Transaction failures (e.g. program dumps, data errors, IDoc errors)
- Transaction delays (e.g. batch job delays, tRFC/qRFC queue delays)
- Poor performance (e.g. online, batch, updates, queue processing)
- Service component failures (e.g. Message server, Internet Connection Manager, Print services, DB issues)
- Security (e.g. user locks, expired service accounts)
- Code deployment errors (e.g. transport errors, support pack issues)
- Infrastructure outages (e.g. physical server failures, space issues, capacity utilization)
- Configuration (e.g. misconfiguration or under-configured services such as dialog & batch queues)
The State of Monitoring report found the following among 1,500+ ITOM respondents at medium to large enterprises:
- Top 5 monitoring challenges:
- Quickly remediating service disruptions
- Securing a budget for the proper monitoring tools
- Reducing alert noise from the organization's monitoring tools
- Delivering a product or business objective to schedule
- Quickly identifying service disruptions
- Among respondents that reported over 100 alerts per day, only 26% are able to investigate and remediate the majority (75-100%) within 24 hours.
- Few developers build monitoring into their code (SAP is slow in the DevOps world) and rely on others to monitor their enterprise application
- Customer experience is king: customer satisfaction has become the most important KPI; others include SLA compliance and MTTR (mean time to repair/restore)
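Two of the figures above, MTTR and the share of alerts remediated within 24 hours, are simple to compute once alert open and resolve times are recorded. The following is a minimal sketch, not the report's methodology: the record layout (pairs of opened/resolved timestamps) and the sample data are assumptions made purely for illustration.

```python
from datetime import datetime, timedelta
from statistics import mean

# Hypothetical alert records: (opened, resolved) pairs.
# resolved is None when the alert is still open.

def mttr_hours(alerts):
    """Mean time to repair/restore in hours, over resolved alerts only."""
    durations = [
        (resolved - opened).total_seconds() / 3600
        for opened, resolved in alerts
        if resolved is not None
    ]
    return mean(durations) if durations else None

def remediated_within(alerts, window=timedelta(hours=24)):
    """Fraction of all alerts (open or closed) resolved within the window."""
    if not alerts:
        return 0.0
    on_time = sum(
        1
        for opened, resolved in alerts
        if resolved is not None and resolved - opened <= window
    )
    return on_time / len(alerts)

# Illustrative sample data (invented, not taken from the report).
t0 = datetime(2017, 1, 1, 8, 0)
sample_alerts = [
    (t0, t0 + timedelta(hours=2)),
    (t0, t0 + timedelta(hours=30)),
    (t0, t0 + timedelta(hours=4)),
    (t0, None),  # still open
]

print(mttr_hours(sample_alerts))         # 12.0
print(remediated_within(sample_alerts))  # 0.5
```

Note that the two numbers answer different questions: MTTR here ignores alerts that are still open, while the remediation rate counts them as misses, so a team can show a respectable MTTR and still fall short of the 75-100% band.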
Solution
How does the enterprise monitoring industry go about meeting these growing challenges? Only a troubling 13% agreed that they are very satisfied with their approach to monitoring, and just 11% are satisfied based on overall investment. Based on the survey responses, here are some key areas of focus:
- Separate signal from the noise: with all leading indicators suggesting that alert volumes are likely to continue their upward trend, IT teams will be compelled to find a way to keep monitored data relevant and to scale it effectively
- Those who are satisfied with their monitoring strategy demonstrate far better rates of remediation, in addition to many other benefits including some outlined in our SAP Performance articles
- Biggest IT monitoring challenges of 2017 and beyond:
- Improving monitoring strategy
- Modernizing monitoring architecture
- Effectively managing alerts
- Security
- Budget
- Cloud migration
- Centralizing and consolidating monitoring tools
- Scaling monitoring with growth
- Improving root cause identification
- Staffing qualified personnel
 
- If you could make one change to your current monitoring strategy, what would it be?
- Refining overall strategy or processes
- Investing in new tools
- Centralizing/consolidating the monitoring stack
- Event management and alert correlation
- Automation
- Staffing qualified personnel
- Refining monitoring metrics
- Alert noise reduction
- Securing additional funding
- Improving incident management
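To make "separating signal from the noise" and "alert correlation" more concrete, here is a minimal sketch of one common approach: collapsing repeated alerts for the same system and problem category into a single incident when they arrive close together. It is an illustration under stated assumptions, not how BigPanda or any SAP tool implements correlation. The field names (`system`, `category`, `ts`), the sample system ID, and the 5-minute window are all hypothetical.

```python
def correlate(alerts, window_seconds=300):
    """Group alerts that share (system, category) into incidents.

    An alert joins the current incident for its key if it arrives within
    window_seconds of that incident's most recent alert; otherwise it
    starts a new incident.
    """
    incidents = []
    current = {}  # (system, category) -> incident currently being extended
    for alert in sorted(alerts, key=lambda a: a["ts"]):
        key = (alert["system"], alert["category"])
        incident = current.get(key)
        if incident and alert["ts"] - incident["last_ts"] <= window_seconds:
            incident["count"] += 1
            incident["last_ts"] = alert["ts"]
        else:
            incident = {
                "system": alert["system"],
                "category": alert["category"],
                "first_ts": alert["ts"],
                "last_ts": alert["ts"],
                "count": 1,
            }
            current[key] = incident
            incidents.append(incident)
    return incidents

# Illustrative raw alerts; ts is seconds since an arbitrary start.
raw_alerts = [
    {"system": "PRD", "category": "job_failure", "ts": 0},
    {"system": "PRD", "category": "job_failure", "ts": 60},
    {"system": "PRD", "category": "job_failure", "ts": 120},
    {"system": "PRD", "category": "rfc_latency", "ts": 90},
    {"system": "PRD", "category": "job_failure", "ts": 1000},
]

incidents = correlate(raw_alerts)
print(len(raw_alerts), "alerts ->", len(incidents), "incidents")
```

In this sample, the three job failures within two minutes become one incident, while the later failure at `ts=1000` opens a new one. Real correlation engines typically also use topology and dependency data; keying on system and category alone is the simplest possible stand-in.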
 
Summary
EMTs can gain the edge in the battle to monitor SAP effectively amid growing organizational complexity and challenges, provided they focus on digital transformation based on these key takeaways:
- Doing more with less
- Automate, automate, automate!
 
- More tools, more moving parts
- Select enterprise solutions that can adapt to the growing complexity and reduce the noise as well as administration overhead
 
- Alert noise is not getting any quieter
- Centralize and automate the monitoring processes while enhancing the ability to filter, correlate, and manage events in relation to SLA
 
- An effective monitoring strategy is key
- Develop a future-proof monitoring process with speed and agility that can scale with organizational growth
 
- It all boils down to the customer experience
- Bridge the great IT and Business divide by focusing on solutions that excel in usability, personalization, cross-platform support, and customer service.
 



