Enhance services availability assurance in alignment with ELM business objectives and IT Operations targets; sustain ELM OMC (Observability & Monitoring Center) operations as per defined objectives; operate, support, and modernize the ELM observability stack to develop a culture of proactive services support and operations toward availability assurance; and recruit and develop high-caliber staff for the operations support team.
Duties and responsibilities:
- Ensure implementation of Departmental strategy and related strategic plans in order to achieve agreed upon strategic goals and objectives
- Prepare and recommend the annual OMC budget by conducting analysis and preparing data related to specific elements as directed, and ensure the needed optimization in light of IT Ops scaling plans.
- Monitor the financial performance of the OMC against budgets so that areas of unsatisfactory performance are identified and rectified promptly and potential performance improvement opportunities are capitalized upon.
- Manage the daily operations of OMC to ensure that work processes are implemented as designed and comply with established policies and procedures
- Manage the preparation of timely and accurate reports to meet company and departmental requirements, policies and standards
- Assure 24x7 OMC shift operations, including alert and event handling and analysis
- Assure proactive discovery of service issues
- Assure dashboard observability and issue reporting
- Assure tuning of the monitoring implementation
- Assure ongoing reporting of services availability status
- Assure ownership and tracking of resolution for auto-reported issues (communicate and coordinate resolution with all parties), and orchestrate resolution through collaboration and war-room ownership.
- Plan and implement the observability roadmap, and design and implement the observability platform.
- Manage ongoing adoption of monitoring systems.
- Manage operations and support of the monitoring solution in compliance with defined targets.
- Implement, maintain, and drive adoption of a centralized configuration management system.
- Support automation of L2 operations and support activities.
- Develop self-healing capabilities toward resolution automation.
- Assure ongoing compliance with ELM Audit, GRC, and Risk requirements.
- Assure execution of resolution matrices and escalation procedures in adherence to the incident management process
- Manage the activities and work of subordinates and evaluate their performance by providing formal and informal feedback to ensure that all work within a specific area is carried out in an efficient manner and in accordance with set individual targets
- Motivate subordinates and manage the identification of opportunities for continuous improvement of systems.
Qualifications:
- Bachelor's degree in Information Systems, Computer Science, or a related field
- ITIL certification preferred.
Experience:
- 12+ years of relevant experience in technical operations, including at least 4 years in positions of progressively increasing managerial responsibility in the same observability/OMC/monitoring field
- Proven experience successfully rolling out AIOps.
- Strong hands-on experience implementing the full stack of at least one of the following: ELK Stack, AppDynamics, Dynatrace, or other key monitoring tools
- Strong hands-on experience with configuration management systems (CMS) and CMDB, covering implementation, maintenance, and adoption.
- Strong experience implementing synthetic monitoring, Real User Monitoring, and APM (Application Performance Monitoring)
- Experience in configuration management implementation with one of the well-known tools.
- Experience in modern observability center operations and implementation