Problem Manager

Website Chalhoub Careers

Job Description:

The role will be responsible for managing production incidents and outage events as well managing problems within the Group Technology division. The role will provide leadership and coordination across infrastructure, application and partner teams to quickly remediate production issues and reduce mean time to resolution; as well as pushing for active problem records to be addressed and managed effectively so root causes are identified quickly with a plan to eliminate them clearly defined as part of the problem management processes with the Technology Operation and Product teams. Ensures appropriate managerial relationships are established and maintained to build and strengthen trust regarding end-to-end enterprise incident management resolution and enterprise problem management; serves as a focal point for escalation of issues to be resolved and for problems to be addressed. Facilitates ITIL standards adherence.

Job Responsibilities:

  • Review incident, outage and problem processes, identify trends and recommend improvements
  • Make recommendations for resolution and improvements to mitigate risk and prevent the replication of problems across systems
  • Providing incident resolution status as requested.
  • Validating incident severity if required, or assisting with correcting invalid incident severity.
  • Ensuring the quality and accuracy of incident information, as appropriate.
  • Process Review for Incident/ Problem Management and implement enhancements and document process.
  • Perform other related duties as required and assigned
  • Identifying and resolving Service Desk incident assignment issues.
  • Managing exceptions of rejected incident records at a Service Delivery level.
  • Resolving day-to-day incident coordination actions for Service Delivery.
  • Incident Management Acting as a Service Delivery escalation point for day-to-day Incident Management process issues.
  • Monitoring unassigned and reassigned incidents and taking action if appropriate.
  • Handling day-to-day incident issues and escalating the Incident Resolver Groups as required to bring the resolution of the incidents back on schedule.
  • Assisting in reassignment of misdirected incidents.
  • Providing incident resolution status as requested.
  • Create and review incident and problem management reports and identify action plans to improve key performance indicators as necessary
  • Introduces key ITIL disciplines and practical project management techniques to ensure effective end to end problem management
  • Ensure proper usage of incident, outage, problem and change management systems and processes
  • Perform quality assurance on completed incident, outage, problem investigations and change management records
  • Conduct Root Cause Analysis (RCA), Port Mortem and Problem Management meetings
  • Ensure that root-cause is established for all major incidents and that a formal RCA is published within agreed SLAs
  • Define reporting requirements needed in the management of the incident, outage and problem management processes

Job Requirements:

  • Experience managing 24/7 Application, Infrastructure and/or Operation teams preferred
  • Experience supporting Application and Infrastructure in AWS preferred
  • Strong business acumen and ability to interface with executive management
  • Must be able to work in fast paced environment.
  • ITIL framework certification / ITIL v3 foundation certified
  • Ability to manage an incident/outage bridge with 50+ technical and business stakeholders
  • Ability to manage competing priorities and operate under pressure
  • Ability to adjust schedule based on business need
  • Ability to be proactive, takes action and anticipates opportunities
  • Ability to guide and assist in technical troubleshooting during an incident/outage
  • Excellent management, interpersonal, communication, presentation, and organizational skills
  • The ability to lead cross functional teams effectively at all levels of the organization
  • Coordination skills: managing (complex) IT technical investigations
  • Competent in defining, documenting and managing procedures and processes
  • Advanced knowledge of incident, outage, problem and change management
  • Adaptability to demanding circumstances that require timely and accurate responses
  • Strong analytical, multitasking and prioritization skills
  • Strong collaboration and partnering skills
  • Excellent verbal and written communication skills with the ability to articulate complex ideas in easy-to-understand business terms to senior leaders

Job Details:

Company: Chalhoub Careers

Vacancy Type: Full Time

Job Location: Dubai, UAE

Application Deadline: N/A

To apply for this job please visit

 Report Job
Back to top button