• Job code: QR8265
  • Management/Consultancy/Analysis/Coaching

Incident Manager (Testing)

Amsterdam area

For our client we are looking for an Incident Manager (Testing).


As Incident Manager within the IT department, you are responsible for the rapid resolution of major IT incidents, mitigating the impact on customers and/or the business. In addition, you develop dashboards and/or automation to make the processes easier and faster. Thanks to your input, everyone in the department is aware of the up-to-date processes, and you ensure timely communication when something is broken. You also ensure that these processes are supported by tools and technology that you can develop yourself.


In addition to the primary role as command manager, candidate one will also focus on Chaos Engineering:

- Chaos Engineering is a subject we want to develop further. The ideal candidate has knowledge of or experience with Chaos Engineering and wants to share their enthusiasm with the rest of the department.


As an Incident Manager, you make your presence felt by:

- Continuously improving the Incident Command processes and spreading this knowledge by providing training, with the aim of making each incident a learning moment so that it does not recur or can be solved even faster.

- Being able to independently manage major incidents and, together with the resolution teams involved, ensure minimal impact for customers and the business.

- Initiating continuous improvements that solve incidents structurally or enable us to identify incidents before any impact occurs.

- Running disaster recovery drills and implementing chaos testing.


The client works with the latest technologies and innovative solutions such as Kubernetes, CI/CD, GitHub Actions, etc. The entire platform runs in Azure and is a state-of-the-art, scalable platform. There is an open development culture with room for your ideas: you get ownership, and we expect you to take initiative. You are truly at the center of the web and deal with different departments and levels within the organization.


You will be part of the Enabling cluster. We work daily on the web, the iOS/Android app and more. Within the Enabling cluster, we work on various matters to ensure that product teams can focus primarily on delivering customer value. Our focus is mainly on automation. The team makes sure that if something breaks, we quickly fix it together. We hate it when customers or our own business are affected by a disruption. There is a no-blame culture: we roll up our sleeves and fix the problem. Afterwards, we look back together to see where we can improve.


To maximize your impact for customers, it is important that you:

- Have a technical higher professional education or master's degree, preferably in Information Technology.

- Have about 4-6 years of relevant work experience in which you have proven your added value in roles such as Incident Manager, Incident Commander or DevOps Engineer within a complex technology environment.

- Have deep knowledge of Incident management (ITIL).

- Have experience with Grafana, Chaos Monkey, Gremlin, LitmusChaos, Azure and Chaos Engineering principles.

- Preferably have knowledge of OpsGenie and ServiceNow.

- Are used to working with time-critical deadlines and are able to evaluate and improve processes.

- Have great communication skills in English and/or Dutch.
