Q: Does time to restore start from the point the fault was introduced or the point the fault is first noticed?
A: It starts from the point where the incident is created (in your incident management system). In future, we may extend the functionality to capture additional/alternative fields (e.g. an ‘incident start’ event).
Q: What happens if I update the incident start time after the incident is created?
A: That is not currently supported in our calculations. But let us know if this is important to you.
Q: Does time to restore stop at the point the fix is deployed into production?
A: No. time to restore stops only once the incident is declared resolved. This can be after manual checking, after the production deployment including the fix.
Q: What do you mean by “resolved”?
A: The lifecycle of an incident generally follows the lifecycle:
- Incident Created - an incident is identified and recorded in the system
- Incident Acknowledged - the incident is acknowledged by support staff who start actively working on the problem
- Incident Resolved - the support team have worked to stem the bleeding and restore service to users. There may still be underlying issues to be resolved (i.e. a workaround is in place).
- Incident Closed - the incident post mortem process has been completed and countermeasures are in place to prevent future incidents.
Q: What are the types of incident are included in time to restore?
A: This is up to you, but we count the following examples as incidents:
- Production infrastructure incidents
- Service interruptions
- Bugs or faults introduced to production through a change