1. Internal Knowledge Base
  2. Internal Process and Playbooks

Major Incident Communication Playbook

Use this process to provide communication to customers when a product incident/issue is affecting a subset or the entire customerbase

There are two primary ways iDonate may identify an incident:

  • Internal monitoring alerts of outage (Engineering)
  • Multiple concurrent customer reports (Customer Support or Success)

Workflow

  • Notify teams in #support_tech Slack channel
  • Support and Success - Create a ticket for each customer report and use the templates below to update customer as the issue progresses
  • File JIRA ticket including the following:
    • Customer(s) impacted
    • Start time of incident
    • Steps taken prior to issue/error (steps to duplicate)
    • Symptoms and/or errors
  • Link all related customer tickets to JIRA ticket
  • Support - All tickets should receive regular updates on the status of the issue and should be notified that the issue is resolved as soon as it is confirmed with Engineering. Responses to these tickets should be considered top priority.
  • Engineering - What expectations should we set for timing on RCAs?

Templates for Pendo

-------------------------------------------------------
:rotating_light: Service Status Update: Issue Under Investigation :rotating_light:
We're currently investigating an issue that may be impacting some of our services. Our team is working diligently to identify the cause and resolve it as quickly as possible.Estimated Resolution Time: Unknown
Symptoms: <Insert error(s) or other>We apologize for any inconvenience this may cause and appreciate your patience as we work to address this issue.

----------------------------------------------------------
:rotating_light: Service Status Update: Issue Identified - Fix in Progress :rotating_light:
We've identified the issue that may be impacting your experience with iDonate. Our team is actively working on implementing the fix to resolve this issue as quickly as possible.Estimated Resolution Time: <Insert ETA>
Symptoms: <Insert error(s) or other>We apologize for any inconvenience this may cause and appreciate your patience and understanding.

-------------------------------------------------------------

NOTE: This final notification should have an expiration time set for 1 hour from posting

:rotating_light: Service Status Update: Issue Resolved :rotating_light:
Great news! The issue has been resolved, and our services are back to normal. Thank you for your patience and understanding while we worked to address this issue. If you have any further questions or concerns, please don't hesitate to reach out to our support team.