Wiz
Secure everything you build and run in the cloud
Incident Manager – Public Sector
Location
United States
Posted
2 days ago
Salary
$142.5K - $197K / year
Bachelor Degree7 yrs expEnglishAWSCloudGrafanaKubernetesPrometheusService NowSplunk
Job Description
• Serve as the lead incident coordinator for high-severity events, activating playbooks, declaring incident severity, and coordinating with functional leads to drive a structured response.
• Define, operationalize, and document the end-to-end incident response lifecycle that aligns to FedRAMP High, IL5, and NIST 800-53 requirements.
• Drive readiness activities by designing and facilitating cross-functional tabletop exercises, hands-on simulations exercises, incident response team training, and review of playbooks to validate response protocols.
• Facilitate Root Cause Analysis by leading post-incident reviews using structured methodologies and documentation to separate root causes from contributing factors and drive business-wide corrective actions to closure.
• Serve as the primary liaison between technical and business units by translating incident details into business impact assessments that drive informed decision-making for legal, compliance, and operational teams.
• Bridge technical and operational responses by building communication paths between engineering, operations, legal, compliance, and customer facing teams to translate complex incidents into actionable updates for leadership.
• Establish centralized reporting, dashboards, and KPIs to monitor response efficiency, trend analysis, and program maturity.
• Manage and optimize incident response tools like ServiceNow, PagerDuty, and Jira to ensure.
Job Requirements
- 7+ years of experience leading crisis management and incident response programs in FedRAMP High, IL5, or NIST 800-53 environments.
- Direct experience in managing and leading major incidents
- Direct experience working cloud environments, AWS required (other clouds a plus)
- Experience working with cloud native technologies like containers and container orchestration platforms like Kubernetes.
- Ability to interpret metrics and logs in observability and security event management tools such as Grafana, Prometheus, DataDog, Splunk, etc.
- Experience with incident management platforms such as PagerDuty, ServiceNow, or Jira, including experience building automated notification trees and dashboards.
- Strategic thinking and a risk focused mindset on reliability improvements
- Ability to identify systemic gaps that feed back into program design and operations teams
- Strong writing and documentation skills to effectively communicate with both technical and business audiences
- Ability to maintain composure and exercise sound judgement while navigating high-stake decision making during complex and ambiguous incidents
Benefits
- Medical, dental and vision insurance
- Home Office Setup reimbursement
- Flexible Spending Accounts
- Monthly Connectivity reimbursement
- Employee Assistance Program (EAP)
- Short- and Long-term Disability Insurance
- Life & Accident Insurance
- 401(k) Retirement Savings Plan (with employer match)
- Flexible paid time off + 11 paid holidays
- Paid leave programs, including parental, pregnancy health, medical and bereavement leave