Incident response (IR) is the process by which an organization handles a data breach or cyberattack. It is an effort to quickly identify an attack, minimize its effects, contain damage, and remediate the cause to reduce the risk of future incidents.
There are several approaches to building a structured IR process:
- NIST 4-step process: According to the National Institute of Standards and Technology (NIST), incident response has four steps: preparation; detection and analysis; containment, eradication and recovery; and post-incident activity.
- SANS 6-phase process: According to the SANS Institute, incident response should have six steps: preparation; identification; containment; eradication; recovery; and lessons learned.
- Some cybersecurity experts propose a 7-step process with an additional step: ongoing re-testing and training to ensure organizational readiness.
This is part of an extensive series of guides about data security.
What Is an Incident Response Plan (IRP)?
An incident response plan is a set of documented procedures detailing the steps that should be taken in each phase of incident response. It should include guidelines for roles and responsibilities, communication plans, and standardized response protocols.
Within your plan it is important to use clear language and define any ambiguous terms. One set of terms that are frequently confused is event, alert, and incident. When using these terms in your plan, it can help to restrict use as follows:
- Event—a change in system settings, status, or communication. Examples include server requests, permissions update, or the deletion of data.
- Alert—a notification triggered by an event. Alerts can warn of suspicious events or of normal events that need your attention. For example,the use of an unused port vs storage resources running low.
- Incident—an event that puts your system at risk. For example, theft of credentials or installation of malware.
Learn more in our in-depth guide about incident response planning.
8 Types of Security Incidents
There are many types of cybersecurity incidents that could result in intrusions on an organization’s network:
- Unauthorized Attempts to Access Systems or Data: Occurs when an individual or group attempts to gain unauthorized access to an organization’s systems or data. Examples include hacking attempts, brute force attacks, and social engineering.
- Privilege Escalation Attack: Occurs when an attacker is able to gain access to a system with limited privileges and then uses that access to gain higher-level privileges. This can be done by exploiting vulnerabilities in the system or using stolen credentials.
- Insider Threat: Occurs when a current or former employee, contractor, or other insider uses their access to an organization’s systems or data for malicious purposes. Examples include stealing sensitive information or sabotaging systems.
- Phishing Attack: Occurs when an attacker sends an email or message that appears to be from a legitimate source, but is actually a trap to steal sensitive information or spread malware.
- Malware Attack: Occurs when an attacker uses malware, such as a virus or Trojan horse, to gain access to an organization’s systems or data or perform other malicious activities. Different types of malware can perform different activities. For example, ransomware can prevent access to data until a ransom has been paid.
- Denial-of-Service (DoS) Attack: Occurs when an attacker floods a system or network with traffic, causing it to become unavailable to legitimate users.
- Man-in-the-Middle (MitM) Attack: Occurs when an attacker intercepts and alters communications between two parties. The attacker can steal sensitive information or spread malware this way.
- Advanced Persistent Threat (APT): A sophisticated and targeted attack designed to gain access to an organization’s systems or data, often with the goal of stealing sensitive information or maintaining a long-term presence.
Learn more in our detailed guide to security stacks.
In my experience, here are tips that can help you better adapt to the topic of incident response (IR):
- Prioritize visibility across endpoints and network traffic Integrating endpoint detection and response (EDR) with network traffic analysis allows quicker identification of lateral movement and command-and-control (C2) channels used by attackers.
- Segment incident response by incident type Create specialized IR playbooks for different attack vectors (e.g., ransomware, insider threat, DDoS). This ensures a targeted and faster response, optimizing containment and remediation steps.
- Incorporate behavioral analytics Enhance incident detection by implementing user and entity behavior analytics (UEBA). This identifies unusual behaviors that may not trigger traditional alerts but could signal insider threats or APTs.
- Automate response to low-risk alerts Use automation to handle routine tasks like isolating compromised endpoints, revoking credentials, or blocking IP addresses. This allows human analysts to focus on more complex incidents requiring manual intervention.
- Develop post-incident monitoring protocols Ensure long-term monitoring of the systems involved in an incident, even after recovery. Attackers may attempt re-entry through backdoors or compromised accounts that escaped initial detection.
Eyal Gruner is the Co-Founder and CEO of Cynet. He is also Co-Founder and former CEO of BugSec, Israel’s leading cyber consultancy, and Versafe, acquired by F5 Networks. Gruner began his career at age 15 by hacking into his bank’s ATM to show the weakness of their security and has been recognized in Google’s security Hall of Fame.
Incident Response Steps: 6 Phases of the Incident Response Lifecycle (According to SANS)
In the introduction to this article we discussed two main options for an IR process, the NIST incident response process with four steps and the SANS incident response process with six phases.
According to SANS, there are six phases to incident response. These six steps occur in a cycle each time an incident occurs. The steps are:
- Preparation of systems and procedures
- Identification of incidents
- Containment of attackers and incident activity
- Eradication of attackers and re-entry options
- Recovery from incidents, including restoration of systems
- Lessons learned and application of feedback to the next round of preparation
Learn more in our detailed guide to incident response policy.
Preparation
During your first preparation phase, you review existing security measures and policies to determine effectiveness. This involves performing a risk assessment to determine what vulnerabilities currently exist and the priority of your assets. Information is then applied to prioritizing responses for incident types. It is also used, if possible, to reconfigure systems to cover vulnerabilities and focus protection on high-priority assets.
This phase is where you refine existing policies and procedures or write new ones if you are lacking. These procedures include a communication plan and assignment of roles and responsibilities during an incident.
Identification of threats
Using the tools and procedures determined in the preparation phase, teams work to detect and identify any suspicious activity. When an incident is detected, team members need to work to identify the nature of the attack, its source, and the goals of the attacker.
During identification, any evidence collected needs to be protected and retained for later in-depth analysis. Responders should document all steps taken and evidence found, including all details. This can help you more effectively prosecute if an attacker is identified.
During this phase, after an incident is confirmed, communication plans are also typically initiated. These plans inform security members, stakeholders, authorities, legal counsel, and eventually users of the incident and what steps need to be taken.
Learn more in our detailed guide to incident response analyst.
Containment of threats
After an incident is identified, containment methods are determined and enacted. The goal is to advance to this stage as quickly as possible to minimize the amount of damage caused.
Containment is often accomplished in sub-phases:
- Short term containment—immediate threats are isolated in place. For example, the area of your network that an attacker is currently in may be segmented off. Or, a server that is infected may be taken offline and traffic redirected to a failover.
- Long term containment—additional access controls are applied to unaffected systems. Meanwhile, clean, patched versions of systems and resources are created and prepared for the recovery phase.
Elimination of threats
During and after containment, the full extent of an attack is made visible. Once teams are aware of all affected systems and resources, they can begin ejecting attackers and eliminating malware from systems. This phase continues until all traces of the attack are removed. In some cases, this may require taking systems off-line so assets can be replaced with clean versions in recovery.
Recovery and restoration
In this phase, teams bring updated replacement systems online. Ideally, systems can be restored without loss of data but this isn’t always possible.
In the latter case, teams must determine when the last clean copy of data was created and restore from it. The recovery phase typically extends for a while as it also includes monitoring systems for a while after an incident to ensure that attackers don’t return.
Feedback and refinement
The lessons learned phase is one in which your team reviews what steps were taken during a response. Members should address what went well, what didn’t, and make suggestions for future improvements. Any incomplete documentation should also be wrapped up in this phase.
Learn more in our detailed guide to incident response certification.
Incident Response Plan Templates
Instead of building your IRP from scratch, you can save time by starting from an IRP template. The following templates are free and are good options to consider.
Provider |
# of Pages |
Key Content |
Download Link |
Berkeley University |
7 |
- Overview and definitions
- Contacts
- Incident response processes
|
Website |
SANS Institute / Patrick Kral |
19 |
- Preparing for security incidents
- Identifying incidents
- Containing cybersecurity threats
- Eradicating threats
- Recovery
- Lessons learned
|
.PDF file |
TechTarget / Paul Kirvan |
14 |
- Plan guidelines and planning scenarios
- Suggested actions and activities
- Notification, escalation and communication processes
- IR checklists
- IR documentation forms
|
.DOC file |
Delinea |
19 |
- Team roles and responsibilities
- Incident classification guidelines
- Legal and compliance and guidelines
- Phases and steps to be taken
|
.DOC file
(requires registration) |
California Government Department of Technology |
4 |
- 17-step IR procedure
- Type-specific guidelines
|
.DOC file |
I-Sight |
6 |
- Plan purpose and scope
- Incident definitions and examples
- Team responsibilities and roles
- IR stages and procedures
|
.DOC file
(requires registration) |
Learn more in our in-depth guide about incident response templates.
Incident Response Frameworks
Incident response frameworks are developed to help organizations create standardized response plans. These frameworks are typically developed by large organizations with a significant amount of security expertise and experience. Two of the best known of these frameworks are those developed by NIST and SANS.
The NIST Incident Response Framework
The National Institute of Standards and Technology (NIST) is a U.S. government agency dedicated to advancements in technology. As part of their cybersecurity efforts, they developed the NIST incident response framework. This framework is comprehensive, including details of how to create an IRP, an incident response team, a communication plan, and training scenarios.
This framework has four official steps which condense the 6 phases of incident response into the following:
- Preparation
- Detection and Analysis
- Containment, Eradication, and Recovery
- Post-Incident Activity
The reason for this condensation is that NIST believes that containment, eradication, and recovery are all overlapping phases. For example, as you contain threats within your systems, you should not wait to eradicate issues until all threats are found. Rather, you should contain and eliminate threats as soon as possible, even if other threats remain.
Likewise, recovery is not a strict step, rather a process that depends on the priority and content of the assets being recovered. For example, you may choose to hold off on recovering high priority assets until an attack is fully eliminated to keep your data more secure.
Learn more in our in-depth guide about NIST Incident Response.
The SANS Incident Response Framework
SysAdmin, Audit, Network, and Security (SANS) is a private organization that works to cooperatively research and educate the public on security issues. One of their major contributions to cybersecurity is the SANS incident response framework.
The SANS framework includes the six phases individually, calling the phases:
- Preparation
- Identification
- Containment
- Eradication
- Recovery
- Lessons Learned
Inside the SANS framework, are basic descriptions of the phases. SANS also includes an IR checklist for each phase and two templates with useful system commands for the preparation and identification phases. These templates are available for Windows and UNIX systems.
Learn more in our in-depth guide about Incident Response SANS.
What Does an Incident Response Team Do?
An incident response team is a team responsible for enacting your IRP. This team is sometimes also referred to as a computer security incident response team (CSIRT), cyber incident response team (CIRT), or a computer emergency response team (CERT).
The key duties of your CSIRT are to prevent, manage, and respond to security incidents. This can involve researching threats, developing policies and procedures, and training end users in cybersecurity best practices.
Learn more in our detailed guide to incident response management.
Building a CSIRT in Your Organization
How well you build your CSIRT plays a major role in how effective your incident response efforts are. If you are unable to fill all of the necessary roles and responsibilities, your response will have gaps that can lead to more damage and longer attacks. To avoid this, you should consider developing your team with the help of the NIST guidelines.
Incident Response Team Models
According to the NIST framework, there are three different models of CSIRT you can apply:
- Central—the team consists of a centralized body that manages IR for the whole organization.
- Distributed—multiple teams exist and coordinate efforts as needed. Typically, each team is responsible for a specific part of the IT infrastructure, physical location, or department.
- Coordinated—a central team serves as a command center or knowledge base for distributed teams. Central teams often take care of system monitoring and can alert and assist distributed teams as needed.
Selecting a Team Model
Knowing which model is best for your organization can be a challenge. To help you decide, you can again refer to the NIST guidelines which provide some considerations to help:
- What availability do you need?—you need to decide whether you want to have 24/7 response availability and what level of availability. For example, is it enough that teams can respond remotely or do they need to be on-site. Preferably your team is available in real-time and in-person.
- What level of staffing do you want?—you should decide whether you want full-time staff dedicated to your team or whether shifts of part-time staff are sufficient. Part-time staff are best for boosting team response during an incident. Full-time staff are best for ensuring that your response is organized, consistent, and immediate.
- How much expertise is needed?—the more expertise you have on your team, the more effective it can be. However, many organizations do not have a high level of security expertise in-house. If this is the case, you may want to have external experts available to assist your in-house team during response activities.
- What is your budget?—your IR budget plays a large role in limiting the above aspects. When putting your team together, you need to be realistic about the budget that is needed and how money is best allocated.
Learn more in our detailed guide to csirt.
What are Incident Response Services?
Incident response (IR) services are managed services that can replace or supplement in-house teams. These services usually work on retainer with a monthly cost and a set range of services. The benefit of these services is that they typically offer a higher level of expertise than is available in-house and can provide 24/7 monitoring and response. This service usually includes a service level agreement (SLA) ensuring confidentiality and response.
Additional benefits of managed services include:
- Incident response preparation and planning—services can help you review IT systems and develop IRPs suited to your specific needs.
- Incident triage and classification—services can monitor for security events, identify incidents, and classify threats.
- Initial response—services can perform initial response steps or even come on-site to assist in-house responders.
- Post-breach assessment—services can help teams perform root-cause analyses and provide evaluations of response efforts and effectiveness.
Learn more in our in-depth guide about incident response services.
Need an incident response provider?
Cynet is a trusted partner that analyses network and endpoint data, raises alerts, and protects against a wide range of known and zero-day threats. Cynet provides CyOps, an outsourced incident response team on call 24/7/365 to respond to critical incidents quickly and effectively. Cynet can deploy its powerful extended endpoint detection and response (XDR) system across thousands of endpoints in up to two hours to effectively mitigate threats across an enterprise.
Incident Response Automation
Effective incident response is time-sensitive and relies on teams quickly identifying threats and initiating IRPs. Unfortunately, most teams are not capable of investing all alerts in real-time to determine if something is an incident. This can lead to incidents being missed entirely or only being caught after significant damage has occurred.
Automating parts of your incident response can help avoid this oversight or delay. It can be used to:
- Quickly triage alerts and identify incidents
- Compile and centralize relevant data for incident investigations
- Perform incident response tasks and processes, such as isolating affected areas or blocking IP addresses
Incident Response Playbooks
When automating IR, a common method you can use is to create playbooks. Playbooks are essentially scripts that team members or security solutions can follow or initiate. These scripts define response steps to be taken and instruct responders, systems, or solutions to perform the defined actions.
Playbooks can be used for:
- Manual incident response processes—playbooks define steps to be taken, including which tools should be used, which processes performed, and who is responsible for performing those actions. These playbooks can be printed or electronic and are generally specific to incident type.
- Automated incident response processes—playbooks are programmatic scripts that integrate with relevant systems and tools. When alerts are triggered or incidents are identified, the system or tool can initiate the script, automatically performing the predefined actions.
If you have manual playbooks, you can often easily transform the contained steps into automated processes. Depending on the programming knowledge of your responders, you can also use automation playbooks as backup manual playbooks as needed.
How to create an incident response playbook
When creating an incident response playbook, it should contain the following components:
- Initiating condition—the event that triggers the playbook to run. This can be an alert, an incident identification threshold, or some other event.
- Mandatory steps—the actual steps and processes to be taken. These typically include triage, analysis, containment, and removal actions.
- End state—the terminating event for the playbook. This is defined by your playbook goal. For example, resetting passwords and permissions.
Learn more in our in-depth guide about incident response playbooks.
In addition to playbooks, you can also employ IR platforms. These platforms are software that you can use to guide, assist, and automate your response efforts. Platforms are often comprehensive and can integrate with your existing systems.
Common features of IR platforms include:
Analyst support |
Intelligence and analytics |
Security automation |
- Knowledgebase of regulations, response plans, and contacts
- Automatic escalation and assignment of alerts
- SLA tracking
- Compliance and breach reporting
|
- Integration with SIEMs and other monitoring tools
- Analysis and correlation of event timelines
- Real-time analysis of attack behaviors
- Forensic data retention and querying
|
- Pre-configured IR playbooks
- Support for customizable playbooks
- Automatic isolation compromised systems or user accounts
- Automatic remediation
|
Learn more in our in-depth guide about incident response platforms.
Automated Incident Response With Cynet
Cynet provides a holistic solution for cybersecurity, including the Cynet Response Orchestration which can automate your incident response policy. Users can define automated playbooks, with pre-set or custom remediation actions for multiple attack scenarios. Cynet automated playbooks also help detect threats to ensure that you only implement a manual response when it is necessary.
Cynet Response Orchestration can address any threat that involves infected endpoints, malicious processes or files, attacker-controlled network traffic, or compromised user accounts.
Incident Response Questions and Answers
What is Cyber Incident Response?
Cybersecurity incidents can quickly escalate into business crises, leading to financial loss, legal consequences, service disruption and damage to reputation and customer trust.
A well-managed CSIRT (Cyber Security Incident Response Team) is a key component of an information security program. The team can include full time security employees, part-time help from other departments, and external security providers using models like managed detection and response (MDR).
The CSIRT or external incident response provider is tasked with identifying real security incidents, rapidly investigating them, and responding to contain the threat, eradicate it, and ensure speedy recovery of organizational systems.
What Does an Incident Response Team Do?
An incident response team consists of a group of IT professionals who respond to suspected security incidents. The work of the incident response team includes developing an active incident response plan, system vulnerability testing and remediation, and support for all incident management activities performed across the organization. Members of the incident response team can include Level 1, 2, and 3 security analysts, security engineers, and operations specialists.
Why Do you Need an Incident Response Plan?
It is important for companies to detect and effectively respond to security incidents, to protect their reputation and prevent tangible and intangible losses. Organizations should develop an incident response plan that covers any security incident—large or small—to prevent incidents from becoming security breaches.
An incident response strategy and plan summarizes the organization’s main security risks, the process for detecting a security incident, the roles and responsibilities of the security team, and the tools needed to manage an incident. A key part of the plan is the steps needed to resolve security incidents, restore systems to normal operations, investigate the root cause and communicate the event to all concerned parties.
What is the Incident Response Cycle?
The incident response cycle is a structured process organizations follow to respond to security incidents, and continuously improve their incident management process. The process includes:
- Proactive preparation for security incidents
- Detection and analysis of a possible security incident
- Containment of the threat and remediation of affected systems
- Recovering systems to normal operations
- Post-incident investigation to identify root causes and threat actors
- Lessons learned to improve the process for the next incident
What are Some Common Causes of Incident Response Problems?
Common causes of incident response problems include:
- Missing event context, which requires time-consuming manual investigation.
- Lack of effective prioritization, which wastes time for security analysts.
- Lack of communication tools, enabling analysts to easily report and escalate the incident to others.
- Lack of opportunities for effective cooperation between security teams and operations, development, or other departments.
- Lack of visibility of security incidents to other parts of the organization, such as senior leadership, legal teams, PR teams, customers and shareholders.
See Our Additional Guides on Key Data Security Topics
Together with our content partners, we have authored in-depth guides on several other topics that can also be useful as you explore the world of data security.
Authored by Cynet
Authored by Cynet
Authored by Cloudian