Defending organizations against cyber threats is becoming an increasingly sophisticated science. Detailed analysis of the high profile data breaches at second largest retailer in the US or the more recent bank heist confirm that determined attackers breach into organizations months before they finally carry out their objective and are able to evade detection. Further to this, as a growing number of systems and devices across the world get connected to the Internet of Things, attack surfaces have expanded, leading organizations to examine new ways to protect their assets. There is an ever-growing acceptance among the Information Security Officers (ISOs) to the fact that it is just a matter of time before their organizations experience a significant IT security breach. This paradigm change in the drivers influencing information security strategy of organizations has also influenced the shift in focus from protection against threats to timely detection and effective response to these threats.
Challenges in identifying ‘low and slow’ breaches
Protection and detection security controls from the past decade have primarily used signature based approaches to detect threats. However, this approach has a few downsides: • Controls could only detect previously known threat vectors and actors; • Controls including security information and event management solutions (SIEM) could only review and assimilate information over a short period of time; • Mature SIEM threat detection use-cases reflected the organization’s collective knowledge of previously known (or well researched) threat scenarios. These solutions are ineffective when indicators of compromise of a ‘low and slow’ breach have to be accumulated over a long period.
Leveraging machine learning
Detection of these ‘low and slow’ threats primarily require:
Over the past 18 months, the security controls domain has been abuzz with commercial solutions using machine learning capability to detect these previously unknown threats. Machine learning is a type of artificial intelligence (AI) that provides computers with the ability to learn without being explicitly programmed. Such algorithms are often categorized as being supervised or unsupervised1 .
Supervised algorithms can apply what has been learned in the past, to new data. In the context of threat detection, it manifests in the form of:
For example, by analyzing large numbers of remote access tools RATs, a supervised machine learning model can learn how traffic from these tools differ from normal traffic.
Supervised learning algorithms support detection of threats on ‘day one’ of their use within the organization’s network and do not have an organization specific learning phase. Unsupervised learning models augment their intelligence specific to the customer’s environment. While supervised learning models are useful to provide day one benefits, detection of some threat scenarios can only be learned specific to each customer’s environment.
For example, to determine anonymous employee access behavior. Unsupervised learning algorithms focus on understanding what makes the customer’s IT usage pattern unique and identifies abnormalities when they occur. The risk carried by unsupervised learning algorithms is, that it may learn bad behavior as the baseline if it is exposed to bad usage. In general, this category of algorithms has a higher degree of being associated with false positives.
Adoption of machine learning capabilities
According to me, the selection of machine learning capabilities is underpinned by the following belief:
Given the above belief, the endpoint is the best pivot point to start the journey towards adopting machine learning capabilities for threat detection. The class of solutions generally categorized as Endpoint Threat Detection and Response (ETDR) offers considerable promise. Beyond threat detection, these solutions also provide extensive threat containment and forensics capabilities on the endpoints.
While the ETDR solutions provide a granular visibility to the endpoint threat surface, organizations do have the challenges with the roll-out and lifecycle management of their agents on the endpoints. Further to this, with the disruptions such as BYOD and IoT, the form factor of the endpoint is constantly changing and the agent based approach may not be scalable in-time.
For such organizations, the network provides the best pivot point to detect these ‘low and slow’ threats. It is said that ‘the network never lies’. An interesting category of solution is Network Traffic Analysis (NTA). Through the intelligent use of supervised and unsupervised learning algorithms coupled with host and asset context as well as tracking the threat progression over time, NTA solutions offers a quick way to gauge the organization’s security posture from the perspective of these ‘low and slow’ threats. Given such solutions can be introduced in the organization’s network with least amount of intrusive changes, organizations find it very easy to adopt them.
Finally, pivoting around the usage of user’s access credentials is another approach to leverage machine learning capabilities to detect these ‘low and slow’ threats. This segment of solutions is classified as User and Entity Behavior (UEBA). While it offers an alternate approach towards leveraging machine learning threat detection capabilities; it primarily leverages unsupervised learning algorithms for threat detection. Key use-cases covered by this segment of solution include privileged account usage analytics, insider threats, data exfiltration, account sharing, etc.
Finally…
While the selection of the solution is specific to each organization’s context, the following recommendations can help in the selection of the right solution.
It would be an overstatement to suggest that machine learning can effectively solve any cybersecurity problem. However, machine learning does offer a chance to use the vast amount of data being created by organizations to weed out the ‘low and slow’ threats better than human experts.