The Cutting Edge of Analytics

23 Friday May 2014

Posted by Emma in Forensics and Investigation

Tags

analysis, analytics, banking, casefile, detection, detica, financials, fraud, intelligent, investigation, maltego, netreveal

Over the past month I’ve spent some time looking at intelligent fraud and anomaly detection systems, authoring a journal paper comparing a handful of methods, and more recently focussing my attention on Detica’s systems. Plus I’m working with someone to develop a multi-featured case management system for tracking malware, to save us switching between applications.

But anyway, the technologies for intelligence gathering are actually far beyond what Glenn Greenwald published from the rather outdated Snowden archive, and it’s not simply about the warehousing of intercepted data. Stream mining ‘digests’ the data in real-time, deriving from it information that analytics systems can evaluate and contextualise without human intervention. The analyst works on the end product of this. Another thing worth mentioning about the Snowden/NSA thing is that nobody’s quite sure what’s retained or discarded in the stream mining process.

Analytics was being deployed long before the ‘threat intelligence’ snake oil industry materialised, and it has uses in preventing or mitigating real threats. For example, it has been protecting bank accounts for perhaps the last two decades, which is where my interest in this began. An advanced field-tested detection system would also have prevented victims of identity theft being wrongly associated with Operation Ore between 1999 and 2002. Alert Logic’s own service, again a much-needed system for detecting genuine threats, was built around a core system dating from the late 90s.

I’ve singled out Detica’s NetReveal for reasons that should become obvious, the primary one being that it appears to be the most advanced system I’ve come across after much digging around.

AML Applications
After being first deployed in 2005 as a proof-of-concept system for the Insurance Fraud Bureau, NetReveal was adopted by AXA, Zurich, Nationwide and several other major financial institutions, so it might well be the very system I was looking for when writing up the review paper.
Anti-Money Laundering (AML) is actually just one of the ‘use cases’ for NetReveal, and one application where the capabilities are truly tested – the whole point of money laundering is to get money from one place to another without authorities knowing, typically by disguising transactions among legitimate or routine activities. I’ve seen real-world examples of this in the past: one involved ordering surplus on behalf of an employer, selling it, then pocketing the money. Another involved non-existent employees on the books, all presumably paid into the same bank account created by the fraudster. Of course, this probably continued for years after I resigned, because management only saw payrolls and accounts. They didn’t give minimum-waged employees the time of day, and so didn’t have the situational awareness for spotting the discrepancies.

So the problem NetReveal must solve is quite complex. How can it discern fraudulent transactions from legitimate transactions? How can transactions be associated with seemingly unconnected events? How could a system identify a suspicious event and tie it to a sequence of other events? More importantly, how can the system be made to work in real-time?
What I’ve found is there are two broad categories of fraud detection. First there are the rule-based, signature-based and expert systems – these tend to be static, comparing current transactions with signatures of known fraud cases. While these are fast and efficient, they’re less reliable. Secondly there are deeper analysis methods such as clustering, pattern recognition and Bayesian systems – these are more adaptive and thorough, but computationally more expensive.
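A minimal sketch of the difference between the two categories, assuming made-up rules, thresholds and transaction fields (none of this comes from NetReveal). A simple per-account z-score stands in for the heavier clustering and Bayesian methods, purely to show the contrast between a static check and one that adapts to history:

```python
from statistics import mean, stdev

# Hypothetical static rules: a name plus a predicate over one transaction.
RULES = [
    ("large_amount", lambda t: t["amount"] > 10_000),
    ("blocked_country", lambda t: t["country"] in {"XX"}),
]

def rule_flags(txn):
    """Rule-based check: cheap, but only catches what the rules anticipate."""
    return [name for name, predicate in RULES if predicate(txn)]

def anomaly_score(amount, history):
    """Adaptive check: distance from this account's norm, in standard deviations."""
    if len(history) < 2:
        return 0.0
    sigma = stdev(history)
    if sigma == 0:
        return 0.0
    return abs(amount - mean(history)) / sigma

# An account that normally spends about 22 suddenly spends 900.
history = [20.0, 25.0, 22.0, 19.0, 24.0]
txn = {"amount": 900.0, "country": "GB"}

print("rules fired:", rule_flags(txn))
print("anomaly score:", round(anomaly_score(txn["amount"], history), 1))
```

The transaction passes every static rule yet sits hundreds of standard deviations from the account's usual spending, which is the kind of case the slower, history-aware methods exist to catch.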

So what NetReveal does is take the raw data from whatever sources, categorise it into entities, perform some analysis, construct a relational map and determine the weighting of each link. The latter two stages are possible with Maltego Casefile and Palantir anyway, as a very simple but highly effective method of revealing patterns an intelligence analyst would otherwise miss. The following screenshot is an example from my malware tracking project that would, with a much larger database, be useful in attributing malware and incidents to known ‘actors’:

[Screenshot from the malware tracking project]
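Setting the screenshot aside, the relational map and link weighting can be sketched in a few lines. This is only an illustration: the records and entity labels are invented, and counting co-occurrences is an assumed stand-in for whatever weighting NetReveal, Casefile or Palantir actually use.

```python
from collections import Counter
from itertools import combinations

# Hypothetical records: each set lists entities seen together in one event.
records = [
    {"account:A1", "phone:555-0101", "address:12 High St"},
    {"account:B7", "phone:555-0101"},
    {"account:A1", "address:12 High St"},
    {"account:C3", "address:12 High St"},
]

def build_links(rows):
    """Count how often two entities appear together; the count is the link weight."""
    weights = Counter()
    for entities in rows:
        for a, b in combinations(sorted(entities), 2):
            weights[(a, b)] += 1
    return weights

def neighbours(links, entity):
    """Entities directly linked to the given one, with their weights."""
    found = {}
    for (a, b), weight in links.items():
        if a == entity:
            found[b] = weight
        elif b == entity:
            found[a] = weight
    return found

links = build_links(records)
print(neighbours(links, "address:12 High St"))
```

Here accounts A1 and C3 never appear in the same record, but both hang off the same address, which is the sort of indirect connection a relational map makes visible.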

On the analytics side it appears to be a hybrid of several intelligent fraud detection methods. I won’t reveal the specifics here, as they’re also used by electronic payment systems and have fundamental limitations that aren’t easily resolved. What I can reveal is that Detica was rather ahead of the curve, as the research papers I’ve found that proposed hybrid systems were mostly published after 2009. Alert Logic, an entirely different company also dealing with vast amounts of data, appears to have followed the hybrid model as well.

Modules
From what I can determine, NetReveal is a modular system that can be remixed for whatever customer, with the following three ‘components’:
* Detection Modules
* Analysis Modules
* Investigation Modules

NetReveal also got reworked specifically for ‘cyber threats’, in the form of the CyberReveal product.

Detection modules appear to provide the basic rule-based system that’s highly efficient at handling real-time data. Their function is mainly to flag anything deemed suspicious or matching predefined rules, and they could be used to filter out redundant data from the sources to reduce load on the analytics engine(s).

Analysis modules are essentially a highly advanced form of analytics engine, doing stuff that’s computationally more expensive and time-consuming. They analyse transactions after they have been completed, possibly adapting the detection modules.

Investigation modules appear to provide a glorified search engine and visualisation thing, which is pretty much what you get with Casefile. Whereas information is entered manually for Casefile, Detica’s eye candy presents a higher volume of information from an advanced back-end.

System Call Policies and systrace

18 Tuesday Dec 2012

Posted by Emma in Linux OS


Tags

apparmor, call, detection, function, ids, intrusion, kernel, linux, niels, operating, os, privilege, provos, root, security, selinux, syslog, system, systrace

Recently I’ve been looking at a little-known utility called ‘systrace’, which in theory (I’ll come to that) protects Linux boxes against most exploits and privilege escalation. Basically it controls and logs user-space access to kernel resources. This isn’t to be confused with the Android OS diagnostic tool of that name.

System Calls
The user and kernel spaces need a method of communicating with each other, especially if a program needs to interact with the hardware for memory management, writing to disk, sending data to a network interface, etc., and this is done through ‘system calls’. This is vaguely analogous to the functions or .NET objects a program uses to handle whatever operation. In fact, a function itself can contain a system call, so it becomes a ‘system call wrapper’. I’ve yet to figure out how exactly systrace replaces those in glibc.
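To make the wrapper idea concrete, here is a small sketch using Python's standard `os` module on a POSIX system. The functions below are thin wrappers: each hands its arguments to the C library, which in turn makes the corresponding kernel system call. It says nothing about how systrace itself hooks those calls.

```python
import os

# Ask the kernel for a pipe: two file descriptors, one for each end.
read_fd, write_fd = os.pipe()

# os.write() wraps the write system call and returns the bytes written.
written = os.write(write_fd, b"hello kernel")
os.close(write_fd)

# os.read() wraps the read system call; the data comes back from the kernel.
data = os.read(read_fd, 64)
os.close(read_fd)

print(written, data)
```

Running a script like this under a tracer such as `strace` on Linux is an easy way to see the underlying system calls being made.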

How is this related to security? The kernel, being the operating system, runs at the highest privilege level, which means a malicious program or exploit could also manipulate it through system calls. Malicious code might also find its way into applications the user trusts, since nobody has time to inspect all the code in every application they use. Worse still, the entire application might be run with root privileges, as happens when it’s launched with the sudo command.

systrace as a Solution
Niels Provos, at the University of Michigan, presented a solution to this in his paper ‘Improving Host Security with System Call Policies’. The literature can be a little hard to follow even for an experienced Linux user, and I’ve interpreted it the best I can.
Instead of launching a whole program with root privileges, Provos suggests using ‘system call interposition’ (interception, basically) to control which system calls are allowed, and systrace will permit, deny or ask (the user), depending on whatever policies are set. According to the ONLamp.com page, anything that’s denied by systrace will be recorded in syslog, which I assume is the file in /var/log/syslog.
For performance reasons, the permit and deny rules are enforced in kernel space, while the ask rule causes a user space program to wait for input from the user.
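The permit/deny/ask decision can be modelled roughly as follows. This is a toy model in Python, not systrace's real policy syntax or implementation: the policy table, the `decide` function and the default-deny behaviour for unlisted calls are all assumptions made for illustration.

```python
from fnmatch import fnmatchcase

# Hypothetical policy table: (system call name, argument pattern, action).
POLICY = [
    ("open", "/etc/passwd", "permit"),
    ("open", "/etc/shadow", "deny"),
    ("connect", "*", "ask"),
]

def decide(syscall, argument, ask_user=lambda s, a: "deny", log=None):
    """Return 'permit' or 'deny' for one intercepted call."""
    for name, pattern, action in POLICY:
        if name == syscall and fnmatchcase(argument, pattern):
            if action == "ask":
                # Slow path: execution waits on an answer from a person.
                action = ask_user(syscall, argument)
            break
    else:
        # Assumption: calls that no rule covers are refused.
        action = "deny"
    if action == "deny" and log is not None:
        log.append(f"deny {syscall}({argument})")
    return action

denied = []
print(decide("open", "/etc/passwd", log=denied))
print(decide("open", "/etc/shadow", log=denied))
print(decide("connect", "10.0.0.1:80", ask_user=lambda s, a: "permit", log=denied))
print(denied)
```

The fixed permit and deny entries resolve with a simple table lookup, while the ask entry has to stop and wait for an answer, which fits the split between kernel-space and user-space handling described above.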

Putting all this together, we can see that systrace actually performs three functions:
* Policy enforcement
* Intrusion detection
* Automated privilege elevation

Intrusion Detection
As we’ve seen, anything denied by systrace is logged, and this is where the intrusion detection bit comes in. This has obvious advantages over perimeter-based IDS, as it records actions made locally as well as remotely, and could potentially reveal why those actions were made. The downside is setting this up across multiple hosts could be a pain in the ass, even if the syslog data were somehow aggregated.

File Permissions vs. systrace
Another advantage systrace might have is better access control than UNIX file permissions. Although the latter gives granular control over what files users read, write and execute, it must be done meticulously in order to be totally effective. There are just too many files. This was the primary reason Provos decided to use something that enforced controls at the system call level.

Disadvantages and Vulnerabilities in systrace
But is systrace really worth installing? Updates and changes have been occasional, with the latest made in 2009. It also has vulnerabilities, but even then it would make exploits much harder overall. Perhaps the main reason it’s not commonly used is it’s been superseded by SELinux and AppArmor, which apply Mandatory Access Controls to the kernel itself.

As I understand it, the main vulnerability in systrace is a kind of ‘race condition’ or timing attack, where a malicious program changes a system call’s arguments just after the call is permitted, although Provos did anticipate this in his paper. Whether that’s likely to happen in practice is anyone’s guess, but it’s possible.
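The shape of that race can be shown with a deterministic toy model. This is not systrace code, and every name in it is hypothetical; the `between` callback simply stands in for another thread getting scheduled in the gap between the policy check and the actual use of the argument.

```python
ALLOWED = {"/tmp/report.txt"}

class SharedArg:
    """Stands in for a syscall argument held in memory the caller can rewrite."""
    def __init__(self, path):
        self.path = path

def monitor_check(arg):
    """The policy check reads the argument at check time."""
    return arg.path in ALLOWED

def kernel_use(arg):
    """The call itself reads the argument again, later, at use time."""
    return f"opened {arg.path}"

def checked_call(arg, between=lambda: None):
    """Vulnerable version: check and use both read the shared argument."""
    if not monitor_check(arg):
        return "denied"
    between()  # window in which another thread could run
    return kernel_use(arg)

def checked_call_with_copy(arg, between=lambda: None):
    """Safer version: take one private copy, then check and use that copy."""
    snapshot = SharedArg(arg.path)
    if not monitor_check(snapshot):
        return "denied"
    between()
    return kernel_use(snapshot)

victim = SharedArg("/tmp/report.txt")
swap = lambda: setattr(victim, "path", "/etc/shadow")
print(checked_call(victim, between=swap))

victim = SharedArg("/tmp/report.txt")
print(checked_call_with_copy(victim, between=swap))
```

In the first call the check sees a harmless path but the use sees the swapped one. Copying the argument before checking is a common general mitigation for this class of bug; whether and how systrace does something equivalent is for the paper to say.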

The Krypt
