Extrusion detection and prevention products are designed to inspect and/or deny network traffic carrying unauthorized content beyond the perimeter of the enterprise. Terms associated with extrusion products include data leak protection (DLP), exfiltration, and intellectual property leakage (IPL). The basic idea is to identify and/or stop sensitive business content -- such as Social Security numbers, credit card numbers, sales data, and the like -- from leaving the network.
But before buying an extrusion detection or prevention product, security professionals must prepare by engaging in a number of technical steps. It's worth noting that a number of non-technical steps -- including creating or reviewing the organization's security policy, identifying and prioritizing business information and systems and formulating attack scenarios -- are just as important and should typically come first. But once these non-technical steps are taken care of, one can turn to technical considerations. The most efficient way to do this is by conducting a network traffic analysis.
Knowing the network
It's important to acquire and maintain a sense of the traffic traversing the network. This sort of situational awareness doesn't need to take place at the per-packet level. Instead, start with statistical data. Open source tools like Darkstat and Ntop can be deployed on stand-alone passive sensors to gather traffic volume statistics, active IP addresses and observed services. For example, one might run Darkstat for 48 hours and notice a lot of traffic from a company host to a machine in Russia. Simply seeing this traffic could indicate a security problem.
The following is an example of output for a specific IP from Darkstat:
|(1-5 of 5)|
The table is empty.
|(1-1 of 1)|
Statistical data is helpful, but it's not granular enough to identify individual connections of interest. To acquire information on hosts conversing on a per-connection basis, I recommend collecting session data. Session data records source IP, destination IP, source port, destination port, protocol and traffic sent by either side of a conversation. Layer 3 switches and routers can export session data in NetFlow and similar formats to open source collectors and analyzer like Flow-tools. Other open source tools like Argus can operate independently, collecting and analyzing session data. The Security Analyst's Network Connection Profiler (SANCP) is integrated into Sguil, an open source suite for network security monitoring.
The following is an example of session data for a conversation exported from SANCP and Sguil. (In Sguil this data is represented in a row format.)
Sensor: cel433 Session
Start Time: 2007-04-20 15:45:35 End
Time: 2007-04-20 15:45:35
184.108.40.206:1031 -> 220.127.116.11:80
Source Packets: 5 Bytes:302
Dest Packets: 5 Bytes:131
Beyond the packet
In addition to statistical and session data, one should be familiar with the process of collecting full content data in order to identify exactly what's represented by a session of interest. Full content data can be collected by many tools. Open source options include Tcpdump, Wireshark/Tshark/Dumpcap, Snort and Daemonlogger. When confronting unencrypted traffic, full content is the only way to identify the information transmitted in a session of interest.
The following is the full content for the session previously demonstrated. It was collected by Snort running in packet-collection mode and reconstructed within Sguil by Tcpflow. P0f provided operating system identification.
Sensor Name: cel433
Timestamp: 2007-04-20 15:45:35
Connection ID: .cel433_5055537005472227539
Src IP: 18.104.22.168 (c-69-143-202-28.hsd1.va.comcast.net)
Dst IP: 22.214.171.124 (vhost.identityvector.com)
Src Port: 1031
Dst Port: 80
OS Fingerprint: 126.96.36.199:1031 - Windows XP SP1+, 2000 SP3
OS Fingerprint: -> 188.8.131.52:80 (distance 2, link: ethernet/modem)
SRC: GET / HTTP/1.1
SRC: Accept: */*
SRC: Accept-Language: en-us
SRC: UA-CPU: x86
SRC: Accept-Encoding: gzip, deflate
SRC: If-Modified-Since: Mon, 08 Jan 2007 04:44:47 GMT
SRC: If-None-Match: "403b6-d5e-16cc4dc0"
SRC: User-Agent: Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 5.1)
SRC: Host: www.taosecurity.com
SRC: Connection: Keep-Alive
DST: HTTP/1.1 304 Not Modified
DST: Date: Fri, 20 Apr 2007 15:45:47 GMT
DST: Server: Apache/2
DST: Connection: close
DST: ETag: "403b6-d5e-16cc4dc0"
As you can see, the connection shows a request for the / or index of www.taosecurity.com.
Conducting a network traffic analysis or forensics exam to reveal statistical, session, and full content data helps security professionals understand their networks, thereby guiding their decision to implement extrusion products. After all it does not make sense to try to select and deploy an extrusion product if an organization doesn't understand the traffic on its network. Only after gaining the ability to recognize the properties of the data traversing the wire does it become possible to be an informed buyer of extrusion tools.
About the author
Richard Bejtlich is an expert on data protection and information leakage. He is the author of The Tao of Network Security Monitoring and Extrusion Detection, and co-author of Real Digital Forensics. He is a frequent speaker and author of TaoSecurity blog.