Researchers mine emails for criminal characteristics

 

Spammer identities to be exposed.

Secret identities of phishers, spammers and online bullies could be exposed by a newly published data-mining technique that provides evidence to the courts.

The method, which was recently accepted to the peer-reviewed journal Digital Investigation, identified patterns in vocabulary, punctuation and spelling to infer the gender, nationality and educational background of a message’s author.

It identified authors with an accuracy of 80 to 90 percent in a study of 100 emails from the now-defunct Enron Corporation, researchers from Canada’s Concordia University reported.

Study author Benjamin Fung said it could help authorities narrow their search for cybercriminals and identify a perpetrator from suspects.

It was based on Fung’s previous work into grouping and analysing emails from the same author to extract identifying patterns and generate a “write-print”.

Write-prints were distinctive identifiers, like fingerprints, and could be used for comparing criminal emails to any writing samples obtained via law enforcement warrants.

Although experts might avoid generating identifiable write-prints, Fung expected most cybercriminals to be prone to subconscious clues such as typographical errors and style.

He said it should be combined with IP tracing to strengthen law enforcement capabilities.

“My method cannot replace an IP address,” he told iTnews, explaining that write-print comparisons would be particularly useful if emails were traced to a location that housed multiple people.

Fung described highly accurate pattern recognition methods such as the Support Vector Machine (pdf) as “black box” methods that relied on multi-dimensional modelling and were too complex to be meaningful in courts of law.

By contrast, the Concordia technique was designed to match sets of data and reasons that could presented to, and understood by, judicial authorities.

“For evidence to be admissible, investigators need to explain how they have reached their conclusions. Our method allows them to do this,” he said.

Researchers will extend write-printing to chat logs and SMS.

Fung hoped it would be used by law enforcement in the “near future”, noting that his research group had worked with Canada’s National Cyber-Forensics and Training Alliance.

Copyright © iTnews.com.au . All rights reserved.


Researchers mine emails for criminal characteristics
"@Res - Yes, and if the author claims "80 to 90 percent" in a selected study, then given the looseness of the description it could mean a much wider margin of error. If anybody was foolish enough ..."
By anonymous
 
 
 
Comments: 2
Res
Mar 10, 2011 8:44 AM
"It identified authors with an accuracy of 80 to 90 percent in a study of 100 emails" - WTF ?

Only 100? how the hell can they say it is accurate, lets see the figures out of 1K or 10K, then if it gets 80-90% accuracy poke your chest out, but out of only 100, that's a joke.
anonymous
Mar 10, 2011 11:54 AM

@Res - Yes, and if the author claims "80 to 90 percent" in a selected study, then given the looseness of the description it could mean a much wider margin of error.

If anybody was foolish enough to try and introduce this as evidence in a court case, it would presumably get thrown out immediately.
Comments have been disabled for this article.
 
 
 
Top Stories
Australia turns to homegrown drones
Debating the finer points of unmanned aerial vehicle design.
 
The New Zealand telco problem
Opinion: Could Telstra save Kiwi telcos?
 
IT price probe to 'name and shame' gougers
Industry ducking the issue, committee claims.
 
Sign up to receive iTnews email bulletins
   FOLLOW US...

Latest VideosSee all videos »

Latest Comments
Polls
Should the Government enact new legislation to protect copyright holders in the digital age?

   |   View results
Yes
  20%
 
No
  80%
TOTAL VOTES: 516

Vote