Internet.com ISP-Planet
Search ISP-Planet


Search internet.com
internet.com

IT
Developer
Internet News
Small Business
Personal Technology
International

Search internet.com
Advertise
Corporate Info
Newsletters
Tech Jobs
E-mail Offers

internet.commerce
Partner With Us














ISP Equipment

E-Mail

Pearl of an Anti-Spam Program

ActiveState's PureMessage application uses probability modeling to determine whether a particular e-mail message is or is not spam. Savvy Perl programmers like the flexible nature of this product rooted in open source projects.

by Alex Goldman
ISP-Planet Associate Editor
[November 8, 2002]
Email a colleague

Founded in 1997, ActiveState started out by building what Chris Kraft, ActiveState director of product management describes as "quality assured open source distributions." The company used basic programming tools such as Perl, Python, and Tcl, but branched out into building developer tools for open source languages. Today, ActiveState claims to have built working relationships with over 4 million developers through sales channels and online forums.

ActiveState developers decided that Perl was the best open source text processor, so naturally it would be a good foundation for an anti-spam application. The company launched PerlMX over a year ago, and sought out enterprise-class clients as its primary target. Two months ago, ISPs starting taking an interest in ActiveState's anti-spam applications. As a result, the company is currently working with service providers to make its anti-spam solution more attractive to ISPs.

Set to debut in mid-November, ActiveState's PureMessage application uses probability modeling to determine whether a particular e-mail message is or is not spam. The application incorporates other open source anti-spam initiatives including blacklists and SpamAssassin. Some day, ActiveState hopes to incorporate the SIEVE anti-spam language (IETF RFC 3028) into the application.

The program can recognize basic spam techniques, such as dictionary attacks. It also performs reverse domain name server (DNS) lookups on incoming messages. The program examines e-mail messages for spam characteristics, which include:

  • Keywords or phrases such as "amazing" or "casino"
  • Specific HTML style tags such as a colored background, forms and iframes
  • Complex spam patterns such as
    (?:You (?:were sent|have received|are receiving)|You're receiving).{0,15}(?:message|e-?mail)s? because - if you (?:(?:want|wish|care|prefer) not to |(?:don't|do not) (?:want|wish|care) to )(?:be contacted again|receive (any)?\s*(?:more|future|further) (?:e?-?mail|messages?|offers|solicitations))
  • Text formats, such as all caps, spaces between all caps, and multiple exclamation points

The application completes a probability assessment that determines whether the e-mail is unsolicited based on the percentage of spam-like characteristics the message exhibits. Next, it reviews settings to determine whether the e-mail is rejected or sent through to the end user. ActiveState claims its anti-spam product rejects 98 percent of all unsolicited e-mail.

A special policy enforcement bundle is capable of forwarding questionable e-mail to a quarantine box, and can also enforce corporate e-mail policies or append a legal disclaimer to every outgoing e-mail. Through an OEM agreement with McAfee, the company provides McAfee anti-virus to PureMessage users.

Hugh Messenger, senior network administrator at Andalusia, Ala.-based AlaWeb, said the key advantage of PureMessage is its adaptability.

"Before we adopted PureMessage we used Lyris MailShield, which is a very good product, but it required a lot of hands on configuration and updating," Messenger said. "We like PureMessage because we don't have to keep up with who's doing the spamming. As long as we have the latest package, it identifies the junk by itself."

Like most system administrators, Messenger said he takes spam very seriously. But ISPs have to balance anti-spam technology with the demands of different users.

"We've got two vocal classes of users. One group does not want us to touch their mail, and another group assumes that all spam they get is from us," Messenger said. "We're an ISP, not a big corporation, and we cannot make sweeping policy decisions about what mail people can and cannot receive. In a corporation you can install a filter and announce that it's company policy, but when people are paying to get e-mail, you need to listen to their needs."

In order to respond to customer's needs, Messenger set up a special Web page that allows them to determine how the spam filter is used. He said building the form and adding the opt-out feature was relatively easy.

"It's 317 lines of Perl. Developing it was not too difficult because we already had set up a Web page for customers to manage their accounts," Messenger said. "It's not a feature in the current version of the product, although they may add it in future versions."

AlaWeb customers are allowed to choose from three options: opt out of e-mail filtering entirely, accept all filtering, or ask that each filtered e-mail be forwarded with an altered subject line. Messenger said that the altered subject line shows the spam probability percentage that the filter assigned to that particular e-mail.

"We'll just prepend something like 'Spam 87%' in the subject line," Messenger said. The company calls this feature "subject striping."

When a customer complains about not receiving desired e-mail messages, such as a newsletter subscription or other bulk notices, they can add the newsletter to a whitelist and allow the messages to reach its target. But Messenger advises that novice Perl programmers precede with caution.

"Perl is a powerful language," Messenger said. "You can get into trouble with any language that makes it easy to do cool things. Nobody should ever work on live servers."

Messenger is monitoring the success of the program with his own e-mail account.

"I get about 100 spam messages per day, and perhaps 6 get through," he said. "I've had only one or two false positives in six months. I use the subject striping option."

AlaWeb uses a pair of Linux boxes, each a dual processor 1.2 GHz box with 3 GB of RAM, although Messenger is planning to upgrade to 4GB of RAM. He estimates that AlaWeb receives 600,000 e-mail messages each day, of which 40 percent are spam.

"Six months ago, we saw what we feared was the elbow of a massive spam curve," Messenger said. "We were right. E-mail traffic doubled, and the percentage that is spam rose from about 25 percent to 40 percent or even 50 percent."

Messenger is glad to have been prepared for the surge in junk e-mail. He said the best benefit, however, was the positive responses from users.

"We got many thank you notes from users when we switched to the new product, even from people who did not yet know what we'd changed," Messenger said.

Pricing and availability
The product is available now. Pricing is based on a one-time, per server license fee, making it potentially very appealing for use in an ISP environment.

—End

Related articles:
  [Oct. 4, 2002] Making Spammers Pay
  [Jan. 29, 2001] Windows 2000: CGI Scripts
  [Nov. 29, 2000] Extended Authentication
ISP News
IDC: Microsoft's Yahoo Deal Could be a Big Hit
Ballmer Fills in 'Software-Plus-Services' Plan
Report: Enterprise Search Will Top $1 Billion by 2010

More >


ISP Glossary
Find an ISP Term

Newsletters!
ISP-Planet Weekly


Best of ISP-Planet

 

Feedback


Advertising inquiry? Click here!

ISP-Planet's RSS feed



JupiterOnlineMedia

internet.comearthweb.comDevx.commediabistro.comGraphics.com

Search:

Jupitermedia Corporation has two divisions: Jupiterimages and JupiterOnlineMedia

Jupitermedia Corporate Info


Legal Notices, Licensing, Reprints, & Permissions, Privacy Policy.

Advertise | Newsletters | Tech Jobs | Shopping | E-mail Offers

Solutions
Whitepapers and eBooks
Intel PDF: Virtualization Delivers Data Center Efficiency
Intel eBook: Managing the Evolving Data Center
Microsoft Article: BitLocker Brings Encryption to Windows Server 2008
Symantec eBook: The Guide to E-Mail Archiving and Management
Microsoft Article: RODCs Transform Branch Office Security
Go Parallel Article: James Reinders on the Intel Parallel Studio Beta Program
Avaya Article: Advancing the State of the Art in Customer Service
Adobe Acrobat Connect Pro: Web Conferencing and eLearning Whitepapers
Avaya Article: Avaya AE Services Provide Rapid Telephony Integration with Facebook
Go Parallel Article: Getting Started with TBB on Windows
HP eBook: Storage Networking , Part 1
MORE WHITEPAPERS, EBOOKS, AND ARTICLES
Webcasts
Intel Seminar: Efficiencies in Hardware/Software Virtualization
HP Webcast: Disaster Recovery Planning
Go Parallel Video: Performance and Threading Tools for Game Developers
HP Video: StorageWorks EVA4400 and Oracle
HP Webcast: Storage Is Changing Fast - Be Ready or Be Left Behind
MORE WEBCASTS, PODCASTS, AND VIDEOS
Downloads and eKits
IBM TCO eKIT: Your IT Budget is Under Attack, Get in Control
IBM Energy Efficiency eKIT: Learn How to Reduce Costs
30-Day Trial: SPAMfighter Exchange Module
Red Gate Download: SQL Toolbelt and free High-Performance SQL Code eBook
Iron Speed Designer Application Generator
MORE DOWNLOADS, EKITS, AND FREE TRIALS
Tutorials and Demos
Microsoft Article: Silverlight Streaming--Free Video Hosting for All
Featured Algorithm: Intel Threading Building Blocks - parallel_reduce
HP Demo: StorageWorks EVA4400
MORE TUTORIALS, DEMOS AND STEP-BY-STEP GUIDES