May. 26, 2007

OSBF-lua rules

Last week, I moved my IMAP account from university to a privately hosted server. At the same time, I said goodbye to Spamassassin and switched to OSBF-lua - the results were impressive...

Already installed by Michael, I just had to create some maildrop rules to feed OSBF-lua. After training the bayes database with 42 pieces of spam and 9 of ham, almost any following spam was classified correctly as bulk. The training was easy. Besides some perl/shell scripts, the bayes filter can be trained by sending false pos/negs back to OSBF-lua. You simply have to change the 'Subject:' and 'To:' headers (remember majordomo?). The first thing that came to my mind was: Hey, this could easily be done with a one-click button in Thunderbird. Any extension out there?

Statistics for nonspam.cfc:
-----------------------------------------------
Database version: OSBF-Bayes
Total buckets in database: 94321
Buckets used (%): 9.0
Bucket size (bytes): 12
Header size (bytes): 4092
Number of chains: 7444
Max chain len (buckets): 5
Average chain length (buckets): 1.1
Max bucket displacement: 3
Buckets unreachable: 0
Trainings: 9
Classifications: 88
Learned mistakes: 7
Extra learnings: 0
Ham accuracy (%): 92.47
-----------------------------------------------

Statistics for spam.cfc:
-----------------------------------------------
Database version: OSBF-Bayes
Total buckets in database: 94321
Buckets used (%): 62.6
Bucket size (bytes): 12
Header size (bytes): 4092
Number of chains: 16430
Max chain len (buckets): 60
Average chain length (buckets): 3.6
Max bucket displacement: 29
Buckets unreachable: 0
Trainings: 42
Classifications: 134
Learned mistakes: 2
Extra learnings: 1
Spam accuracy (%): 98.45
-----------------------------------------------
Spam rate (%): 58.11
Global accuracy (%): 95.95
-----------------------------------------------


This article was yet viewed 102 times.

--> Back to the list of articles

Tags: OSBF-luaSpamBayes

Comments


Leave a comment:

(will not be published)

yes no

CAPTCHA image for SPAM prevention  

If you can't read the captcha word, please click to load a new image.
(You need Javascript turned on. Otherwise press the reload button of your browser and be warned that you'll probably have to reenter your comment.)

About this site

T3node is a TYPO3 blog by Steffen Müller. Beside TYPO3, technical and nontechnical topics about free software and networked communication are discussed. It's build with TYPO3.

Creative commons license symbolThe content of this website is distributed under the Creative Commons Attribution - NonCommercial - ShareAlike 3.0 Unported licence.

You can also follow my blubber on Twitter Feed Logo Twitter.

Article tags

--> Find a list of all blog articles

About Steffen Müller

Since 2002, I am a user and developer of the TYPO3 content management system. I understand content management as an interdisciplinary task under the terms of a knowledge society. This task combines technical, economical and social aspects as well as profund analysis, planning and implementation.

Therefore I do not focus on plain coding, but on various aspects like usability, accessibility, customizability or empirical analysis, following actual findings in communication science. I am also very interested in the subjects of knowledge communication in open source communities and knowledge management in general.

TYPO3 TRYDIVER cardI am a strong enthusiast and an active member of the TYPO3 community. Since May, 2009 I am a TYPO3 TRYDIVER ;-).

 

About TYPO3

TYPO3 is my favorite tool for content management. It combines enterprise level features with a well networked, highly active and progressive open source community.

About other sites