logo       

Re: Bogus Data within <style> tags poisoning SA results: msg#01308

users-spamassassin

Subject: Re: Bogus Data within <style> tags poisoning SA results

On Fri, 31 Jul 2009, Nathan M wrote:

Here's an example of what we're seeing within the message source.

<style>
creatures quickly produce approve crevice nuclear moping
esoteric pernicious motion faith does embodies does
purify testament maximum exceeding centralism intellect prey
tidying welcomed traal impress tuneless athwart mansions
endures flames echo motion rooms alcohol rituals
etc.. etc.. etc..
</style>

Style tags have some format requirements. It might be reasonable (though expensive) to try to detect style tags that do not have any of those syntactic elements...

For now, though, this is just more bayes poison. Train it as spam and the scores will go up.

--
John Hardin KA7OHZ http://www.impsec.org/~jhardin/
jhardin@xxxxxxxxxx FALaholic #11174 pgpk -a jhardin@xxxxxxxxxx
key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C AF76 D822 E6E6 B873 2E79
-----------------------------------------------------------------------
False is the idea of utility that sacrifices a thousand real
advantages for one imaginary or trifling inconvenience; that would
take fire from men because it burns, and water because one may drown
in it; that has no remedy for evils except destruction. The laws
that forbid the carrying of arms are laws of such a nature. They
disarm only those who are neither inclined nor determined to commit
crime. -- Cesare Beccaria, quoted by Thomas Jefferson
-----------------------------------------------------------------------
5 days until the 274th anniversary of John Peter Zenger's acquittal

<Prev in Thread] Current Thread [Next in Thread>
Google Custom Search

News | Mail Home | sitemap | FAQ | advertise