>>>
>>> iHateSpam: A community voting scheme to determine what is and is not
>>> spam. Spam is identified too late by the community for those that
>>> often poll their mailboxes.
>>>
>>> I was going to say that iHateSpam is yet another attempt to use
>>> users to vote on spam; i.e., use humans instead of blocklists,
>>> bayesian filters, DCC (how many received the same mail),
>>> greylisting, and other methods. I was also going to say that is very
>>> much like Cloudmark's SpamNet (renamed to Desktop after they yanked
>>> away the free version from all their users that helped them test and
>>> debug their app) but then I saw the following article which says
>>> that iHateSpam is just a different front-end to Cloudmark:
>>>
>>>
http://www.pcmag.com/article2/0,1895,2280345,00.asp
>>>
>>> Community voting of spam sounds great as described but it doesn't
>>> work if you grab your e-mails almost as soon as they show up in your
>>> mailbox. The idea is that users vote on the spam (by marking it as
>>> spam) and a hash value (fingerprint) gets sent to their server to
>>> get updates to other users. If they receive the spam and if they
>>> have gotten a database update then the spam gets identified. If you
>>> poll your mailbox at long enough intervals and if enough other users
>>> have voted on the message as spam and if you have gotten a database
>>> update then you won't see the spam. A lot of if's.
>>>
>>> The voting scheme is very similar to use DCC to determine if an
>>> e-mail is spam or not. A hash of the message is sent to their
>>> server to record how many recipients got that message. The idea is
>>> that you can set a threshold, say 20 or 50, after which if more than
>>> that many recipients got that e-mail. What it really does is
>>> provide a measure of whether or not an e-mail is bulk e-mail without
>>> it identifying itself as such (by using the "Precedence: Bulk"
>>> header). Cloudmark takes it a step further by having users vote on
>>> whether or not a message is spam to record the hash for that message
>>> so others that get that same message (and the same hash) might know
>>> it was voted as spam. Why do you think those spammers adds
>>> paragraphs of nonense to their spam? Because changing just one word
>>> will generate a different hash. Some of their spam source will spew
>>> the same message and you might block that one that has already been
>>> voted on sufficiently to identify it as spam, but the same spam from
>>> a different spam source has a few words changed to change the hash
>>> value. It's like when the captain on the Enterprise says to rotate
>>> frequencies of their shields to thwart the Borg: the spammer just
>>> rotates frequencies by generating a different hash for their same
>>> turds that they're firing at you.
>>>
>>> If you poll your mailbox at, say, 5- or 10-minute intervals, no one
>>> else (or few others) have yet voted on the spam that just got
>>> started in spewing out from the spam sources. There are no votes,
>>> or not enough of them, to identify the message as spam. You haven't
>>> yet gotten an update from their server so the message won't be
>>> identified as spam.
>>>
>>> I participated in the SpamNet testing. When spam was fresh, oh joy,
>>> I got to vote on it so *others* could take advantage of my voting to
>>> not see that spam. I still saw it, though. So if you poll your
>>> mailbox every hour, or longer, then you get to ride on the coattails
>>> of the other users that voted the message was spam. But obviously
>>> those folks that voted had to actually see the spam so the scheme
>>> obviously didn't help them to get rid of the spam in their mailbox.
>>> Because I was polling my accounts at under 15-minute intervals,
>>> almost all the spam got through because no one had voted on it yet
>>> except me and maybe a few others that just got the freshly spewed
>>> turd dumped in the mailbox.
>>>
>>> Another big problem with community-driven "intelligence" in
>>> identifying spam is that a large majority of users will claim
>>> something is spam when it does meet the qualifications of UCE or
>>> UBE. They say something is spam simply because it is e-mail that
>>> they don't want.
>>>
>>> So if you use Cloudmark (via iHateSpam) then don't poll your mailbox
>>> very often. Otherwise, you will see the spam and get the joy of
>>> voting on it so *others* won't see it.
>> Interesting. I have to admit that I did not even know that iHateSpam
>> had this feature.