INFORMATION FOR YOU

Christoph Biedl cbiedl@gmx.de
Fri Dec 10 21:34:14 CET 2004


Zoran Dzelajlija wrote...

> Christophe Cuq <ccuq@chc-conseil.com> wrote:
> > Le 11/21/04 8:04 PM, Christoph Biedl nous disait :
> 
> > > ObCleanfeed: How do you deal with that Matt Parker stuff that is spammed
> > > via Google for weeks?
> 
> > I didn't see it before today. Then I tried to deal with it with :
> 
> > return reject ('matt parker') if ($hdr{'Subject'} =~ /.*matt.parker.*/);
> 
> I thought there was a possibility for false positives with that (consider
> someone discussing the said spammer), so I put this:
> 
>   if ( $#groups > 3 && 
>        $hdr{'Subject'} =~ /matt(hew)?\ parker/i &&
>        $hdr{'__BODY__'} =~ /matt\ parker\ alias/ ) {
>     $rval = "Hate mail unwanted here."
>   }

Here's what I did:
- crosspost to at least five groups
- Generated at Google (check against Message-ID)
- at least 40k size
- the name somewhere in the body

| if ((scalar @groups >= 5) &&
| 	$hdr{'Message-ID'} =~ /\@posting\.google\.com>/ &&
| 	length ($hdr{'__BODY__'}) >= 40000 &&
| 	$hdr{'__BODY__'} =~ / matt parker /) {
| 
|     return reject ("Google-SPAM matt parker");
| }

This has caught several thousand articles and the only false positives I
can think of are top posters or fullquottels via Google again. I'm not
too sad about then.

However, there was no further match since
| Date: 1 Dec 2004 02:27:03 -0800
| Lines: 882
| Message-ID: <efb17758.0412010227.69ffd6c7@posting.google.com>
| NNTP-Posting-Host: 213.229.188.183
but as far as I can see this ugly flooding has really stopped.

    Christoph



More information about the cleaners mailing list