[SciPy-dev] Server spam problems spam spam: spam

Peter Skomoroch peter.skomoroch@gmail....
Mon Feb 23 20:13:03 CST 2009


What about black listing spam ips?  http://moinmoin.wikiwikiweb.de/BlackList

On Mon, Feb 23, 2009 at 8:58 PM, Robert Kern <robert.kern@gmail.com> wrote:

> On Mon, Feb 23, 2009 at 19:46, Pauli Virtanen <pav@iki.fi> wrote:
> > Sun, 22 Feb 2009 13:40:20 -0800, Michael Abshoff wrote:
> > [clip]
> >> two tips of fighting spammers from the Sage project's wiki:
> >>
> >>   * add a list of common Chinese words to LocalBadContent, i.e.
> >>
> >> http://wiki.sagemath.org/LocalBadContent
> >>
> >> Also make sure to clean out all the spammer attempts on the hard disk.
> >> I.e I deleted 6,000 directories in "pages" of the Cython wiki since Spam
> >> attempts are preserved and not actually deleted from disk. If you have a
> >> couple ten thousand of those in one directory this might make every wiki
> >> access painfully slow and impact the whole server.
> >
> > Continuing Gael's work, I tried to expand the LocalBadContent list:
> >
> >        http://scipy.org/LocalBadContent
> >
> > I wonder how useful this turns out to be in the end, this smells like an
> > arms race... I doubt the additions cause problems to real pages, but if
> > they do, some of them need to be reverted.
> >
> > [Btw, shouldn't LocalBadContent editing be restricted to those in
> > EditorGroup? And could my account PauliVirtanen be added in the group?]
>
> Done and done.
>
> > Another thing is that there are apparently ca. 11600 pages in the
> > Scipy.org wiki. I'd make a wild guess that at most ~500 of these are
> > valid content; the rest is spam. I'm not sure if getting rid of the spam
> > pages improves Moin's performance.
>
> Probably. Are you volunteering? Peter can give you a shell account. If
> you are willing to take on the other upgrades Michael recommended, to
> add the Captcha, for instance, that would go well, too.
>
> > Do we have any valid pages with CJK characters? Much of the spam seems
> > Chinese, so mass-deleting at least this portion of it shouldn't be
> > impossible to do, given Moin's database format.
>
> The Chinese localized Moin help pages are valid, but that should be it.
>
> --
> Robert Kern
>
> "I have come to believe that the whole world is an enigma, a harmless
> enigma that is made terrible by our own mad attempt to interpret it as
> though it had an underlying truth."
>  -- Umberto Eco
> _______________________________________________
> Scipy-dev mailing list
> Scipy-dev@scipy.org
> http://projects.scipy.org/mailman/listinfo/scipy-dev
>



-- 
Peter N. Skomoroch
617.285.8348
http://www.datawrangling.com
http://delicious.com/pskomoroch
http://twitter.com/peteskomoroch
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://projects.scipy.org/pipermail/scipy-dev/attachments/20090223/295e01e9/attachment-0001.html 


More information about the Scipy-dev mailing list