Entry 31126 (Berkeley CSUA MOTD)

2004/7/2-4 [Computer/SW/SpamAssassin] UID:31126 Activity:high

7/2     I notice that in my .spamassassin directory, there is a 5
        Meg file called bayes_toks.  I _do_ understand why it is there.
        But, there is another 5 Meg file called bayes_toks.expire84232.
        And there are 4 other bayes_toks.expire files of varying
        size.  Why are they there?  Can I delete them?  My spamassassin dir
        takes up nearly 13 Megs.  I gather that the expire files can be
        deleted.  I believe that I hit my hard quota while spamc was
        running, leaving these orphaned files.  Assuming this to be the
        case, how do I stop spamc from auto-learning?
         \_ So, I noticed that in my messages that were classified as spam,
            it said "autolearn=no", but in those classified as ham, it said
            "autolearn=ham", so I thought that the expire files are created
            while it's trying to auto-learn.  But in fact, it seems that it's
            when I receive spam (at least in one case) that it creates the
            expire files.  What is it doing when it is creating
            bayes_toks.expire files and how do I get it to stop?  I just
            want it to filter my mail.  Why should it create any files?  I
            can sa-learn it on my own time.  -op
        \_ autolearn=no means the score wasn't decisive enough for
           spamassassin to auto-learn the message as either spam or ham.
           You need to train spamassassin manually with that message.
           According to the global setting in
           /usr/local/share/spamassassin/10_misc.cf
           Any mail with score > 12 is learnt as spam.
           Any mail with score < 0.1 is learnt as ham.
        \_ http://csua.org/u/80t
           \_ this is a good example of why url shortening can be bad.
              \_ I don't think it's terribly bad, but if it makes you feel
                 better, I've changed the result page for shortcutting so it
                 shows Title: link for easy copy-and-pasting of the whole
                 thing.  It won't necessarily end up shorter than the original
                 that way, but it will possibly be more informative.  --dbushong
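
The threshold rule quoted from 10_misc.cf in the thread above can be sketched in a few lines. This is only a simplified model: the 12 and 0.1 cutoffs come from the reply above and may be overridden locally, the function name is made up for illustration, and the real auto-learner may apply extra conditions beyond the raw score.

```python
# Thresholds quoted in the thread (from 10_misc.cf). Treat them as
# assumptions, since a site or user config can override them.
SPAM_THRESHOLD = 12.0
HAM_THRESHOLD = 0.1


def autolearn_verdict(score, spam_threshold=SPAM_THRESHOLD,
                      ham_threshold=HAM_THRESHOLD):
    """Return the autolearn tag for a message score (simplified model).

    Hypothetical helper for illustration; it is not part of SpamAssassin.
    """
    if score > spam_threshold:
        return "spam"   # confident enough to learn as spam
    if score < ham_threshold:
        return "ham"    # confident enough to learn as ham
    return "no"         # in between: nothing is learned automatically
```

Under this model a message that scores between the two cutoffs gets "no", which would fit what op saw: mail tagged as spam can still show autolearn=no if it scored at or below 12. To turn auto-learning off entirely, SpamAssassin documents a `bayes_auto_learn` setting (0 disables it); whether a per-user user_prefs file is honored on a given install is an assumption to check.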

Cache (1650 bytes)

csua.org/u/80t -> lists.roaringpenguin.com/pipermail/mimedefang/2004-January/019591.html

A few days ago I reported a problem I was having with my bayes database to the SATalk mailing list, along with the observation that I was pretty sure it was a bug in the bayes expiry software. [...] expire4752 [...]

A few days later I got a message from another person, David Lee, who had run into the same problem and who thought it might be due to the controlling agent, in his case a program called MailScanner, timing out the expiry process before it could complete. It turns out that is exactly what was happening (I think).

Bayes expiry can often take 3 or 4 minutes to complete, and if the system load happens to be really high when a mimedefang/spamassassin process decides it's time to do an expiry, the process can easily take much longer. If it takes longer than 5 minutes you're in trouble, since AFAIK the sendmail default timeout on a milter operation is 5 minutes. If a bayes expiry takes longer than 5 minutes it will be abruptly terminated? I'm also pretty sure this must be the case because I copied the files to another location for testing and ran an expire via sa-learn, and it finished successfully in about 8 minutes, so it wasn't a matter of a corrupted database causing the problem.

[...] cf file and use sa-learn to force an expire on a regular basis via cron. As I recall someone in this forum suggested such an approach in a previous posting, but never gave a reason, so it didn't occur to me that it was mandatory and not just a matter of personal preference.

I'll also be reporting this to the SATalk mailing list along with the observation that bayes expiry takes much too long, and the code could use some work to improve performance.
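
The workaround described in the cached post (run expiry from cron via sa-learn instead of during delivery) might look roughly like the sketch below. The function names, the ~/.spamassassin path and the 30-minute timeout are assumptions for illustration. `sa-learn --force-expire` is the documented way to force an expiry pass, and automatic expiry is presumably switched off with the `bayes_auto_expire 0` setting, though the cached text is cut off before it names the option.

```python
import glob
import os
import subprocess


def stale_expire_files(bayes_dir):
    """List leftover bayes_toks.expire* files as (path, size_in_bytes)."""
    pattern = os.path.join(bayes_dir, "bayes_toks.expire*")
    return sorted((path, os.path.getsize(path)) for path in glob.glob(pattern))


def force_expire(run=subprocess.run, timeout_seconds=1800):
    """Run one expiry pass via sa-learn, outside the mail delivery path.

    The timeout is an arbitrary generous value (assumption), so a slow
    expiry is not cut off the way a 5-minute milter timeout would cut it.
    """
    command = ["sa-learn", "--force-expire"]
    return run(command, check=True, timeout=timeout_seconds)


# Example use from a script that cron invokes (path is an assumption):
#   for path, size in stale_expire_files(os.path.expanduser("~/.spamassassin")):
#       print(size, path)
#   force_expire()
```

The listing helper only reports the leftover files and their sizes. Deleting them is left as a manual step, once no expiry run is in progress.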