I ran into the same problem.
The Bayes database is stored in the home directory of the user that runs spamassassin:
bayes_path /path/filename (default: ~/.spamassassin/bayes)
This is the directory and filename for Bayes databases. Several databases will be created, with this as the base directory and filename, with _toks, _seen, etc. appended to the base. The default setting results in files called ~/.spamassassin/bayes_seen, ~/.spamassassin/bayes_toks, etc.
By default, each user has their own in their ~/.spamassassin directory with mode 0700/0600. For system-wide SpamAssassin use, you may want to reduce disk space usage by sharing this across all users. However, Bayes appears to be more effective with individual user databases.
Following this wiki page, http://wiki.apache.org/spamassassin/SiteWideBayesSetup, I added the following two lines to /etc/mail/spamassassin/local.cf:
bayes_path /var/spamassassin/bayes_db/bayes
bayes_file_mode 0777
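To confirm that both settings really ended up in the file, a quick check along these lines may help. This is only a sketch: the function name show_bayes_settings is made up for illustration and is not part of SpamAssassin.

```shell
# Hypothetical helper (not a SpamAssassin tool): print the bayes_path and
# bayes_file_mode lines found in a config file.
show_bayes_settings() {
    # $1: path to the config file, e.g. /etc/mail/spamassassin/local.cf
    grep -E '^[[:space:]]*bayes_(path|file_mode)[[:space:]]' "$1"
}
```

Calling show_bayes_settings /etc/mail/spamassassin/local.cf should print the two lines above if they were saved correctly.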
I created the required directory: /var/spamassassin/bayes_db/
Note that the last "bayes" in the path is the prefix of the database files (bayes_journal, bayes_seen, etc.), not a directory.
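To make the prefix behaviour concrete, here is a small sketch that prints the file names a given bayes_path implies. The function name bayes_db_files is hypothetical, and the suffix list only covers the names mentioned in this post (toks, seen, journal); SpamAssassin may create other files as well.

```shell
# Hypothetical helper: list the database files implied by a bayes_path value.
# Only the suffixes named in this post are covered.
bayes_db_files() {
    # $1: the bayes_path value (directory plus filename prefix)
    for suffix in toks seen journal; do
        printf '%s_%s\n' "$1" "$suffix"
    done
}

bayes_db_files /var/spamassassin/bayes_db/bayes
```

With the bayes_path used here, the names all start with /var/spamassassin/bayes_db/bayes_, which shows that the trailing "bayes" is a filename prefix rather than a subdirectory.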
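OK, after I restarted spamassassin, nothing happened. Still no Bayes tests. Hmm...
So I copied the already-created databases from /root/.spamassassin/* to /var/spamassassin/bayes_db
Update: it turns out I had to change the permissions of these 4 bayes_* files to 0666. Otherwise the auto-learner will not save new data. I am not happy with 0666 permissions, but I hope to find another solution soon.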
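The copy and permission steps can be sketched as a single shell function. The name share_bayes_db is hypothetical, and unlike the copy described above (which took everything in /root/.spamassassin/), this sketch copies only the bayes_* files, which is an assumption on my part. Mode 0666 lets every local user read and write the database, so treat it as a workaround rather than a recommendation.

```shell
# Hypothetical helper: copy existing Bayes files into a shared directory and
# make them readable and writable by every user (mode 0666).
share_bayes_db() {
    src_dir=$1    # e.g. /root/.spamassassin
    dest_dir=$2   # e.g. /var/spamassassin/bayes_db
    mkdir -p "$dest_dir" || return 1
    # Assumption: only the bayes_* files are needed in the shared directory.
    cp -p "$src_dir"/bayes_* "$dest_dir"/ || return 1
    chmod 0666 "$dest_dir"/bayes_* || return 1
    # Show the resulting files and permissions for a quick visual check.
    ls -l "$dest_dir"
}
```

Run as root, share_bayes_db /root/.spamassassin /var/spamassassin/bayes_db would repeat the steps from this post.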
I ran another test with spamc, and... I got Bayes! :)
Result from spamassassin:
# spamassassin -t -D spf,dkim < /path/to/spam.eml
Content analysis details: (8.2 points, 5.0 required)
pts rule name description
---- ---------------------- --------------------------------------------------
3.5 BAYES_99 BODY: Bayes spam probability is 99 to 100%
[score: 1.0000]
1.3 RCVD_IN_BL_SPAMCOP_NET RBL: Received via a relay in bl.spamcop.net
[Blocked - see <http://www.spamcop.net/bl.shtml?141.146.5.61>]
1.0 DATE_IN_PAST_12_24 Date: is 12 to 24 hours before Received: date
-0.0 SPF_PASS SPF: sender matches SPF record
1.3 TRACKER_ID BODY: Incorporates a tracking ID number
0.2 BAYES_999 BODY: Bayes spam probability is 99.9 to 100%
[score: 1.0000]
0.0 HTML_MESSAGE BODY: HTML included in message
0.8 RDNS_NONE Delivered to internal network by a host with no rDNS
Result from spamc:
# spamc -R < /path/to/spam.eml
Content analysis details: (8.2 points, 5.0 required)
pts rule name description
---- ---------------------- --------------------------------------------------
1.3 RCVD_IN_BL_SPAMCOP_NET RBL: Received via a relay in bl.spamcop.net
[Blocked - see <http://www.spamcop.net/bl.shtml?141.146.5.61>]
3.5 BAYES_99 BODY: Bayes spam probability is 99 to 100%
[score: 1.0000]
1.0 DATE_IN_PAST_12_24 Date: is 12 to 24 hours before Received: date
-0.0 SPF_PASS SPF: sender matches SPF record
1.3 TRACKER_ID BODY: Incorporates a tracking ID number
0.2 BAYES_999 BODY: Bayes spam probability is 99.9 to 100%
[score: 1.0000]
0.0 HTML_MESSAGE BODY: HTML included in message
0.8 RDNS_NONE Delivered to internal network by a host with no rDNS