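I'm an intermediate programmer at best, and I haven't used Perl much, so please be gentle in your replies.
I'm trying to extract the original "From:" address (not the envelope-from address) from an inbound email.
I'm parsing inbound emails that pass through the MailScanner software on my server. If I write (using MailScanner's built-in message object):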
my($message) = @_;
MailScanner::Log::InfoLog("from address: @{$message->{headers}}");
I get the following log entry (sanitized):
Received: from [192.168.12.34] (port=56309 helo=theirserver.theirdomain.tld) by server.mydomain.tld with esmtp (Exim 4.86) (envelope-from <sender@theirdomain.tld>) id 1aG62o-0002ad-Hu for recipient@mydomain.tld; Mon, 04 Jan 2016 09:23:34 -0500 Received: from 00a657f7.theirserver.theirdomain.tld ([127.0.0.1]:8056 helo=theirserver.theirdomain.tld) by theirserver.theirdomain.tld with ESMTP id 00PA657MF7; for <recipient@mydomain.tld>; Mon, 4 Jan 2016 06:22:53 -0800 Date: Mon, 4 Jan 2016 06:22:53 -0800 To: <recipient@mydomain.tld> Message-ID: <70562391089443970564001376171645@theirserver.theirdomain.tld> From: "Sender" <sender@theirdomain.tld> Subject: test Content-Language: en-us MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Content-Type: multipart/alternative; boundary="----=Part.960.1818.1451917373"
If I write (based on a suggestion from MailScanner's author):
my($message) = @_;
my $from_address = grep /^From:\s+/i, @{$message->{headers}};
MailScanner::Log::InfoLog("from address after grep = $from_address ");
I get the following log entry:
from address after grep = 0
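Not sure what to do with that result, I tried dumping the message object with Data::Dumper, using a MailScanner-compatible script I found online. That produced the following: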
$VAR1 = bless( {
'nameinfected' => 0,
'otherinfected' => 0,
'disarmedtags' => [],
'othertypes' => {},
'file2entity' => {
'' => bless( {
'ME_Parts' => [
bless( {
'ME_Bodyhandle' => bless( {
'MB_Path' => '/var/spool/MailScanner/incoming/9365/1aG62o-0002ad-Hu/nmsg-9365-3.txt'
}, 'MIME::Body::File' ),
'ME_Parts' => [],
'mail_inet_head' => bless( {
'mail_hdr_foldlen' => 79,
'mail_hdr_modify' => 0,
'mail_hdr_list' => [
'Content-Transfer-Encoding: 8bit
',
'Content-Type: text/plain; charset="UTF-8"
'
],
'mail_hdr_hash' => {
'Content-Type' => [
\$VAR1->{'file2entity'}{''}{'ME_Parts'}[0]{'mail_inet_head'}{'mail_hdr_list'}[1]
],
'Content-Transfer-Encoding' => [
\$VAR1->{'file2entity'}{''}{'ME_Parts'}[0]{'mail_inet_head'}{'mail_hdr_list'}[0]
]
},
'mail_hdr_mail_from' => 'KEEP',
'mail_hdr_lengths' => {}
}, 'MIME::Head' )
}, 'MIME::Entity' ),
bless( {
'ME_Bodyhandle' => bless( {
'MB_Path' => '/var/spool/MailScanner/incoming/9365/1aG62o-0002ad-Hu/nmsg-9365-42.html'
}, 'MIME::Body::File' ),
'ME_Parts' => [],
'mail_inet_head' => bless( {
'mail_hdr_foldlen' => 79,
'mail_hdr_modify' => 0,
'mail_hdr_list' => [
'Content-Transfer-Encoding: 8bit
',
'Content-Type: text/html; charset="UTF-8"
'
],
'mail_hdr_hash' => {
'Content-Type' => [
\$VAR1->{'file2entity'}{''}{'ME_Parts'}[1]{'mail_inet_head'}{'mail_hdr_list'}[1]
],
'Content-Transfer-Encoding' => [
\$VAR1->{'file2entity'}{''}{'ME_Parts'}[1]{'mail_inet_head'}{'mail_hdr_list'}[0]
]
},
'mail_hdr_mail_from' => 'KEEP',
'mail_hdr_lengths' => {}
}, 'MIME::Head' )
}, 'MIME::Entity' )
],
'ME_Epilogue' => [
'
'
],
'ME_Preamble' => [],
'mail_inet_head' => bless( {
'mail_hdr_foldlen' => 79,
'mail_hdr_modify' => 0,
'mail_hdr_list' => [
'Received: from [192.168.12.34] (port=56309 helo=theirserver.theirdomain.tld)
by server.mydomain.tld with esmtp (Exim 4.86)
(envelope-from <sender@theirdomain.tld>)
id 1aG62o-0002ad-Hu
for recipient@mydomain.tld; Mon, 04 Jan 2016 09:23:34 -0500
',
'Received: from 00a657f7.theirserver.theirdomain.tld ([127.0.0.1]:8056 helo=theirserver.theirdomain.tld)
by theirserver.theirdomain.tld with ESMTP id 00PA657MF7;
for <recipient@mydomain.tld>; Mon, 4 Jan 2016 06:22:53 -0800
',
'Date: Mon, 4 Jan 2016 06:22:53 -0800
',
'To: <recipient@mydomain.tld>
',
'Message-ID: <70562391089443970564001376171645@theirserver.theirdomain.tld>
',
'From: "Sender" <sender@theirdomain.tld>
',
'Subject: Test
',
'Content-Language: en-us
',
'MIME-Version: 1.0
',
'Content-Transfer-Encoding: 8bit
',
'Content-Type: multipart/alternative;
boundary="----=Part.960.1818.1451917373"
'
],
and so on.
So next I tried to parse mail_hdr_list with the following:
my($message) = @_;
MailScanner::Log::InfoLog("SpamWhitelist $msgid: mail_hdr_list @{$message->{headers}}[mail_hdr_list]");
I got this result:
Received: from server.theirdomain.tld ([192.168.165.54]:49620 helo=server.theirdomain.tld)
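I'm confused. I don't know how to get the From: address, as opposed to the envelope-from address, out of this object.
Any help rewriting my code would be greatly appreciated.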