我正在尝试对使用 ASP.NET 构建的站点进行回发请求,我的第一次尝试是在 Perl 中使用 Mechanize 进行,然后我尝试了 Python,但没有结果……我正在阅读可能是损坏的 SSL或回发本身。
我这样做是因为我想每天下载一个文件并手动下载它会花费很多时间,但我无法自动化它。
我愿意接受任何语言或建议。以下是我的 perl 代码(注意:我必须停用 SSL 认证,因为当它处于活动状态时,我没有得到服务器的响应):
use warnings;
use strict;
use WWW::Mechanize;
use Data::Dumper;
use Time::Piece;
use Time::Seconds;
use Try::Tiny;
use Log::Logger;
use File::Basename;
$ENV{PERL_LWP_SSL_VERIFY_HOSTNAME} = 0;
sub getLoggingTime {
my ($sec,$min,$hour,$mday,$mon,$year,$wday,$yday,$isdst)=localtime(time);
my $nice_timestamp = sprintf ( "%04d%02d%02d %02d:%02d:%02d",
$year+1900,$mon+1,$mday,$hour,$min,$sec);
return $nice_timestamp;
}
my $dirname = dirname(__FILE__);
my $log = new Log::Logger;
$log->open_append($dirname . "/pmlcenace.log");
$log->log(getLoggingTime() . " INFO: Inicio del Script pmlcenace.pl");
# =============================
# Seteo de Variables generales
# =============================
my $maxtry = 6;
my $dest = qq(C:/Users/MyUser/Desktop/Precios/);
my $url = 'https://www.cenace.gob.mx/SIM/VISTA/REPORTES/H_CapacidadTransfer.aspx?N=263&site=&tipoArch=C&tipoUni=SIN&tipo=Diarios';
my $params = [
__ASYNCPOST => 'true',
__EVENTARGUMENT => '{"commandName":"Check","index":"0:0"}',
__EVENTTARGET => 'ctl00$ContentPlaceHolder1$treePrincipal',
ctl00_ContentPlaceHolder1_treePrincipal_ClientState => '{"expandedNodes":[],"collapsedNodes":[],"logEntries":[],"selectedNodes":[],"checkedNodes":["0","0:0"],"scrollPosition":0}',
__VIEWSTATE => '/w edited for space and viewing reasons',
__VIEWSTATEGENERATOR => '955A55B8',
__EVENTVALIDATION => '/wEdAAOCpQshmYscjMDA9x+69HkswfFVx5pEsE3np13JV2opXVEvSNmVO1vU+umjph0DtwdLoqQBBqXirK2Np+DpA6TO2lTaZh4NXJjUyfeW6oTM9g==',
'ctl00_ContentPlaceHolder1_ListViewNodos_ClientState' => '',
'ctl00$ContentPlaceHolder1$NotifAvisos$hiddenState' => '',
'ctl00_ContentPlaceHolder1_NotifAvisos_XmlPanel_ClientState' => '',
'ctl00_ContentPlaceHolder1_NotifAvisos_TitleMenu_ClientState' => '',
'ctl00_ContentPlaceHolder1_NotifAvisos_ClientState' => '',
'ctl00$ContentPlaceHolder1$btnCerrarPanel' => '',
'ctl00$ContentPlaceHolder1$toolkit' => '{"expandedNodes":[],"collapsedNodes":[],"logEntries":[],"selectedNodes":[],"checkedNodes":["0","0:0"],"scrollPosition":0}'
,''=>''];
# ===========================================
# Configuración e inizialización de Mechanize
# ===========================================
my $mech = WWW::Mechanize->new();
$mech->show_progress(1);
$mech->max_redirect(0);
$mech->agent_alias('Windows Mozilla');
#$mech->add_header(
# User_Agent => 'Mozilla/5.0 (Windows NT 6.1; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/58.0.3029.110 Safari/537.36'
# );
# ==========================================
# POST Request a la página del CENACE
# Nota: Intenta hasta 5 veces, si no exit: 1
# ==========================================
while ($maxtry > 0) {
$log->log(getLoggingTime() . " INFO: POST Request: " . $url);
$mech->post($url,$params);
if ($mech->status == '200') {
last;
} else {
$maxtry--;
$log->log(getLoggingTime() . " WARN: POST Request fallido. [" . $maxtry . "] Intentos restantes");
if ($maxtry == 1) {
$log->fail(getLoggingTime() . " ERROR: Numero máximo de intentos alcanzados");
exit 1;
}
sleep(1.5);
}
}
print "\n\n\n\n\n Validating: AFTER DOING THE POST:\n";
print $mech->content( base_href => [my $base_href|undef] );
if ($mech->is_html() == 1)
{
print "\n";
print "It DOES have HTML";
print "\n";
}
else
{
print "\n";
print "Do NOT have HTML";
print "\n";
}
我得到的响应是 RedirectError,如下所示:
1|#||4|87|pageRedirect||%2fContacto.aspx%3faspxerrorpath%3d%2fSIM%2fVISTA%2fREPORTES%2fH_CapacidadTransfer.aspx|
但是该请求必须返回一个 HTML 代码,其中包含我需要的文件链接。
也许我错过了一些我不知道的东西。我真的很感激任何帮助或解决方案。
谢谢