0

我正在尝试XML::Tidy在其他中使用缩进XML文件:

sub reformatXML {
    #
    # the only argument to this function is the file name
    #
    my $file = $_[ 0 ];
    #
    # create a new XML::Tidy object from $file
    #
    my $tidy = XML::Tidy->new( 'filename' => $file );
    #
    # Tidy up the indenting
    #
    $tidy->tidy();
    #
    # write out changes back to the file
    #
    $tidy->write();
    print "$file was reformated.\n";
    return
}

sub main(){
    #
    # get the current directory in which is the 
    # program running on
    #
    #my $current_dir = getcwd;
    #iterateDir( $current_dir );
    my $file = "/path/to/xml/file/autotest.xml";
    reformatXML( $file );
}

就如此容易。但是,当我调用我的main()函数时,我得到:

501 Protocol scheme 'd' is not supported d:/UDU/r/tc10.0.0.2012080100_buildA/src/build/kits/tc.dtd
Handler couldn't resolve external entity at line 2, column 29, byte 73
error in processing external entity reference at line 2, column 29, byte 73:
<?xml version="1.0" encoding="iso-8859-1"?>
<!DOCTYPE kit SYSTEM "tc.dtd">
============================^
<kit>
  <contact/>
 at C:/xampp/perl/site/lib/XML/Parser.pm line 187

我是新手Perl,我不知道为什么会出现这个错误。有人可以帮我弄清楚吗?

文件的头部XML是:

 <?xml version="1.0" encoding="iso-8859-1"?>
<!DOCTYPE kit SYSTEM "tc.dtd">
<kit>
  <contact/>
  <description>autotest files</description>
  <history>
    <hist>06-May-2005                  Created</hist>
    <hist>17-Jun-2005            Add autotest.jar to rtkit</hist>
    <hist>29-Jun-2005            Remove bits picked up elsewhere</
hist>
    <hist>15-Jul-2005            Added acad_add_note_types.</hist>
    <hist>20-Sep-2005            Add ai stuff</hist>
    <hist>31-Oct-2005            DMS BnT fixes</hist>
    <hist>03-Nov-2005            Pander to kitting's obsession abo
ut unique filenames</hist>
    <hist>17-Nov-2005            Add ics schema and junit</hist>
    <hist>09-Dec-2005            add gdt_autotest</hist>
    <hist>11-Jan-2006            Merge in P10.0.1.5</hist>
    <hist>16-Jan-2006      Merge</hist>
    <hist>26-Jan-2006      Need inclass.plmxml to pass tceng
_util autotest</hist>
    <hist>06-Mar-2006      Add qdiff.pl</hist>
    <hist>09-Mar-2006      Kernel tests need a couple fms cl
ient files</hist>
    <hist>10-Mar-2006      Missing dependent library</hist>
    <hist>19-Jan-2006      Merged from timb_gmo</hist>
    <hist>17-Jan-2006      GMO Kernel Autotests Implementati

4

1 回答 1

2

XML::Tidy(或者更确切地说,它使用的模块之一)似乎期望文件的绝对路径是有效的 URL,但事实并非如此。它认为指定的 URL 是

 d:/UDU/r/tc10.0.0.2012080100_buildA/src/build/kits/tc.dtd

当它真的

 file:///d:/UDU/r/tc10.0.0.2012080100_buildA/src/build/kits/tc.dtd

我不知道如何解决该错误。你可以尝试改变

my $file = "...";
reformatXML($file);

my $file = "...";
my $url = URI::file->new($file);
reformatXML($url);

这是直接的错误。除此之外,还存在向 DTD 提供相对 URL 的问题。这不一定是错误的,但有点奇怪。这意味着它tc.dtd必须与autotest.xml. 真的是这样吗?


一些解析器(例如 XML::LibXML)有一个选项来避免获取 DTD。这通常是不必要的,因此会浪费时间、金钱、CPU 和带宽。寻找这样的选择。它可能在 XML::Tidy 继承的类之一的构造函数中。

于 2012-08-23T18:08:47.620 回答