0

我有一个带有以下标签的 xml 文件:

<param>
  <name>date</name>
  <Value>20010202</Value>
</param>

我在一个文件中有很多这样的标签。我需要通过批处理读取此文件并将日期的值更新为名称标签包含“日期”的字符串。我已经批量编写了一些代码来将日期(无论它在完整文件中的哪个位置)替换为所需的字符串。我的代码如下:

for /f "tokens=1,* delims=]" %%K in ('"type "example.xml"|find /n /v """') do (
set "line=%%L"
if defined line (
  call set "line=echo.%%line:20010202=desiredString%%"
  for /f "delims=" %%C in ('"echo."%%line%%""') do %%~C
  ) else echo.
) >> "example.tmp"

但我需要检查替换的日期是否应该是名称标签的值标签为“日期”。我想在 for 循环中执行检查,然后在当前行匹配时读取下一行。

如何在 for 循环中读取一行并检查批处理文件中的 if 条件?

4

2 回答 2

4

我想您可以使用纯批处理编写解决方案,但它可能会很慢而且很脆弱。脆弱我的意思是 XML 可以以逻辑上等效的方式重新格式化,但会破坏您的代码。Batch 可能是您可以用来操作 XML 的最糟糕的语言之一。

理想情况下,您应该使用专门设计用于处理 XML 的命令行工具。或者,您可以使用任何支持正则表达式搜索和替换的语言或工具。一些选项 - VBScript、JScript、gnu sed for Windows、Powershell ......不胜枚举。

我编写了一个混合批处理/JScript REPL.BAT 实用程序,它为批处理执行正则表达式搜索和替换文本文件提供了一种简单的方法。它非常快,比任何纯批处理解决方案都要快得多。

下面是使用我的 REPL.BAT 实用程序的简单解决方案:

@echo off
setlocal
set "file=example.xml"
set "oldDate=20010202"
set "newDate=20130123"
set "search=(<param>\s*<name>date</name>\s*<Value>)%oldDate%(</Value>\s*</param>)"
type "%file%"|repl "%search%" "$1%newDate%$2" m >"%file%.new"
move /y "%file%.new" "%file%" >nul

这是 REPL.BAT 实用程序。完整的文档嵌入在脚本中。

@if (@X)==(@Y) @end /* Harmless hybrid line that begins a JScript comment

::************ Documentation ***********
:::
:::REPL  Search  Replace  [Options  [SourceVar]]
:::REPL  /?
:::
:::  Performs a global search and replace operation on each line of input from
:::  stdin and prints the result to stdout.
:::
:::  Each parameter may be optionally enclosed by double quotes. The double
:::  quotes are not considered part of the argument. The quotes are required
:::  if the parameter contains a batch token delimiter like space, tab, comma,
:::  semicolon. The quotes should also be used if the argument contains a
:::  batch special character like &, |, etc. so that the special character
:::  does not need to be escaped with ^.
:::
:::  If called with a single argument of /? then prints help documentation
:::  to stdout.
:::
:::  Search  - By default this is a case sensitive JScript (ECMA) regular
:::            expression expressed as a string.
:::
:::            JScript syntax documentation is available at
:::            http://msdn.microsoft.com/en-us/library/ae5bf541(v=vs.80).aspx
:::
:::  Replace - By default this is the string to be used as a replacement for
:::            each found search expression. Full support is provided for
:::            substituion patterns available to the JScript replace method.
:::            A $ literal can be escaped as $$. An empty replacement string
:::            must be represented as "".
:::
:::            Replace substitution pattern syntax is documented at
:::            http://msdn.microsoft.com/en-US/library/efy6s3e6(v=vs.80).aspx
:::
:::  Options - An optional string of characters used to alter the behavior
:::            of REPL. The option characters are case insensitive, and may
:::            appear in any order.
:::
:::            I - Makes the search case-insensitive.
:::
:::            L - The Search is treated as a string literal instead of a
:::                regular expression. Also, all $ found in Replace are
:::                treated as $ literals.
:::
:::            E - Search and Replace represent the name of environment
:::                variables that contain the respective values. An undefined
:::                variable is treated as an empty string.
:::
:::            M - Multi-line mode. The entire contents of stdin is read and
:::                processed in one pass instead of line by line. ^ anchors
:::                the beginning of a line and $ anchors the end of a line.
:::
:::            X - Enables extended substitution pattern syntax with support
:::                for the following escape sequences:
:::
:::                \\     -  Backslash
:::                \b     -  Backspace
:::                \f     -  Formfeed
:::                \n     -  Newline
:::                \r     -  Carriage Return
:::                \t     -  Horizontal Tab
:::                \v     -  Vertical Tab
:::                \xnn   -  Ascii (Latin 1) character expressed as 2 hex digits
:::                \unnnn -  Unicode character expressed as 4 hex digits
:::
:::                Escape sequences are supported even when the L option is used.
:::
:::            S - The source is read from an environment variable instead of
:::                from stdin. The name of the source environment variable is
:::                specified in the next argument after the option string.
:::

::************ Batch portion ***********
@echo off
if .%2 equ . (
  if "%~1" equ "/?" (
    findstr "^:::" "%~f0" | cscript //E:JScript //nologo "%~f0" "^:::" ""
    exit /b 0
  ) else (
    call :err "Insufficient arguments"
    exit /b 1
  )
)
echo(%~3|findstr /i "[^SMILEX]" >nul && (
  call :err "Invalid option(s)"
  exit /b 1
)
cscript //E:JScript //nologo "%~f0" %*
exit /b 0

:err
>&2 echo ERROR: %~1. Use REPL /? to get help.
exit /b

************* JScript portion **********/
var env=WScript.CreateObject("WScript.Shell").Environment("Process");
var args=WScript.Arguments;
var search=args.Item(0);
var replace=args.Item(1);
var options="g";
if (args.length>2) {
  options+=args.Item(2).toLowerCase();
}
var multi=(options.indexOf("m")>=0);
var srcVar=(options.indexOf("s")>=0);
if (srcVar) {
  options=options.replace(/s/g,"");
}
if (options.indexOf("e")>=0) {
  options=options.replace(/e/g,"");
  search=env(search);
  replace=env(replace);
}
if (options.indexOf("l")>=0) {
  options=options.replace(/l/g,"");
  search=search.replace(/([.^$*+?()[{\\|])/g,"\\$1");
  replace=replace.replace(/\$/g,"$$$$");
}
if (options.indexOf("x")>=0) {
  options=options.replace(/x/g,"");
  replace=replace.replace(/\\\\/g,"\\B");
  replace=replace.replace(/\\b/g,"\b");
  replace=replace.replace(/\\f/g,"\f");
  replace=replace.replace(/\\n/g,"\n");
  replace=replace.replace(/\\r/g,"\r");
  replace=replace.replace(/\\t/g,"\t");
  replace=replace.replace(/\\v/g,"\v");
  replace=replace.replace(/\\x[0-9a-fA-F]{2}|\\u[0-9a-fA-F]{4}/g,
    function($0,$1,$2){
      return String.fromCharCode(parseInt("0x"+$0.substring(2)));
    }
  );
  replace=replace.replace(/\\B/g,"\\");
}
var search=new RegExp(search,options);

if (srcVar) {
  WScript.Stdout.Write(env(args.Item(3)).replace(search,replace));
} else {
  while (!WScript.StdIn.AtEndOfStream) {
    if (multi) {
      WScript.Stdout.Write(WScript.StdIn.ReadAll().replace(search,replace));
    } else {
      WScript.Stdout.WriteLine(WScript.StdIn.ReadLine().replace(search,replace));
    }
  }
}
于 2013-01-23T14:30:13.637 回答
0

尽管我们都知道 Batch 很慢,但我认为与其他解决方案相比,没有人知道确切的程度。下面有一个针对这个问题的纯批处理解决方案,我认为它可能相当快。我可以请您(或任何人)使用大型 .xml 文件测试此解决方案并获得比较时间吗?这些信息可能对我们所有人都很有价值!

@echo off
setlocal EnableDelayedExpansion
set "file=example.xml"
set "oldDate=20010202"
set "newDate=20130123"
set lastLine=1
set line=
echo "<name>date</name>" >> "%file%"
< "%file%" (for /F "delims=:" %%a in ('findstr /N /C:"<name>date</name>" "%file%"') do (
   if not defined line set /P line=
   set /A lines=%%a-lastLine, lastLine+=lines+1
   for /L %%i in (1,1,!lines!) do set /P "line=!line!" & echo/
   set nextLine=
   set /P nextLine=
   if defined nextLine (
      echo !line!
      set "line=!nextLine:<Value>%oldDate%</Value>=<Value>%newDate%</Value>!"
   )
)) > "%file%.new"
move /Y "%file%.new" "%file%"

请注意,以前的 Batch 程序错误地处理文件中的空行。虽然这点可以修复,但是额外的代码会减慢程序的速度,所以我想先了解一下原始代码与其他解决方案的比较。使用没有空行的 .xml 文件测试此程序。

安东尼奥

于 2013-01-24T05:33:59.917 回答