regex - 如何在 VBScript RegEx 中用换行符替换

Question

我正在使用 VBScript 并有一个将 xml 转换为文本文件的脚本。

我正在尝试进行替换以将字符串替换为###EntryEnd###\|LF字符。

我尝试\n了\x0a替换模式，但它们不起作用。我发现的唯一解决方法是Chr(10)改用。

我一直在寻找这种行为的答案，但无法找到它。两者都\n应该\x0a工作。有什么建议吗？

这是代码：

' Method to process the file
Private Function PrepFile(ByVal strInp)
    With New RegExp
        .Global = True
        .Pattern = "\|"
        strInp = .Replace(strInp, "")
        .Pattern = "<xmldoc .*?xml:lang=""([^""]+)"">"
        strInp = .Replace(strInp, "English|$1|Part Of Speech|Note|EngDef|Glossary Definition###EntryEnd###|")
        .Pattern = "<remove>.*?</remove>"
        strInp = .Replace(strInp, "")
        .Pattern = "(<tab/>|</para>)"
        strInp = .Replace(strInp, "|")
        .Pattern = "<[^>]*>"
        strInp = .Replace(strInp, "")
        .Pattern = "\n"
        strInp = .Replace(strInp, "")
        .Pattern = "###EntryEnd###\|"
        strInp = .Replace(strInp, chr(10))
    End With
    PrepFile = strInp
End Function

示例文件片段：

<?xml version="1.0" encoding="UTF-8"?>
<xmldoc source="" type="TERMS" xml:lang="hu-HU">
<para id="13" name="Entry"><notrans><seg>School Administrator</seg><tab/></notrans><remove>___________</remove><seg>iskolavezető</seg></para>
<para id="14" name="Usage"><notrans><seg> </seg><tab/></notrans><remove>HASZNÁLAT:</remove><seg> </seg></para>
<para id="15" name="EntryText"><notrans><seg> </seg><tab/></notrans><remove>MEGHATÁROZÁS:</remove><seg> </seg></para>
<para id="16" name="Context"><remove>PÉLDA:</remove><remove><seg>Cathy Brown iskolavezető</seg></remove><notrans>###EntryEnd###</notrans></para>
<para id="17" name="Entry"><notrans><seg>School Resource Officer</seg><tab/></notrans><remove>___________</remove><seg>iskolarendőr</seg></para>
<para id="18" name="Usage"><notrans><seg> </seg><tab/></notrans><remove>HASZNÁLAT:</remove><seg> </seg></para>
<para id="19" name="EntryText"><notrans><seg>a law enforcement officer who is responsible for providing security and crime prevention services in schools in parts of the United States and Canada.|</seg><tab/></notrans><remove>MEGHATÁROZÁS:</remove><seg>rendőr, aki azért felelős, hogy az iskolákban biztonsági és bűnmegelőzési feladatokat lásson az Egyesült Államok és Kanada egyes területein.</seg></para>
<para id="20" name="Context"><remove>PÉLDA:</remove><remove><seg>Ocalai iskolarendőrök</seg></remove><notrans>###EntryEnd###</notrans></para>
</xmldoc>

score 1 · Accepted Answer

在您的问题中，“问题”（只是错误的假设）可以在

两者都\n应该\x0a 工作

该方法的文档Replace没有说明替换字符串允许使用转义序列，除了$1, $2, ... 在正则表达式模式中对捕获组的引用。

因此，如果RegExp对象在替换字符串中没有提供此行为，并且由于 VBScript 解析器不处理字符串中的任何转义序列，除了转义的双引号，则没有任何元素处理\n到换行符的转换。

您可以使用指示的转义序列来表示搜索模式字符串中的非打印字符，但它们不会被视为替换字符串中的转义序列。

如果你不喜欢Chr(10)函数调用，你可以使用可用的vbLf常量来引用换行符

strInp = .Replace(strInp, vbLf)

regex - 如何在 VBScript RegEx 中用换行符替换

1 回答 1

Related

Reference