0

Xerces2-j XMLInputSource以及 SAX InputSource指的是公共和系统标识符。Xerces2-JXMLInputSource也指基本系统标识符。

这些标识符代表什么?

编辑:Xerces-J,当将文件位置作为 SystemId 时,将打开文件作为输入。如果输入是作为字节流提供的,而不是从其他来源(例如数据库)提供,那么公共 ID 或系统 ID 有什么用途吗?

4

2 回答 2

1

如果您查看 XML 语法,您会看到,例如,外部实体引用使用以下语法:

ExternalID ::= 'SYSTEM' S SystemLiteral
  | 'PUBLIC' S PubidLiteral S SystemLiteral

以下是使用此语法的示例:

<!ENTITY open-hatch
         PUBLIC "-//Textuality//TEXT Standard open-hatch boilerplate//EN"
         "http://www.textuality.com/boilerplate/OpenHatch.xml">

对 DTD 的引用以相同的方式工作(实际上,外部 DTD 从技术上讲是一种实体)。

“系统标识符”是一个 URI,用于标识可以在何处找到实体的文本。“公共标识符”(SGML 的遗留物)更像是资源的名称;如果您有某种索引或目录可以告诉您在哪里查找,它只会帮助您找到资源。

系统标识符通常作为相对 URI 引用(例如“books.dtd”)给出,需要相对于基本 URI 进行解析。基本 URI 通常是找到包含资源(或实体)的位置。例如,如果一个 XML 文档是 at,http://my.com/lib/books.xml那么它的基本 URI 是,然后http://my.com/lib/相对 URIbooks.dtd扩展为http://my.com/lib/books.dtd.

在回答您的问题“公众或系统 ID 是否有任何目的”时,如果文档完全由单个实体组成(通常是这种情况),答案是否定的。但是一旦多个实体开始发挥作用,您就需要标识符将它们链接在一起。

于 2022-01-14T17:55:42.677 回答
0

如果输入是作为字节流提供的,而不是从其他来源(例如数据库)提供,那么公共 ID 或系统 ID 有什么用途吗?

不,因为如果输入是字节流,则无需解析实体的位置。

这些标识符代表什么?

我认为这个线程解释得很好:

SYSTEM declaration can be used to specify a file on the local file
system like:

<!DOCTYPE RootElement SYSTEM "C:\validate.dtd">

The problem with this approach is that if the file is made public the
path specified on the local file system will not have any meaning any
more. Even if the path specified in the SYSTEM declaration *is* a URL:

<!DOCTYPE RootElement SYSTEM "http://www.mihaiu.name/validate.dtd">

the parser might be unable to retrieve the DTD file if the system is
not connected to the Internet.

The PUBLIC declaration constitutes a partial solution to this problem.
The string contained in a PUBLIC declaration is not an URL but an URN
(Uniform Resource Name). A URN does not pinpoint the precise location
of the resource, but only clearly specify its name. The *parser* of the
document must be smart enough to be able to generate a URL from a URN
using some internal logic.

Example of a PUBLIC declaration:

<!DOCTYPE RootElement PUBLIC "mihaiu/validate.dtd"
SYSTEM "http://www.mihaiu.name/validate.dtd">

In this case, a custom parser that already has a catalogue of DTDs
published by mihaiu can generate a URL from the PUBLIC declaration. The
generated URL can look like

c:\DTDs\validate.dtd

There is no standard way to convert a URN to a URL, so, if this
conversion fails because the parser does not contain the internal logic
to perform such a conversion (or for whatever other reason) the parser
will attempt to use the SYSTEM declaration which in this case resolves
to

http://www.mihaiu.name/validate.dtd

Important observation:
Since there is no standard way to generate a URL from a URN the PUBLIC
declarations can only be useful for customized parsers !!! (e.g. they
are not useful for general purpose parsers like Xerces)
于 2022-01-14T17:47:55.733 回答