2

我试图使用 xdmp:document-filter 从“pptx”文件中提取元数据。以下是我在查询控制台中运行的代码。

xquery version "1.0-ml";
declare namespace html = "http://www.w3.org/1999/xhtml";

let $uri := "/documents/b46682e6b00156d98cb9ba26222c57ab8cbd60f1.pptx"
let $xxx := xdmp:document-filter(fn:doc($uri), ())
return $xxx

似乎没有任何效果,并且在显示消息“查询控制台丢失连接,并尝试重新建立连接”之后。所以我查看了服务器日志文件并看到分段错误..以下是一些日志文件..我尝试提取元数据的方式有什么问题吗?

Segmentation fault in thread 140239321458432 addr 0x10
Thread 224 (Thread 0x7f8ccca01700 (LWP 6307)):
#0  0x00007f8cc8b7d9c0 in sem_wait () from /lib64/libpthread.so.0
#1  0x0000000004081327 in svc::Semaphore::wait(bool) const ()
#2  0x000000000409c22d in svc::StarterThread::run() ()
#3  0x000000000409cf8b in svc::Thread::top() ()
#4  0x000000000409e189 in runThread ()
#5  0x00007f8cc8b77f18 in start_thread () from /lib64/libpthread.so.0
#6  0x00007f8cc7f78b2d in clone () from /lib64/libc.so.6
Thread 223 (Thread 0x7f8ccc9e2700 (LWP 6315)):
#0  0x00007f8cc8b7e9ad in accept () from /lib64/libpthread.so.0
#1  0x000000000408bd75 in svc::Socket::accept(sockaddr_in&) ()
#2  0x0000000003d546cf in xdmp::XDQPServerThread::run() ()
#3  0x000000000409cf8b in svc::Thread::top() ()
#4  0x000000000409e189 in runThread ()
#5  0x00007f8cc8b77f18 in start_thread () from /lib64/libpthread.so.0
#6  0x00007f8cc7f78b2d in clone () from /lib64/libc.so.6
Thread 222 (Thread 0x7f8ccc9c3700 (LWP 6316)):
#0  0x00007f8cc8b7ee6d in nanosleep () from /lib64/libpthread.so.0
#1  0x000000000409bf1b in svc::Thread::sleep(unsigned int) ()
#2  0x000000000242b1c5 in xdmp::ClusterManager::clusterThread() ()
#3  0x000000000409cf8b in svc::Thread::top() ()
#4  0x000000000409e189 in runThread ()
#5  0x00007f8cc8b77f18 in start_thread () from /lib64/libpthread.so.0
#6  0x00007f8cc7f78b2d in clone () from /lib64/libc.so.6
4

1 回答 1

0

我也能够在 ML 8.0-4.2 中重现此问题。

当我将查询更改为 xdmp:document-filter(fn:doc($uri)) 时效果很好

于 2016-02-15T15:55:48.337 回答