0

I found there is a lot of tools available for breaking the Big PDF files into smaller one by splitting the original PDF file PAGE WISE.for example, if i have a 10 page PDF Document,then we can able to break the original pdf file into 10 pieces in page wise splitting.

But i want similar kind of tool that breaks the PDF file smaller than the Page wise splitting.That means,i need to split the PDF page into different documents based on any parameter like paragraph,section,element...

for example,
If my PDF file having 2 pages with 10 paragraphs then i would like to split the pdf file into 10 separate Pdf file based on paragraph parameter...

Also, I strongly believe pdf does not contain any structure like Open XML.But i also Suspecting


How the tools can able to break the pdf files in to small pdf files by splitting page wise?
What kind of mechanism they are using for page wise splitting PDF File?

So, Is there any way to do my work? Please give me your valuable suggestion on this?

4

1 回答 1

2

PDF是一种基于矢量的文档描述语言。它是基于页面的,因此在某种程度上每个页面都独立于下一个页面。因此,明智地拆分页面非常容易。与可以在 pdf 中独立提取小子集的光栅图像相反,您必须渲染整个页面才能知道小子集的外观。

假设您有一个页面(黑色),其中包含一个复杂形状的对象(这里是一条线,但它可以是任何文本、形状、图像等),并且您想要提取一个子集(红色)。您必须首先找到在感兴趣区域中产生可见输出的所有对象。然后您必须修改它们,以便正确渲染它们(在这种情况下,从蓝点计算绿点,同时保留对象的形状)。

页面上的复杂形状

一种更简单的方法是包含整个页面并将查看区域裁剪为区域的尺寸。

你可以用pdfjam. 结合自定义纸张大小检查--trim//命令--offset( pdfjam 网站上的示例 6,7)。--delta不过,您仍然必须以某种方式计算感兴趣区域的坐标。

于 2012-02-27T08:13:51.387 回答