I am having a huge XML file with multiple documents in it (This can be easily converted to multiple files per document, if required). The file size of this huge XML is around 4 GB. I need to index it for better and faster search for the users. Also I am using XSLT to transform this data.
Below is my basic XML structure-
<Docs>
<Doc>
<Title t=""></Title>
<Desc></Desc>
<Info></Info>
</Doc>
<Doc>
<Title t=""></Title>
<Desc></Desc>
<Info></Info>
</Doc>
</Docs>
Each Doc
can be separated, if required. If the user searched for specific word(s), I will be displaying the complete document in HTML format using XSLT.
As I have not used Lucene so far, I have certains questions -
- Will I have to keep both XML file and Lucene Index file?
- If only later one, then, will it be transformable through XSLT?