3

最近在我的项目中,我使用 lucene 在我们的数据库中进行重复搜索,它运行良好。但是现在需要加密 lucene 索引,我被要求寻找 lucene 本身提供的加密工具,而不是使用外部库。

我刚刚找到了 LUCENE-2228 AES 加密目录并制作了小型 POC。问题是,当我重新索引时,我收到以下错误:

java.lang.RuntimeException: File already Exists
    at org.apache.lucene.util.AESWriter.<init>(AESWriter.java:117)
    at org.apache.lucene.store.AESDirectory$AESIndexOutput.<init>
        (AESDirectory.java:187)
    at org.apache.lucene.store.AESDirectory.createOutput(AESDirectory.java:72)
    at org.apache.lucene.index.SegmentInfos.finishCommit(SegmentInfos.java:939)
    at org.apache.lucene.index.IndexWriter.finishCommit(IndexWriter.java:3539)
    at org.apache.lucene.index.IndexWriter.commitInternal(IndexWriter.java:3529)
    at org.apache.lucene.index.IndexWriter.closeInternal(IndexWriter.java:1879)
    at org.apache.lucene.index.IndexWriter.close(IndexWriter.java:1822)
    at org.apache.lucene.index.IndexWriter.close(IndexWriter.java:1786)
    at org.apache.lucene.test.indexing.main(indexing.java:45)

这是我的代码

public class indexing 
{
    private static final byte[] KEY = 
        new byte[] { 'T', 'h', 'e', 'B', 'e', 's', 't',
'S', 'e', 'c', 'r','e', 't', 'K', 'e', 'y' };
    public static void main(String[] args) throws Exception
    {

        Directory INDEX_DIR = new AESDirectory(new File("index1"),KEY);

        Connection conn=null;

        SnowballAnalyzer analyzer=new SnowballAnalyzer(Version.LUCENE_30,"English");
        try
        {
           Class.forName("com.mysql.jdbc.Driver").newInstance();
           conn = DriverManager.getConnection("jdbc:mysql:///lucene", "abcd", "abcd");
           IndexWriterConfig config = new IndexWriterConfig(Version.LUCENE_34, analyzer);
           IndexWriter writer = new IndexWriter(INDEX_DIR, config);
           writer.deleteAll();
           //writer.flush();
           System.out.println("Indexing to directory '" + INDEX_DIR + "'...");
           long starttime=System.currentTimeMillis();
           indexDocs(writer, conn);
           writer.optimize();
           writer.close();
           long endtime=System.currentTimeMillis();
           long timetaken=TimeUnit.MILLISECONDS.convert(endtime - starttime,TimeUnit.MILLISECONDS);
           System.out.println("Time taken to do indexing is "+timetaken+"ms");
        } 
        catch (Exception e)
        {
           e.printStackTrace();
        }
    }
    static void indexDocs(IndexWriter writer, Connection conn) throws Exception 
    {
        //String sql = "select qid,question from tblquestions";
        String sql = "select qid,question from tblquestions";
        Statement stmt = conn.createStatement();
        stmt.setFetchSize(Integer.MIN_VALUE);
        ResultSet rs = stmt.executeQuery(sql);
        Integer count = 0;
        while (rs.next())
        {
            count ++;
            Document d = new Document();
            d.add(new Field("qid", rs.getString("qid"), Field.Store.YES, Field.Index.NOT_ANALYZED));
            d.add(new Field("question", rs.getString("question"), Field.Store.YES,  Field.Index.ANALYZED));
            writer.addDocument(d);
        }
        System.out.println("count: " + count);
    }
}

谁能帮我解决这个问题。或者给出一些关于 lucene 索引加密的想法。

4

1 回答 1

0

该补丁适用于 3.1 版,您似乎正在使用其他版本的 Lucene。选择以下选项之一,直到您的版本的补丁发布(或编写您自己的补丁!)

  1. 切换到 Lucene 3.1

  2. 使用 Windows NTFS 加密。除非未经授权的人知道如何以创建索引的用户身份登录,否则应该是安全的。

  3. 继续使用 TrueCrypt 或其他外部加密。这应该是非常安全的,但它需要安装 TrueCrypt 和管理权限才能安装加密驱动器。

于 2014-07-18T07:26:12.033 回答