2

我试图一次读一章。我正在使用 Paul Seigmann 库。但是,我不知道该怎么做,但我可以从书中得到所有的文字。不知道从那里去哪里。

// find InputStream for book

InputStream epubInputStream = assetManager

            .open("the_planet_mappers.epub");

        // Load Book from inputStream

        mThePlanetMappersBookEpubLib = (new EpubReader()).readEpub(epubInputStream);

        Spine spine = new Spine(mThePlanetMappersBookEpubLib.getTableOfContents());
        for (SpineReference bookSection : spine.getSpineReferences()) {
            Resource res = bookSection.getResource();
            try {
                InputStream is = res.getInputStream();
                BufferedReader r = new BufferedReader(new InputStreamReader(is));
                String line;
                while ((line = r.readLine()) != null) {
                    line = Html.fromHtml(line).toString();
                    Log.i("Read it ", line);
                    mEntireBook.append(line);
                }
            } catch (IOException e) {
            }
4

1 回答 1

1

我不知道你是否还在寻找答案,但是......我现在也在努力。这是我必须检索所有 epub 文件内容的代码:

public ArrayList<String> getBookContent(Book bi) {
    // GET THE CONTENTS OF ALL PAGES
    StringBuilder string = new StringBuilder();
    ArrayList<String> listOfPages = new ArrayList<>();
    Resource res;
    InputStream is;
    BufferedReader reader;
    String line;
    Spine spine = bi.getSpine();
    for (int i = 0; spine.size() > i; i++) {
        res = spine.getResource(i);
        try {
            is = res.getInputStream();
            reader = new BufferedReader(new InputStreamReader(is));
            while ((line = reader.readLine()) != null) {
                // FIRST PAGE LINE -> <?xml version="1.0" encoding="utf-8" standalone="no"?>
                if (line.contains("<?xml version=\"1.0\" encoding=\"utf-8\" standalone=\"no\"?>")) {
                    string.delete(0, string.length());
                }

                // ADD THAT LINE TO THE FINAL STRING REMOVING ALL THE HTML
                string.append(Html.fromHtml(formatLine(line)));

                // LAST PAGE LINE -> </html>
                if (line.contains("</html>")) {
                    listOfPages.add(string.toString());
                }
            }
        } catch (IOException e) {
            e.printStackTrace();
        }
    }
    return listOfPages;
}

private String formatLine(String line) {
    if (line.contains("http://www.w3.org/TR/xhtml11/DTD/xhtml11.dtd")) {
        line = line.substring(line.indexOf(">") + 1, line.length());
    }

    // REMOVE STYLES AND COMMENTS IN HTML
    if ((line.contains("{") && line.contains("}"))
            || ((line.contains("/*")) && line.contains("*/"))
            || (line.contains("<!--") && line.contains("-->"))) {
        line = line.substring(line.length());
    }
    return line;
}

您可能已经注意到,我需要改进过滤器,但我的 ArrayList 中有那本书的每一章。现在我只需要像这样调用 ArrayListmyList.get(0);就完成了。

为了以正确的方式显示文本,我使用了 bluejamesbond:textjustify 库(https://github.com/bluejamesbond/TextJustify-Android)。它易于使用且功能强大。

我希望它对您有所帮助,如果有人找到更好的方法来过滤该 html,请注意我。

于 2016-08-01T17:44:20.497 回答