html - 如何使用clojure解析html文件？

Question

我是clojure的新手，我需要一些例子。请告诉我如何使用 clojure 解析 html 文件？

score 17 · Accepted Answer

Enlive is a great tool for this. In short:

(ns foo.bar
  (:require [net.cgrand.enlive-html :as html]))

(defn fetch-page [url]
  (html/html-resource (java.net.URL. url)))

Here is a nice tutorial on using it both as a scraper/parser and as a template engine:

Here is a short example of scraping a page.

Another option is clj-tagsoup. Enlive also uses tagsoup, but in addition has a pluggable parser so you can add support for other parsers.

score 4 · Accepted Answer

Clojure's xml parsing library is there for you.

Parses and loads the source s, which can be a File, InputStream or String naming a URI. Returns a tree of the xml/element struct-map, which has the keys :tag, :attrs, and :content. and accessor fns tag, attrs, and content. Other parsers can be supplied by passing startparse, a fn taking a source and a ContentHandler and returning a parser

Or use enlive, it's framework fully on clojure or use Java based HtmlCleaner.

score 1 · Accepted Answer

HTML 解析器

clj-tagsoup clj
面包丁
山核桃clj cljs
图珀洛clj cljs
Webmine clj

来源 - https://www.clojure-toolbox.com

html - 如何使用clojure解析html文件？

3 回答 3

Related

Reference