I'm looking for a spec on handling HTML entities in the href
attribute of <a>
tags. So far, no luck (I might be searching for something too specific).
In detail:
The bug I'm trying to fix is part of the cheerio project.
Some entities don't require a semicolon at the end. One of them is ¤
. Anyway, this leads to problems when a source links to /test/example.jsp?item=123¤tSize=S¤tQty=1
.
Browsers (at least Chrome) handle this nicely. I still haven't figured out why though.