python3-html5-parser 0.4.10-8 (amd64 binary) in ubuntu lunar
A fast implementation of the HTML 5 parsing spec for Python. Parsing is
done in C using a variant of the gumbo parser. The gumbo parse tree is
then transformed into an lxml tree, also in C, yielding parse times that
can be a thirtieth of the html5lib parse times. That is a speedup of 30x.
This differs, for instance, from the gumbo python bindings, where the
initial parsing is done in C but the transformation into the final
tree is done in python.
Details
- Package version:
- 0.4.10-8
- Status:
- Superseded
- Component:
- universe
- Priority:
- Optional
Downloadable files
amd64 build of html5-parser 0.4.10-8 in ubuntu lunar PROPOSED produced
these files:
- python3-html5-parser_0.4.10-8_amd64.deb (143.1 KiB)