mirror of
https://github.com/alpmestan/taggy.git
synced 2024-08-16 10:20:30 +03:00
a simple, tolerant & efficient HTML/XML parser (with HTML in mind though)
bench | ||
example | ||
html_files | ||
src/Text | ||
LICENSE | ||
README.md | ||
Setup.hs | ||
taggy.cabal |
taggy
An attoparsec based html parser.
Currently very WIP. It even chokes on some HTML, although it already supports a fairly decent range of common websites.
The performance is quite promising for now, but we don't do a lof of things that tagsoup does, like converting &
to &
, etc.