a simple, tolerant & efficient HTML/XML parser (with HTML in mind though)
bench initial commit 2014-06-02 21:22:35 +02:00
example improve the DOM module's correctness 2014-06-12 14:13:05 +02:00
html_files initial commit 2014-06-02 21:22:35 +02:00
src/Text improve the DOM module's correctness 2014-06-12 14:13:05 +02:00
.travis.yml travis 2014-06-05 14:13:06 +02:00
LICENSE initial commit 2014-06-02 21:22:35 +02:00
README.md Create README.md 2014-06-02 21:25:35 +02:00
Setup.hs initial commit 2014-06-02 21:22:35 +02:00
taggy.cabal improve the DOM module's correctness 2014-06-12 14:13:05 +02:00

taggy

An attoparsec-based HTML parser.

Currently very much a work in progress. It still chokes on some HTML, although it already handles a fairly decent range of common websites.

The performance is quite promising for now, but we don't do a lot of the things that tagsoup does, like converting `&amp;` to `&`, etc.
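To make the missing entity conversion concrete, here is a minimal sketch of what such a decoding pass could look like. It is not taggy's or tagsoup's code: `decodeEntities` and the `entities` table are hypothetical names, the table covers only four named entities, and it works on plain `String` using `base` alone.

```haskell
import Data.List (stripPrefix)

-- Hypothetical table of a few named entities (without the leading '&').
-- Real HTML defines many more, plus numeric references like &#38;.
entities :: [(String, Char)]
entities =
  [ ("amp;", '&')
  , ("lt;", '<')
  , ("gt;", '>')
  , ("quot;", '"')
  ]

-- Replace known named entities in a single left-to-right pass.
-- Anything unrecognized after '&' is left untouched.
decodeEntities :: String -> String
decodeEntities [] = []
decodeEntities ('&' : rest) =
  case [ (c, rest') | (name, c) <- entities
                    , Just rest' <- [stripPrefix name rest] ] of
    ((c, rest') : _) -> c : decodeEntities rest'
    []               -> '&' : decodeEntities rest
decodeEntities (c : cs) = c : decodeEntities cs
```

In GHCi, `decodeEntities "fish &amp; chips"` evaluates to `"fish & chips"`. Because the pass is single and left-to-right, a doubly escaped `&amp;lt;` becomes `&lt;` rather than `<`.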