mirror of https://github.com/alpmestan/taggy.git synced 2024-08-16 10:20:30 +03:00

a simple, tolerant & efficient HTML/XML parser (with HTML in mind though)

Go to file

Alp Mestanogullari af8e70bdd6 improve the DOM module's correctness		2014-06-12 14:13:05 +02:00
bench	initial commit	2014-06-02 21:22:35 +02:00
example	improve the DOM module's correctness	2014-06-12 14:13:05 +02:00
html_files	initial commit	2014-06-02 21:22:35 +02:00
src/Text	improve the DOM module's correctness	2014-06-12 14:13:05 +02:00
.travis.yml	travis	2014-06-05 14:13:06 +02:00
LICENSE	initial commit	2014-06-02 21:22:35 +02:00
README.md	Create README.md	2014-06-02 21:25:35 +02:00
Setup.hs	initial commit	2014-06-02 21:22:35 +02:00
taggy.cabal	improve the DOM module's correctness	2014-06-12 14:13:05 +02:00

taggy

An attoparsec based html parser.

Currently very WIP. It even chokes on some HTML, although it already supports a fairly decent range of common websites.

The performance is quite promising for now, but we don't do a lof of things that tagsoup does, like converting & to &, etc.