Parsing and extracting information from (possibly malformed) HTML/XML documents
TagSoup is a library for parsing HTML/XML. It supports the HTML 5 specification, and can be used to parse either well-formed XML, or unstructured and malformed HTML from the web. The library also provides useful functions to extract information from an HTML document, making it ideal for screen-scraping. Users should start from the "Text.HTML.TagSoup" module.
Release | Stable | Testing |
---|---|---|
Fedora Rawhide | 0.14.8-22.fc40 | - |
Fedora 40 | 0.14.8-22.fc40 | - |
Fedora 39 | 0.14.8-20.fc39 | - |
Fedora 38 | 0.14.8-18.fc38 | - |
EPEL 9 | 0.14.8-10.el9 | - |
EPEL 7 | 0.12.8-4.el7 | - |
You can contact the maintainers of this package via email at
ghc-tagsoup dash maintainers at fedoraproject dot org
.