Catmandu::HTML - Modules for handling HTML data within the Catmandu framework
Command line client catmandu
:
catmandu convert HTML to JSON < ex/test.html > data/test.json
catmandu convert JSON to HTML < data/test.json > ex/test.html
# Keep the text
curl -s http://librecat.org |
catmandu convert HTML to Text --fix "html_text()"
# Filter out the anchors
curl -s http://librecat.org |
catmandu convert HTML to Text --fix "html_filter_tag(a); html_text(join:\"\n\")"
# Filter out the meta tags as YAML
curl -s http://librecat.org |
catmandu convert HTML to YAML --fix "html_filter_tag(meta, group_by:name)"
See documentation of modules for more examples.
-
Serialize HTML data
-
Parse HTML data
-
Catmandu::Fix::html_filter_tag
Filter HTML by name and optional group them
-
Catmandu::Fix::html_filter_type
Filter HTML by type
-
Select only the literal text values from the HTML
This module is based on Catmandu
- Patrick Hochstenbach,
patrick.hochstenbach at ugent.be
The Perl software is copyright (c) 2018 by Patrick Hochstenbach. This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5 programming language system itself.