LibreCat/Catmandu-HTML

NAME

Catmandu::HTML - Modules for handling HTML data within the Catmandu framework

SYNOPSIS

Using the catmandu command-line client:

catmandu convert HTML to JSON < ex/test.html > data/test.json

catmandu convert JSON to HTML < data/test.json > ex/test.html

# Keep the text
curl -s http://librecat.org |
  catmandu convert HTML to Text --fix "html_text()"

# Extract only the anchors
curl -s http://librecat.org |
  catmandu convert HTML to Text --fix "html_filter_tag(a); html_text(join:\"\n\")"

# Extract only the meta tags, grouped by name, as YAML
curl -s http://librecat.org |
  catmandu convert HTML to YAML --fix "html_filter_tag(meta, group_by:name)"

See the documentation of the individual modules for more examples.

AVAILABLE MODULES

SEE ALSO

This module is based on Catmandu.

AUTHORS

  • Patrick Hochstenbach, patrick.hochstenbach at ugent.be

COPYRIGHT AND LICENSE

The Perl software is copyright (c) 2018 by Patrick Hochstenbach. This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5 programming language system itself.