Skip to content
Perl application which creates (RSS) web feeds from random webpages via HTML parsing.
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
inputs
modules
outputs
.gitattributes
FeedAny.pl
LICENSE
README.md

README.md

FeedAny

https://customsolvers.com/feed_any/ (ES: https://customsolvers.com/feed_any_es/) --- Video

Introduction

FeedAny generates web feeds from random pages via HTML parsing. The target web pages can have any structure, but are supposed to be web-feed friendly (i.e., regularly-updated sets of data on a well-structured layout).

Quick start guide

  • Make sure that both Perl and Wget are installed on your machine.
  • Include as many input files (*.fa) as you wish in the root "inputs" folder (some samples).
  • Run FeedAny (e.g., type "perl FeedAny.pl" in the command line).
  • Use the files generated in the root "outputs" folder as you would use any web feed (e.g., open them with a feed reader).

Input files (*.fa)

All the input files are expected to follow these rules:

  • The titles (before ":") aren't supposed to be modified.
  • The input values (after ":") have to verify the following formats: HTML ("Entry title", "Entry body" and "Entry additional"), URL ("Entry URL") or integer numeric ("Maximum number of entries").
  • Only supported HTML entities can be used. The attributes might be anything, but their values have to be surrounded by quotes (e.g., <div attribute='value'>). Only opening tags are expected. It is possible to include various nested entities (e.g., <div attr="whatever"><a>).
  • The URLs have to start with "http://" or "https://". In any other scenario, all the contents after "//" are assumed to be comments and are ignored.
  • All the entry inputs can include multiple constraints (e.g., contain "target") related through the logical (short-circuit) operators and/or.
  • The parsing algorithm tends to analyse the entry input information in a sequential fashion, from top to bottom.
  • It is possible to include as many additional inputs (i.e., "Entry additional") as desired.

Authorship & copyright

I, Alvaro Carballo Garcia (varocarbas), am the sole author of each single bit of this code.

Equivalently to what happens with all my other online contributions, this code can be considered public domain. For more information about my copyright/authorship attribution ideas, visit https://customsolvers.com/copyright/.

You can’t perform that action at this time.