Skip to content
This repository has been archived by the owner on May 17, 2018. It is now read-only.

mndrix/readability_parser

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Synopsis

:- use_module(library(readability_parser)).
?- build_agent("f861ea4...", Agent),
   parse(Agent, 'http://foo.com/article.html', Response).
Response = _{ author: "John Doe"
            , content: "A long time ago ..."
            , title: "A Fairy Tale"
            , word_count: 372
            ...
            }.

Description

Access Readability's parser API for extracting article content from an HTML page.

Changes in this Version

  • Workaround Readability SSL weirdness

Installation

Using SWI-Prolog 7.1.5 or later:

?- pack_install(readability_parser).

This module uses semantic versioning.

Source code available and pull requests accepted at http://github.com/mndrix/readability_parser

About

Access Readability's Parser API using Prolog

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages