ExtractContent for PHP

About ExtractContent

ExtractContent extracts content from HTML.
It is a PHP rewrite of the original Ruby and Perl implementations:

Syuyo Nakatani (Ruby): http://labs.cybozu.co.jp/blog/nakatani/downloads/extractcontent.rb
Ina Lintaro (Perl): http://search.cpan.org/dist/HTML-ExtractContent/lib/HTML/ExtractContent.pm

Getting Started

Clone the repo with git clone git://github.com/aoiaoi/ExtractContent.git, or download the latest release.

Performing Basic Usage

Example #1 Instantiating an ExtractContent object

<?php
require_once 'ExtractContent.php';

$extractor = new ExtractContent();

Example #2 Extract content from HTML

$extractor = new ExtractContent();
$extractor->extract($html);
echo $extractor->asText();
// To retrieve the content as HTML instead:
echo $extractor->asHtml();
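
The example above assumes that $html already holds the page source as a string. As a minimal sketch of one way to get there, the helper below reads a local file and passes its contents to the extractor. The function name extractTextFromFile and its error handling are hypothetical and not part of ExtractContent; it relies only on the extract() and asText() methods listed in this README.

```php
<?php
// Hypothetical helper, not part of ExtractContent itself.
// $extractor is any object exposing extract() and asText(),
// for example an ExtractContent instance.
function extractTextFromFile($extractor, $path)
{
    // Fail early if the path does not point at a readable local file.
    if (!is_file($path) || !is_readable($path)) {
        throw new RuntimeException("Cannot read HTML file: $path");
    }

    // file_get_contents() returns false on failure.
    $html = file_get_contents($path);
    if ($html === false) {
        throw new RuntimeException("Failed to load HTML from: $path");
    }

    $extractor->extract($html);
    return $extractor->asText();
}
```

With ExtractContent.php loaded, a call might look like echo extractTextFromFile(new ExtractContent(), 'page.html'); where page.html is a placeholder file name.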

Example #3 Extract title from HTML

$extractor = new ExtractContent();
$extractor->extract($html);
echo $extractor->getTitle();

Example #4 Set parameters

$extractor = new ExtractContent(array('g_adsense' => true));

// Equivalent, using setOptions():
$extractor = new ExtractContent();
$extractor->setOptions(array('g_adsense' => true));

Configuration Parameters

Available Public Methods

ExtractContent::setOptions(array $options);
ExtractContent::extract(string $html);
ExtractContent::getTitle();
ExtractContent::asText();
ExtractContent::asHtml();
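
As a rough sketch of how these methods fit together, the wrapper below applies optional settings, runs one extraction, and collects the title, text, and HTML results in a single array. The function name summarizePage and the array keys are hypothetical and chosen for illustration only. The call order (setOptions(), then extract(), then the getters) mirrors the examples in this README rather than any documented requirement.

```php
<?php
// Hypothetical wrapper, not part of ExtractContent itself.
// $extractor is any object exposing the public methods listed above.
function summarizePage($extractor, $html, array $options = array())
{
    // Apply options only when the caller supplied some.
    if (!empty($options)) {
        $extractor->setOptions($options);
    }

    // Run the extraction once, then read each result.
    $extractor->extract($html);

    return array(
        'title' => $extractor->getTitle(),
        'text'  => $extractor->asText(),
        'html'  => $extractor->asHtml(),
    );
}
```

With ExtractContent.php loaded, $page = summarizePage(new ExtractContent(), $html); would make the results available as $page['title'], $page['text'], and $page['html'].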

Development Environment

PHP 5.4.1-rc2

Copyright

Copyright of the original implementation

License

The files in this archive are released under the New BSD license. You can find a copy of this license in LICENSE.txt.
