Extracting data from web page with different extractors like css, xpath, regex...
Code:
<?php
use WebExtractor\DataExtractor\DataExtractorFactory;
use WebExtractor\DataExtractor\DataExtractorTypes;
use WebExtractor\Client\Client;
$factory = DataExtractorFactory::getFactory();
$extractor = $factory->createDataExtractor(DataExtractorTypes::CSS);
$client = new Client;
$content = $client->get('https://en.wikipedia.org/wiki/2014_Winter_Olympics');
$extractor->setContent($content);
$h1 = $extractor->setSelector('h1')->extract();
For more look tests.
Installation via Composer
-
Install Composer to your project root:
curl -sS https://getcomposer.org/installer | php
-
Add a
composer.json
file to your project:{ "require": { "knyga/webextractor": "1.1.2.*@dev" } }
-
Run the Composer installer:
php composer.phar install
WebExtractor is licensed under the MIT license.
Oleksandr Knyga oleksandrknyga@gmail.com
Sobit Akhmedov sobit.akhmedov@gmail.com