knyga/webextractor

WebExtractor

WebExtractor extracts data from web pages using different extractors, such as CSS selectors, XPath, and regular expressions.

Example

Code:

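<?php

// Load Composer's autoloader (path assumes this script sits in the project root).
require __DIR__ . '/vendor/autoload.php';

use WebExtractor\DataExtractor\DataExtractorFactory;
use WebExtractor\DataExtractor\DataExtractorTypes;
use WebExtractor\Client\Client;

// Create a CSS-selector extractor through the factory.
$factory = DataExtractorFactory::getFactory();
$extractor = $factory->createDataExtractor(DataExtractorTypes::CSS);

// Fetch the page and hand its content to the extractor.
$client = new Client();
$content = $client->get('https://en.wikipedia.org/wiki/2014_Winter_Olympics');
$extractor->setContent($content);

// Extract whatever the "h1" selector matches.
$h1 = $extractor->setSelector('h1')->extract();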

For more examples, see the tests.
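The example above covers only the CSS extractor. To give a rough idea of what XPath-based and regex-based extraction amount to, here is a minimal sketch that uses only PHP's bundled DOM extension and `preg_match_all`. It does not use WebExtractor's classes, since this README does not show their XPath or regex API, and the sample HTML string is made up for illustration.

```php
<?php

// Illustration only: plain PHP, not WebExtractor's API.
// The HTML below is a made-up sample standing in for a fetched page.
$html = '<html><body><h1>2014 Winter Olympics</h1>'
      . '<p>Held in <a href="/wiki/Sochi">Sochi</a>.</p></body></html>';

// Collect libxml warnings internally instead of printing them;
// real-world HTML is rarely perfectly valid.
libxml_use_internal_errors(true);
$doc = new DOMDocument();
$doc->loadHTML($html);
libxml_clear_errors();

// XPath extraction: the text of every <h1> element.
$xpath = new DOMXPath($doc);
$headings = [];
foreach ($xpath->query('//h1') as $node) {
    $headings[] = trim($node->textContent);
}

// Regex extraction: every double-quoted href value in the raw markup.
preg_match_all('/href="([^"]+)"/', $html, $matches);
$links = $matches[1];

print_r($headings);
print_r($links);
```

XPath works on the parsed document tree, so it is the more robust choice for structured markup. A regex works on the raw string and suits simple, well-delimited patterns.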

Installation via Composer

  • Install Composer in your project root:

    curl -sS https://getcomposer.org/installer | php
  • Add a composer.json file to your project:

    {
      "require": {
        "knyga/webextractor": "1.1.2.*@dev"
      }
    }
  • Run the Composer installer:

    php composer.phar install

License

WebExtractor is licensed under the MIT license.

Oleksandr Knyga oleksandrknyga@gmail.com

Sobit Akhmedov sobit.akhmedov@gmail.com
