Skip to content

HTTPS clone URL

Subversion checkout URL

You can clone with HTTPS or Subversion.

Download ZIP
A Web Bot that crawls the http://Schema.org web site to retrieve all available Types and Properties in order to create a JSON file and also some PHP libraries.
PHP
branch: master
Failed to load latest commit information.
bin Update copywrite year
dist Update library, Autoload fix, Create bin cli
src Update copywrite year
tests Update copywrite year
.gitignore Update library, Autoload fix, Create bin cli
.travis.yml Changed the License
LICENSE Changed the License
README.md Update library, Autoload fix, Create bin cli
composer.json
phpunit.xml Changed the License

README.md

Spider4Schema Build Status

A Web Bot that crawls the http://Schema.org web site to retrieve all available Types and Properties in order to create a JSON file and also some PHP libraries.
For generating Microdata or RDFa Lite 1.1 semantics you can use the PHP library https://github.com/PAlexcom/PHPStructuredData.
Created during the Google Summer of Code 2013 and 2014.

Documentation

Files structure:

  • configuration.php → the configuration file, setup the type of library to be created.
  • http.php → a class that handles all HTTP requests.
  • parser.php → methods to parse the HTML and retrieve all needed information.
  • fileCreator.php → methods to create the library files.

Usage

  • Make sure you have the cURL library installed, and the PHP CLI shell script package
  • Clone the repo: git clone https://github.com/PAlexcom/Spider4Schema.git
  • Enter Spider4Schema/ directory
  • Open your terminal/shell and call php bin/spider.php [minified|json|normal] [true|false|verbose]

The libraries will be created in the dist/ folder.

Library types

There are 3 types of libraries you can create:

  • JSON → a .json file containing all available Types and Properties, used in library https://github.com/PAlexcom/PHPStructuredData for generating valid Microdata and RDFa Lite 1.1 semantics
  • Minified → a .php file with an array containing all available Types and Properties
  • Normal → each Type is a PHP class file (an abstract class with static Properties)

Performance

The json library:
1 .json file, 91 KB, contains all available Types (620+) and its Properties

The minified library:
1 php file, 107 KB, contains all available Types (620+) and its Properties, stored in a hash table (array)

The normal abstract static library:
622 php files, 710 KB, 1 file for each available Type

Todos

License

Spider4Schema is licensed under the MIT License – see the LICENSE file for details.

Something went wrong with that request. Please try again.