Skip to content
Goutte, a simple PHP Web Scraper
Find file
New pull request
Latest commit 3cbc6ed @fabpot fabpot feature #244 allow installation of this component with symfony3 (benj…

This PR was merged into the 3.1-dev branch.


allow installation of this component with symfony3

I create this PR so I can run my Behat test with Symfony3 and the GoutteDriver


2b4e1d8 allow installation with symfony3


Goutte, a simple PHP Web Scraper

Goutte is a screen scraping and web crawling library for PHP.

Goutte provides a nice API to crawl websites and extract data from the HTML/XML responses.


Goutte depends on PHP 5.5+ and Guzzle 6+.


If you need support for PHP 5.4 or Guzzle 4-5, use Goutte 2.x (latest phar).

If you need support for PHP 5.3 or Guzzle 3, use Goutte 1.x (latest phar).


Add fabpot/goutte as a require dependency in your composer.json file:

composer require fabpot/goutte


Create a Goutte Client instance (which extends Symfony\Component\BrowserKit\Client):

use Goutte\Client;

$client = new Client();

Make requests with the request() method:

// Go to the website
$crawler = $client->request('GET', '');

The method returns a Crawler object (Symfony\Component\DomCrawler\Crawler).

Fine-tune cURL options:

$client->getClient()->setDefaultOption('config/curl/'.CURLOPT_TIMEOUT, 60);

Click on links:

// Click on the "Security Advisories" link
$link = $crawler->selectLink('Security Advisories')->link();
$crawler = $client->click($link);

Extract data:

// Get the latest post in this category and display the titles
$crawler->filter('h2 > a')->each(function ($node) {
    print $node->text()."\n";

Submit forms:

$crawler = $client->request('GET', '');
$crawler = $client->click($crawler->selectLink('Sign in')->link());
$form = $crawler->selectButton('Sign in')->form();
$crawler = $client->submit($form, array('login' => 'fabpot', 'password' => 'xxxxxx'));
$crawler->filter('.flash-error')->each(function ($node) {
    print $node->text()."\n";

More Information

Read the documentation of the BrowserKit and DomCrawler Symfony Components for more information about what you can do with Goutte.


Goutte is pronounced goot i.e. it rhymes with boot and not out.

Technical Information

Goutte is a thin wrapper around the following fine PHP libraries:

  • Symfony Components: BrowserKit, CssSelector and DomCrawler;
  • Guzzle HTTP Component.


Goutte is licensed under the MIT license.

Something went wrong with that request. Please try again.