This is a website scraper that collects the URLs found on a site.

Trinity - Web Application URL Collector

Version 0.1
Author GERASIMOS KASSARAS (@lamehacker)
Copyright 2013 Gerasimos Kassaras
License Apache License Version 2.0


Trinity is a free, open source URL collector written for training purposes.

Trinity offers:

A stable, efficient, high-performance, simple Python URL collector.

Trinity is a simple proof-of-concept Python script that collects URLs from sites that neither require authentication nor use SSL.


To run Trinity and collect the URLs from your site, set the following variables to the desired site URL:

    urlList = [""] # Later on this url is going to be fed through command parser.
    host = ''
    domain = ''
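As an illustration only, the three variables could be filled in from a single start URL. The sketch below is not part of Trinity: the function name derive_settings is hypothetical, and it assumes that host means the hostname in the URL and that domain means the same hostname without a leading "www.". Trinity may interpret these variables differently.

```python
from urllib.parse import urlparse


def derive_settings(start_url):
    """Hypothetical helper: build urlList, host and domain from one URL."""
    parsed = urlparse(start_url)
    host = parsed.hostname or ""
    # Assumption: "domain" is the host without a leading "www." prefix.
    domain = host[4:] if host.startswith("www.") else host
    return {"urlList": [start_url], "host": host, "domain": domain}


if __name__ == "__main__":
    settings = derive_settings("http://www.example.com/index.html")
    print(settings["urlList"], settings["host"], settings["domain"])
```

The example uses a plain http URL because Trinity does not handle SSL.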

In simple terms, Trinity collects URLs from:

  • a HTML tags.
  • link HTML tags.
  • script HTML tags.
  • meta HTML tags.
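The tag list above can be sketched in a few lines of Python. This is a minimal illustration and not Trinity's own code. To stay dependency-free it uses the standard library html.parser module instead of BeautifulSoup. The names UrlCollector, collect_urls and URL_ATTRS are hypothetical, and the choice of attribute per tag (href, src, and the content of a meta refresh) is an assumption about where URLs usually live.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin

# Assumption: the attribute that usually carries the URL for each tag type.
URL_ATTRS = {"a": "href", "link": "href", "script": "src", "meta": "content"}


class UrlCollector(HTMLParser):
    """Hypothetical helper: gathers URLs from a, link, script and meta tags."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.urls = []

    def handle_starttag(self, tag, attrs):
        attr_name = URL_ATTRS.get(tag)
        if attr_name is None:
            return
        value = dict(attrs).get(attr_name)
        if not value:
            return
        if tag == "meta":
            # Only meta refresh values such as "5; url=/next" are treated as URLs.
            lowered = value.lower()
            if "url=" not in lowered:
                return
            value = value[lowered.index("url=") + 4:].strip()
        # Resolve relative links against the page URL and skip duplicates.
        absolute = urljoin(self.base_url, value)
        if absolute not in self.urls:
            self.urls.append(absolute)


def collect_urls(html, base_url):
    """Return the unique URLs found in html, in document order."""
    parser = UrlCollector(base_url)
    parser.feed(html)
    return parser.urls
```

Calling collect_urls(page_source, "http://example.com/") returns absolute URLs, so relative links such as /style.css are resolved against the page they were found on.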


The crawler Trinity is using is

HTML Parser

The HTML parser is based on BeautifulSoup version 4.

Documentation found in: Download:


You have to install BeautifulSoup. Instructions for that can be found here:


Trinity is licensed under the Apache License Version 2.0.


This is free software and you are allowed to use it as you see fit. However, neither the development team nor any of our contributors can be held responsible for your actions or for any damage caused by the use of this software.