A collection of utilities for downloading and analyzing browser extensions from the Chrome Web Store.
- crawler: A crawler for extensions from the Chrome Web Store.
- crx-tool: A tool for analyzing and extracting *.crx files (i.e., Chrome extensions). Calling crx-tool.py <extension>.crx will check the integrity of the extension.
- crx-extract: A simple tool for extracting *.crx files from the tar-based archive hierarchy.
- create-db: A tool for updating a remote MariaDB from already existing extension archives.
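As an illustration of what an integrity check on a *.crx file can involve, the following is a minimal sketch and not the code used by crx-tool, whose exact checks are not described here. It relies on the publicly documented CRX container layout: the magic bytes "Cr24", a little-endian version number, a header whose length is stored up front, and then an ordinary ZIP archive. The function names split_crx and zip_is_intact are hypothetical, and the sketch does not verify the cryptographic signature.

```python
import io
import struct
import zipfile
from typing import Tuple


def split_crx(data: bytes) -> Tuple[int, bytes]:
    """Return (format_version, zip_payload) for the raw bytes of a *.crx file."""
    if len(data) < 12 or data[:4] != b"Cr24":
        raise ValueError("not a CRX file: missing 'Cr24' magic")
    (version,) = struct.unpack_from("<I", data, 4)
    if version == 2:
        if len(data) < 16:
            raise ValueError("truncated CRX2 header")
        # CRX2: public-key length and signature length follow the version.
        key_len, sig_len = struct.unpack_from("<II", data, 8)
        offset = 16 + key_len + sig_len
    elif version == 3:
        # CRX3: one length field covers the whole signed header.
        (header_len,) = struct.unpack_from("<I", data, 8)
        offset = 12 + header_len
    else:
        raise ValueError("unsupported CRX version: {}".format(version))
    if offset > len(data):
        raise ValueError("truncated CRX file")
    return version, data[offset:]


def zip_is_intact(payload: bytes) -> bool:
    """Check that the payload is a readable ZIP whose member CRCs all match."""
    try:
        with zipfile.ZipFile(io.BytesIO(payload)) as archive:
            return archive.testzip() is None
    except zipfile.BadZipFile:
        return False
```

A caller would read the file in binary mode, pass the bytes to split_crx, and then hand the returned payload to zip_is_intact.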
The utilities store the extensions in the following directory hierarchy:
archive
├── conf
│   └── forums.conf
├── data
│   └── ...
└── log
    └── ...
The crawler downloads the most recent extension (i.e., the *.crx file) as well as the overview page. In addition, the conf directory may contain one file, called forums.conf, that lists the ids of extensions for which the forums and support pages should be downloaded as well. The data directory will contain the downloaded extensions.
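The layout and the role of forums.conf can be sketched as follows. This is an illustrative example under stated assumptions, not the crawler's own code: the helper names init_archive and read_forum_ids are hypothetical, and the sketch assumes that forums.conf holds one extension id per line, which is not specified above. It does use the fact that Chrome extension ids consist of 32 lowercase letters in the range a to p.

```python
import re
from pathlib import Path
from typing import List

# Chrome extension ids are 32 characters drawn from the letters a-p.
EXTENSION_ID = re.compile(r"[a-p]{32}")


def init_archive(root: Path) -> None:
    """Create the conf, data, and log directories below the archive root."""
    for name in ("conf", "data", "log"):
        (root / name).mkdir(parents=True, exist_ok=True)


def read_forum_ids(root: Path) -> List[str]:
    """Return the ids listed in conf/forums.conf (assumed: one id per line)."""
    conf = root / "conf" / "forums.conf"
    if not conf.is_file():
        # The file is optional, so a missing file means no forum downloads.
        return []
    ids = []
    for line in conf.read_text().splitlines():
        line = line.strip()
        # Skip blank lines and anything that is not a well-formed id.
        if EXTENSION_ID.fullmatch(line):
            ids.append(line)
    return ids
```

Treating a missing forums.conf as an empty list mirrors the statement that the conf directory "may" contain the file.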
The create-db scripts will access and update a MariaDB database.
They will use the host, database, and credentials found in
Since they make use of various JSON features, it is recommended to use at
least version 10.2.8 of MariaDB.
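To make the version recommendation concrete, here is a small sketch of a check that compares a server version string against 10.2.8. It is an assumption-laden example rather than part of the utilities: the names MIN_VERSION, parse_version, and meets_recommendation are hypothetical, and the sketch assumes a version string that starts with three dot-separated numbers, such as "10.2.8-MariaDB".

```python
import re
from typing import Tuple

# Recommended minimum taken from the text above.
MIN_VERSION = (10, 2, 8)


def parse_version(text: str) -> Tuple[int, ...]:
    """Extract the leading major.minor.patch numbers from a version string."""
    match = re.match(r"(\d+)\.(\d+)\.(\d+)", text)
    if match is None:
        raise ValueError("unrecognized version string: {!r}".format(text))
    return tuple(int(part) for part in match.groups())


def meets_recommendation(text: str) -> bool:
    """Return True if the version is at least the recommended 10.2.8."""
    # Tuples compare numerically element by element, so 10.11.x > 10.2.x.
    return parse_version(text) >= MIN_VERSION
```

Comparing integer tuples instead of strings avoids the pitfall that "10.11" sorts before "10.2" lexicographically.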
All utilities are written in Python 3.6. The required modules are listed
in the file
Clone and use pip3 to install as a package.
git clone email@example.com:BrowserSecurity/ExtensionCrawler.git
pip3 install --user -e ExtensionCrawler
- Mehmet Balande
This project is licensed under the GPL 3.0 (or any later version).
The master git repository for this project is hosted by the Software Assurance & Security Research Team at https://git.logicalhacking.com/BrowserSecurity/ExtensionCrawler.