Files to run: downloadRunner.py, finderRunner.py (no command line parameters needed)
All necessary parameters should be given in Crawler/config.py.
-
If the DB schema in DB.txt gives errors, use the following statement instead:
CREATE TABLE `repos` (
  `id` int(20) NOT NULL,
  `url` varchar(200) NOT NULL,
  `language` varchar(20) DEFAULT NULL,
  `downloaded` int(11) DEFAULT NULL,
  PRIMARY KEY (`id`),
  UNIQUE KEY `repos_id_uindex` (`id`),
  UNIQUE KEY `repos_url_uindex` (`url`)
) ENGINE=InnoDB DEFAULT CHARSET=utf8;
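
The schema's uniqueness rules (one row per id, one row per url) can be tried out without a MySQL server. The sketch below is only an illustration: it swaps in Python's built-in sqlite3 module with an equivalent table definition, and the helper name insert_repo is hypothetical, not part of the crawler.

```python
import sqlite3

# SQLite equivalent of the MySQL schema above (assumption: same columns,
# same uniqueness rules; MySQL-specific ENGINE/CHARSET options are dropped).
SCHEMA = """
CREATE TABLE repos (
    id INTEGER NOT NULL PRIMARY KEY,
    url VARCHAR(200) NOT NULL UNIQUE,
    language VARCHAR(20) DEFAULT NULL,
    downloaded INTEGER DEFAULT NULL
)
"""


def insert_repo(conn, repo_id, url, language=None):
    """Insert one repo row; return False if the id or url already exists.

    Hypothetical helper for illustration only. The 'downloaded' column is
    left at its default (NULL).
    """
    try:
        # 'with conn' commits on success and rolls back on error.
        with conn:
            conn.execute(
                "INSERT INTO repos (id, url, language) VALUES (?, ?, ?)",
                (repo_id, url, language),
            )
        return True
    except sqlite3.IntegrityError:
        # Raised when the primary key or the unique url constraint is violated.
        return False


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    conn.execute(SCHEMA)
    print(insert_repo(conn, 1, "https://github.com/example/repo", "Python"))
    # Same url under a different id is rejected by the unique constraint.
    print(insert_repo(conn, 2, "https://github.com/example/repo", "Python"))
```

With the real MySQL table the same duplicate insert would be rejected as well, though the connector reports it through its own error type rather than sqlite3.IntegrityError.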
-
Install the GitHub and MySQL connector modules with these commands:
- pip install pygithub
- pip install mysql-connector
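
To confirm the two installs worked before running the crawler, a quick check like the following may help. It is a minimal sketch, not part of the project: the function name missing_packages is hypothetical. Note that the import names differ from the pip names (pygithub is imported as github, mysql-connector as mysql.connector).

```python
import importlib.util

# pip package name -> name used in "import ..." statements
REQUIRED = {
    "pygithub": "github",
    "mysql-connector": "mysql.connector",
}


def missing_packages(required):
    """Return the pip names of packages whose import name cannot be found."""
    missing = []
    for pip_name, import_name in required.items():
        try:
            found = importlib.util.find_spec(import_name) is not None
        except (ImportError, ValueError):
            # find_spec raises if the parent package (e.g. "mysql") is absent.
            found = False
        if not found:
            missing.append(pip_name)
    return missing


if __name__ == "__main__":
    missing = missing_packages(REQUIRED)
    if missing:
        print("Missing packages, install with: pip install " + " ".join(missing))
    else:
        print("All required packages are importable.")
```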