A Python crawler that detects and gathers code from different repositories and stores it locally.

AwsafAlam/githubMiner


Description

Files to run: downloadRunner.py and finderRunner.py (no command-line parameters are needed).

Set Parameters

All necessary parameters should be given in Crawler/config.py.

Issues

  • If the database schema in DB.txt gives errors, use the following instead:

    CREATE TABLE `repos` (
      `id` int(20) NOT NULL,
      `url` varchar(200) NOT NULL,
      `language` varchar(20) DEFAULT NULL,
      `downloaded` int(11) DEFAULT NULL,
      PRIMARY KEY (`id`),
      UNIQUE KEY `repos_id_uindex` (`id`),
      UNIQUE KEY `repos_url_uindex` (`url`)
    ) ENGINE=InnoDB DEFAULT CHARSET=utf8;
    
  • Install the PyGithub and MySQL connector modules with these commands:

    1. pip install pygithub
    2. pip install mysql-connector
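
As a rough illustration of how the `repos` table above might be filled, here is a minimal sketch. It is not taken from the crawler itself: the names `INSERT_SQL`, `make_row`, and `save_rows` are hypothetical, and the connection settings are placeholders that would presumably mirror whatever is set in Crawler/config.py. It assumes the `mysql.connector` module installed above and uses MySQL's `INSERT IGNORE` so that a repository already present (duplicate `id` or `url`) is skipped rather than raising an error.

```python
# Hypothetical helper names; only the table and column names come from the schema above.
INSERT_SQL = (
    "INSERT IGNORE INTO repos (id, url, language, downloaded) "
    "VALUES (%s, %s, %s, %s)"
)


def make_row(repo_id, url, language=None, downloaded=0):
    """Build a parameter tuple that fits the columns of the repos table."""
    if len(url) > 200:
        # The url column is varchar(200), so longer values cannot be stored as-is.
        raise ValueError("url exceeds 200 characters")
    if language is not None:
        language = language[:20]  # the language column is varchar(20)
    return (int(repo_id), url, language, int(downloaded))


def save_rows(rows, **connect_kwargs):
    """Insert rows into repos; connect_kwargs are placeholder connection settings."""
    # Imported here so make_row can be used without the connector installed.
    import mysql.connector

    conn = mysql.connector.connect(**connect_kwargs)
    try:
        cursor = conn.cursor()
        cursor.executemany(INSERT_SQL, rows)
        conn.commit()
    finally:
        conn.close()
```

A call might look like `save_rows([make_row(1, "https://github.com/AwsafAlam/githubMiner", "Python")], host="localhost", user="...", password="...", database="...")`, with the credentials filled in for the local setup. Here `downloaded` is assumed to act as a 0/1 flag; the schema only says it is an integer.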
