Skip to content

teeranan/bigbang

 
 

Repository files navigation

BigBang

BigBang is a toolkit for studying communications data from collaborative projects. It currently supports analyzing mailing lists from Sourceforge, Mailman, or .mbox files.

Installation

BigBang depends on several scientific computing packages that you must first install on your system, which include:

You can use the Anaconda distribution to install numpy and matplotlib on almost any platform. This will also install the conda package management system, which you can use to complete installation. Note that Anaconda does not include Graphviz, so you will have to install that separately.

If you choose not to use Anaconda, you will have to install each of the above-mentioned packages for your platform. If you're using OS X these instructions may be helpful.

Once these dependencies are installed, you can install BigBang using either conda or pip.

conda installation

Run the following commands:

git clone https://github.com/sbenthall/bigbang.git
conda create -n bigbang python
cd bigbang
bash conda-setup.sh

pip installation

Run the following commands:

git clone https://github.com/sbenthall/bigbang.git
# optionally create a new virtualenv here
pip install -r requirements.txt
python setup.py develop

Usage

There are serveral IPython notebooks in the examples/ directory of this repository. To open them and begin exploring, run the following commands in the root directory of this repository:

source activate bigbang
ipython notebook examples/

Collecting from Mailman

BigBang comes with a script for collecting files from public Mailman web archives. An example of this is the scipy-dev mailing list page. To collect the archives of the scipy-dev mailing list, run the following command from the root directory of this repository:

python bin/collect_mail.py -u http://mail.scipy.org/pipermail/scipy-dev/

You can also give this command a file with several urls, one per line. One of these is provided in the examples/ directory.

python bin/collect_mail.py -f examples/urls.txt

Once the data has been collected, BigBang has functions to support analysis.

Git Information

A new branch of BigBang is collecting git commit information for projects. We can analyze a project using both its mail and gir information to answer new questions about development.

Collecting git information

As of now, the git collection clones targeted repos into '<./git_data/sample_git_repos>' which can take some time. After cloning these repos, &Repo.py& will collect information from these them. It works very similarly to mail collection. While in the bigbang directory, run

python bin/collect_git.py -u https://github.com/scipy/scipy.git

You can also give this command a file with several urls, one per line. One of these is provided in the examples/ directory.

python bin/collect_mail.py -f examples/git_urls.txt

Community

If you are interested in participating in BigBang development, please subscribe to the BigBang-dev mailing list.

If you are using BigBang and would like support from the core development team, please address your questions to the BigBang-user mailing list.

License

GPLv2, see LICENSE for its text.

About

listserve analysis toolkit

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 99.3%
  • Shell 0.7%