Skip to content

Shingho is a PySpark based statistical library designed for Big Data applications.

Notifications You must be signed in to change notification settings

snazrul1/Shingho

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Shingho

Introduction

Shingho is a robust, Python and Spark based statistical library designed for Big Data applications.

Special features of the Shingho statistical library:

  • Multithreading capabilities for greater parallelization
  • Leverages both SQL and MapReduce operations for faster processing

Dependencies

  1. Python 2.7+
  2. Spark 1.6+ (2.0.0+ recommended)
  3. Anaconda 4.3+

Installation Guide

python setup.py --install

User Guide

  • Tutorials for using Shingho can be found here.

Contributing

  • Refer to the Developer's Guide to help you get started on contributing to our project.

About

Shingho is a PySpark based statistical library designed for Big Data applications.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages