GitHub - sdpython/pysqllike: Tentative to design map/reduce jobs (SQL, Pig)

pysqllike: pseudo map/reduce in python

The project is not actively developed.

Writing a map/reduce job (using PIG, for example) usually requires switching from local files to remote files (on Hadoop). One way to work is to extract a small sample of the data that the map/reduce job will process. The job is then developed locally and, once it works, run in a parallelized environment.

The goal of this extension is to allow this job to be implemented using Python syntax, as follows:

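def myjob(input):
    # Keep age and nom, and add the derived column age2 = age * age.
    iter = input.select(input.age, input.nom, age2=input.age * input.age)
    # Keep only the rows whose age is above 60 or below 25.
    wher = iter.where((iter.age > 60).Or(iter.age < 25))
    return wher

# IterRow is provided by pysqllike. The sample rows assume that each row
# is a dictionary with the fields nom and age.
input = IterRow(None, [{"nom": "nom", "age": 10}, {"nom": "jean", "age": 40}])
output = myjob(input)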
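
To make the intent of the job concrete, here is a minimal sketch of the same selection and filter written with plain Python generators. It does not use pysqllike, and it is not how the library is implemented. The helper names (select_rows, where_rows, myjob_plain) and the sample rows, including the extra row "paul", are assumptions made for illustration.

```python
def select_rows(rows):
    """Project each row to age, nom and the derived column age2 = age * age."""
    for row in rows:
        yield {"age": row["age"], "nom": row["nom"], "age2": row["age"] * row["age"]}


def where_rows(rows):
    """Keep only the rows whose age is above 60 or below 25."""
    for row in rows:
        if row["age"] > 60 or row["age"] < 25:
            yield row


def myjob_plain(rows):
    # Hypothetical stand-in for the job: a projection followed by a filter.
    return list(where_rows(select_rows(rows)))


# Hypothetical sample data; each row is assumed to have the fields nom and age.
sample = [
    {"nom": "nom", "age": 10},
    {"nom": "jean", "age": 40},
    {"nom": "paul", "age": 70},
]
result = myjob_plain(sample)
```

With this sample, the rows with ages 10 and 70 pass the filter and the row with age 40 is dropped.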

When the job is ready, it can be translated into a PIG job:

input = LOAD '...' USING PigStorage('\t') AS (nom, age);
iter = FOREACH input GENERATE age, nom, age*age AS age2 ;
wher = FILTER iter BY age > 60 or age < 25 ;
STORE wher INTO '...' USING PigStorage();

It should also be translated into SQL.
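
As a rough illustration of what an SQL version of the job could look like, the sketch below runs a hand-written query with Python's standard sqlite3 module. The query is not produced by pysqllike. The table name input_rows and the sample rows are assumptions made for this example.

```python
import sqlite3

# In-memory database holding a small, hypothetical sample of the data.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE input_rows (nom TEXT, age INTEGER)")
conn.executemany(
    "INSERT INTO input_rows (nom, age) VALUES (?, ?)",
    [("nom", 10), ("jean", 40), ("paul", 70)],
)

# Hand-written SQL equivalent of the job: projection, derived column, filter.
query = """
SELECT age, nom, age * age AS age2
FROM input_rows
WHERE age > 60 OR age < 25
ORDER BY age
"""
sql_rows = conn.execute(query).fetchall()
conn.close()
```

The SELECT clause plays the role of FOREACH ... GENERATE in the PIG script, and the WHERE clause plays the role of FILTER ... BY.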
