mr

nodejs map-reduce

CouchDB-style-like map-reduce:CouchDB MapReduce

usage: world count example

  var fs = require('fs');
	var worldCounter = new MapReduce({
  	map: function(chunk){		
  		chunk.toString().split(/\W+|\d+/).forEach(function(world){			
  			world && this.emit(world.toLowerCase(), 1);
  		}, this);
  	},
  	reduce: function(key, values){
  		return this.count(values);
  	},
  	inputs: fs.readdirSync('./').map(fs.createReadStream),
  	fork: false //should forEach input fork a cluster.worker to do map job or not
  });
  
  worldCounter.run(function(result){
  	console.log(result);
  });

more think:

should do reduce during mapping rather than wait until mapping done?
use nodejs ChildProcess/Cluster fork to do map/reduce job?
for processing and generating large data sets with a parallel, distributed algorithm on a cluster? you may look for Hadoop

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
lib		lib
test		test
.gitignore		.gitignore
README.md		README.md
index.js		index.js

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

lib

lib

test

test

.gitignore

.gitignore

README.md

README.md

index.js

index.js

Repository files navigation

mr

About

Releases

Packages

Languages

lodengo/mr

Folders and files

Latest commit

History

Repository files navigation

mr

About

Resources

Stars

Watchers

Forks

Languages