A system to manage machine learning models
fuzzy lazo + multi columns
Sketch and LSH Index library for Java, including OPH methods as well as the Lazo method
Website for NEDBDay
A database with automatic dynamic imputation of missing values.
XSystem: Extracting Syntactical Patterns from Databases
Deneva is a distributed in-memory database framework that supports the evaluation of various concurrency control algorithms.
Reading awesome papers in the DB world, one at a time
Mirror of Apache Spark
Code for GenBase: complex analytics based genomics benchmark
Visualize some MIT Wi-Fi access point data
Notes and Labs for Advanced Topics in Data Processing
MIT Big Data Challenge
back-end code for array browser joint demo
VoiceX is an open source platform designed to create an information ecosystem for people in the developing world, using low-end feature phones that can't connect to the Internet, to generate, manage, retrieve, and search information.
Better data integration
A timeline-based visualization of events as they are discussed on Twitter
hit layer infrastructure