Joined on Aug 2, 2011
forked from codelucas/newspaper
News, full-text, and article metadata extraction in Python 2.6 - 3.4.
forked from iPoli/intentive
Use Machine Learning to turn natural language text into structured data
forked from facebook/Stack-RNN
This is the code used for the paper "Inferring algorithmic patterns with a stack augmented recurrent network", by Armand Joulin and Tomas Mikolov.
forked from BarakOshri/TextualReconstructor
Training autoencoders to reconstruct text to generate valuable summarizations of sentences or paragraphs
forked from 3Top/word2vec-api
Simple web service providing a word embedding model
forked from xiaozhouwang/kaggle_Microsoft_Malware
Code for the Kaggle Microsoft Malware Classification competition
forked from tsroten/pynlpir
A Python wrapper around the NLPIR/ICTCLAS Chinese segmentation software.
forked from idio/wiki2vec
Generating Vectors for DBpedia Entities via Word2Vec and Wikipedia Dumps
forked from zhangxiangxiao/Crepe
Character-level text classification using convolutional networks
forked from benanne/kaggle-ndsb
Winning solution for the National Data Science Bowl competition on Kaggle (plankton classification)
forked from largelymfs/topical_word_embeddings
Demo code for topical word embeddings
forked from scikit-learn-contrib/lightning
Large-scale linear classification, regression and ranking in Python
forked from sancha/jrae
I re-implemented a semi-supervised recursive autoencoder in Java. I think it is a pretty nice technique. Check it out! Or fork it.
forked from larsga/Duke
Duke is a fast and flexible deduplication engine written in Java
forked from feeeermendoza/we-work-with-mammograms
A system that facilitates the analysis of mammograms
forked from pcmanus/ccm
A script to easily create and destroy an Apache Cassandra cluster on localhost
forked from ivan-vasilev/neuralnetworks
Java deep neural networks with GPU acceleration
forked from jexp/neo4j-activity-stream
Server extension implementation for a simple activity stream example
forked from socialsensor/twitter-dataset-collector
Helps with the distribution of Twitter datasets by downloading sets of tweets (if still available) using their ids as input.
forked from typesafehub/playconf
Sample application for the introductory Play/Java online training course
forked from DrewEaster/realtime-search
Demonstration using Play!, Akka, AngularJS and Elasticsearch to perform real-time log entry search
forked from abwaters/cryptsy-api
Small fast API for the Cryptsy crypto-currency exchange.
forked from nativelibs4java/nativelibs4java
Repository for all NativeLibs4Java projects: JavaCL, ScalaCL, BridJ, JNAerator...
forked from airlift/slice
Java library for efficiently working with heap and off-heap memory
forked from JorenSix/TarsosLSH
A Java library implementing Locality-sensitive Hashing (LSH), a practical nearest neighbour search algorithm for multidimensional vectors that operates in sublinear time.
forked from L-Y/socialfunnel
Create brand awareness, manage leads, respond to customers and megger your company's digital presence.
forked from rochacbruno/Movuca
Movuca - web2py powered social CMS (this project is not updated/maintained anymore, if you want to adopt it, let me know, then I can transfer ownership)
forked from snikolov/rumor
Nonparametric timeseries classification for Twitter trending topic detection (MEng thesis)
forked from RestExpress/RestExpress
RestExpress is the easiest way to create RESTful web services in Java. An extremely Lightweight, Fast, REST Engine and API for Java. Supports JSON and XML serialization automagically as well as ISO 8601 date formats. A thin wrapper on Netty IO HTTP handling, RestExpress lets you create performant, stand-alone REST web services rapidly. Works 'ex…
forked from teambox/teambox
This is the legacy version of Teambox - the award-winning collaboration solution, inspired by Basecamp, Yammer and Twitter.
forked from jroper/play-demo-twitbookplus
Play Java demo application, using guice, mongodb, mongojack, webjars, requirejs, knockoutjs and bootstrap
forked from benhamner/GEFlightQuest
Data transformation code and benchmarks for GE Flight Quest
forked from BlueMountainCapital/riemann-cassandra
Riemann tool for Cassandra
forked from myditto/beautiful-visualization
Examples of visualization by (mainly) Processing
forked from amki/Dreambox-Controller
Program that interacts with the Enigma 2 Interface used by Dreamboxes
forked from twitter/elephant-twin
Elephant Twin is a framework for creating indexes in Hadoop
forked from urbanairship/datacube
Multidimensional data storage with rollups for numerical data
forked from atduskgreg/Processing-Shader-Examples
Experiments working with shaders in Processing. Click the link below for formatted notes.
forked from ngharo/Dynamic-Zone-OCR
Create OCR templates for parsing standard documents. Created at the 28Hour BuildHealth Hack-a-Thon in Milwaukee, WI April 16th 2012
forked from nchandra/ExponentialSmoothing
Implementation of Holt-Winters triple exponential smoothing and other methods.
forked from mulesoft/linkedin-connector
LinkedIn is a business-related social networking site. Founded in December 2002 and launched in May 2003, it is mainly used for professional networking. This connector allows you to interact with the LinkedIn API.
forked from sidbatra/twitter-follower-recommender
Adaptive Locality Sensitive Hashing for Recommending Twitter Followers using MapReduce
forked from cloudera/matching
A distributed weighted matching algorithm implemented on top of Apache Giraph
forked from twitter/cassovary
Cassovary is a simple big graph processing library for the JVM from Twitter
forked from cestella/SpatialSearch
Uses Locality Sensitive Hashing to provide a spatial search on top of any distributed or non-distributed key-value store
forked from danielnegri/mkcrm-rails
Open-source project CRM software, available online for free. Inspired by SalesForce, Teambox, and Spree.
forked from maheskrishnan/HQLRunner
Simple Java Swing Program for running your HQL and SQL Queries.
forked from nathanmarz/storm
Distributed and fault-tolerant realtime computation: stream processing, continuous computation, distributed RPC, and more
forked from LanceNorskog/LSH-Hadoop
Implementation of Tyler Neylon's Locality-Sensitive Hash based on simplex tessellations
forked from paramsethi/linkedInDemo
Application demonstrating usage of the Authenticate, View Profile, Search Connections and View Their Profile, and Signout functionalities of the LinkedIn API.
forked from mostafa-ead/Starfish
Starfish is a self-tuning system for big data analytics. Starfish builds on Hadoop while adapting to user needs and system workloads to provide good performance automatically, without any need for users to understand and manipulate the many tuning knobs in Hadoop.
forked from aching/Giraph
Graph processing infrastructure that runs on Hadoop (see Pregel)
forked from reines/persistenthashmap
A disk-based HashMap implementation allowing persistence of data across sessions.
forked from everpeace/minwise-lsh
An implementation of locality-sensitive hashing using the min-wise permutation family.
forked from cloudian/logprocessing
Log processing system using Flume and Cassandra
forked from claudiomartella/Graph-ish
Graph-ish, a distributed graph database based on DHTs. ... so it should be called Hash-ish :-)
A Java library for creating, managing and searching indexes on a Cassandra database
forked from protovis/protovis-java
A Java port of Protovis, the visualization toolkit.
Mele is a Lucene index manager that manages replication, provides local synchronization, and index locking through ZooKeeper.