Example source code accompanying O'Reilly's "Programming Elastic MapReduce: Using AWS services to build an end-to-end application" by Christopher Phillips and Kevin Schmidt
-
Updated
May 10, 2014 - Java
Example source code accompanying O'Reilly's "Programming Elastic MapReduce: Using AWS services to build an end-to-end application" by Christopher Phillips and Kevin Schmidt
A map-reduce implementation in Apache Hadoop (AWS EMR) for calculating the probabilities of trigrams in the Hebrew language. This project utilizes the deleted estimation two-way cross validation method to calculate trigram probabilities. The Google Hebrew Trigram database serves as this project's corpus.
OpenPCM Server
Project of the final year of my engineering courses
Built a distributed system which completes several objectives with given data to generate loan reports using Amazon Web Services, Apache Spark, Java and Python.
In this project, we try to reproduce the paper 'Comparing Measures of Semantic Similarity' by Nikola Ljubešić et al. which aims at comparing different methods for automatic extraction of semantic similarity mesaures from a corpus.
A map-reduce implementation in Apache Hadoop (AWS EMR) for calculating the probabilities of trigrams in the Hebrew language. This project utilizes the deleted estimation two-way cross validation method to calculate trigram probabilities. The Google Hebrew Trigram database serves as this project's corpus.
Assignment 2 of the course 'Distributed Systems Programming' by Meni Adler. In the assignment we build an application that calculates the probabilities for any word to come after a couple of words, for ANY couple of words in the n-gram corpus (google).
Application that automatically extracts collocations from the Google 2-grams dataset using Amazon Elastic Map Reduce
An Annotation Tool Designed for Health Unstructured Data (标注工具)
One Ring is a framework to unify, unite and bind Apache Spark-based computing modules, and run them in parametrized chains
Add a description, image, and links to the emr topic page so that developers can more easily learn about it.
To associate your repository with the emr topic, visit your repo's landing page and select "manage topics."