Skip to content

abdalla1611/EMR-NLP

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

How to run :
go to the steps folder in the CMD and then write :

    - mvn package&&compile

then upload the jar file to AWS in the S3
after that run HadoopRunner in your local computer with the right credentials

*******************************************************

in the assignment we use techniques of duplicating information and reversing order to compute the probabilities. it take three steps :

    1. using the techinque to order the trigrams , bigrams and unigram in a way that help us computing the probabilities
    2. comuting the probabilities for each trigram
    3. sorting the trigrams using the Comperator

the output will be in this link :

    s3://emrodaiar/output/Step3Output2022-02-27T00-28-48.395/



About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages