JihyeHong/java

java

Java programming

Page Rank Program

An implementation of the PageRank algorithm.

  • UrlExtractor: extracts URL strings from text.
  • PageRank: uses graph (network) theory to find the most highly referenced web page (node).
  • MyURL: parses URLs.
  • ArticleIndexer: finds web page URLs within news articles.

The remaining text files can be used as sample data for this program.
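The ranking step can be sketched as a power iteration over a link graph. This is a hedged illustration, not the repository's PageRank class: the class name `PageRankSketch`, the adjacency-list input format, the damping factor of 0.85, and the choice to spread the rank of pages without outgoing links evenly over all pages are assumptions.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical sketch, not the repository's implementation.
final class PageRankSketch {

    /** Runs a fixed number of power-iteration steps over page -> outgoing links. */
    static Map<String, Double> rank(Map<String, List<String>> links, double damping, int iterations) {
        // Collect every node, including pages that only appear as link targets.
        Set<String> nodes = new LinkedHashSet<>(links.keySet());
        for (List<String> targets : links.values()) {
            nodes.addAll(targets);
        }
        int n = nodes.size();
        List<String> none = Collections.emptyList();

        // Start from a uniform distribution.
        Map<String, Double> rank = new LinkedHashMap<>();
        for (String node : nodes) {
            rank.put(node, 1.0 / n);
        }

        for (int i = 0; i < iterations; i++) {
            // Rank held by pages with no outgoing links is redistributed evenly.
            double dangling = 0.0;
            for (String node : nodes) {
                if (links.getOrDefault(node, none).isEmpty()) {
                    dangling += rank.get(node);
                }
            }
            double base = (1.0 - damping) / n + damping * dangling / n;
            Map<String, Double> next = new LinkedHashMap<>();
            for (String node : nodes) {
                next.put(node, base);
            }
            // Each page passes an equal share of its damped rank to its targets.
            for (String node : nodes) {
                List<String> targets = links.getOrDefault(node, none);
                if (targets.isEmpty()) {
                    continue;
                }
                double share = damping * rank.get(node) / targets.size();
                for (String target : targets) {
                    next.merge(target, share, Double::sum);
                }
            }
            rank = next;
        }
        return rank;
    }

    /** Returns the page with the highest score, or null for an empty map. */
    static String topPage(Map<String, Double> ranks) {
        String best = null;
        for (Map.Entry<String, Double> entry : ranks.entrySet()) {
            if (best == null || entry.getValue() > ranks.get(best)) {
                best = entry.getKey();
            }
        }
        return best;
    }

    public static void main(String[] args) {
        Map<String, List<String>> links = new LinkedHashMap<>();
        links.put("a.example", Arrays.asList("c.example"));
        links.put("b.example", Arrays.asList("c.example"));
        links.put("c.example", Arrays.asList("a.example"));
        System.out.println(topPage(rank(links, 0.85, 50)));
    }
}
```

In this sketch the scores always sum to 1, so they can be read as the share of "attention" each page receives from the pages linking to it.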

Enron

Data Mining with Hadoop Map/Reduce Program Implementation

This project contains two Map/Reduce programs.

  • MailReader: a MapReduce program that parses the headers of email messages and extracts triples consisting of the sender's employee ID, the recipient's employee ID, and the transmission timestamp.
  • MultipleOutput: a MapReduce program that time-slices the Enron data set into multiple outputs.
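The per-message logic of the two programs above can be sketched in plain Java, without the Hadoop API. Everything here is an assumption made for illustration: the class name `MailHeaderSketch`, the use of raw email addresses in place of employee IDs, the one-triple-per-recipient rule, and the month-sized time slices. Folded (multi-line) headers are not handled.

```java
import java.time.OffsetDateTime;
import java.time.YearMonth;
import java.time.format.DateTimeFormatter;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical sketch of the map-side logic; it does not use Hadoop classes.
final class MailHeaderSketch {

    /** One (sender, recipient, timestamp) record. */
    static final class Triple {
        final String sender;
        final String recipient;
        final OffsetDateTime sentAt;

        Triple(String sender, String recipient, OffsetDateTime sentAt) {
            this.sender = sender;
            this.recipient = recipient;
            this.sentAt = sentAt;
        }
    }

    /** Reads the From, To, and Date headers and emits one triple per recipient. */
    static List<Triple> parse(String message) {
        String from = null;
        String to = null;
        String date = null;
        for (String line : message.split("\r?\n")) {
            if (line.isEmpty()) {
                break; // the first blank line ends the header block
            }
            if (line.startsWith("From: ")) {
                from = line.substring(6).trim();
            } else if (line.startsWith("To: ")) {
                to = line.substring(4).trim();
            } else if (line.startsWith("Date: ")) {
                date = line.substring(6).trim();
            }
        }
        List<Triple> triples = new ArrayList<>();
        if (from == null || to == null || date == null) {
            return triples; // skip messages with incomplete headers
        }
        // Drop a trailing zone comment such as "(PDT)" before parsing.
        String cleaned = date.replaceAll("\\s*\\(.*\\)$", "");
        OffsetDateTime sentAt = OffsetDateTime.parse(cleaned, DateTimeFormatter.RFC_1123_DATE_TIME);
        for (String part : to.split(",")) {
            String recipient = part.trim();
            if (!recipient.isEmpty()) {
                triples.add(new Triple(from, recipient, sentAt));
            }
        }
        return triples;
    }

    /** Groups triples by year-month, standing in for one output per time slice. */
    static Map<String, List<Triple>> sliceByMonth(List<Triple> triples) {
        Map<String, List<Triple>> slices = new TreeMap<>();
        for (Triple triple : triples) {
            String key = YearMonth.from(triple.sentAt).toString();
            slices.computeIfAbsent(key, k -> new ArrayList<>()).add(triple);
        }
        return slices;
    }
}
```

In a real MapReduce job, `parse` would roughly correspond to the mapper body and the slice key to the name of the output each record is written to.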

Streams

Java 8 streams practice

  • Books: a directory of Stream API exercises based on a library's book archive
  • Sms: a directory of Stream API exercises based on SMS messages
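For a flavor of the kind of exercise the Books directory describes, here is a small Java 8 stream sketch. The `Book` class, its fields, and both queries are assumptions made for this example and are not taken from the repository.

```java
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.stream.Collectors;

// Hypothetical sketch; the Book type and the queries are illustrative only.
final class BookStreamSketch {

    static final class Book {
        final String title;
        final String author;
        final int year;

        Book(String title, String author, int year) {
            this.title = title;
            this.author = author;
            this.year = year;
        }
    }

    /** Titles of books published after the given year, sorted alphabetically. */
    static List<String> titlesPublishedAfter(List<Book> books, int year) {
        return books.stream()
                .filter(book -> book.year > year)
                .map(book -> book.title)
                .sorted(Comparator.naturalOrder())
                .collect(Collectors.toList());
    }

    /** Number of books per author, with authors in alphabetical order. */
    static Map<String, Long> countByAuthor(List<Book> books) {
        return books.stream()
                .collect(Collectors.groupingBy((Book book) -> book.author, TreeMap::new, Collectors.counting()));
    }

    public static void main(String[] args) {
        List<Book> archive = Arrays.asList(
                new Book("Emma", "Jane Austen", 1815),
                new Book("Persuasion", "Jane Austen", 1817),
                new Book("Dune", "Frank Herbert", 1965));
        System.out.println(titlesPublishedAfter(archive, 1816));
        System.out.println(countByAuthor(archive));
    }
}
```

The same filter, map, and collect pattern would carry over to SMS messages by swapping `Book` for a message type.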
