Skip to content

Commit

Permalink
Add README_en.md
Browse files Browse the repository at this point in the history
  • Loading branch information
zieckey committed Jan 19, 2017
1 parent bd61669 commit 83b60d6
Showing 1 changed file with 13 additions and 0 deletions.
13 changes: 13 additions & 0 deletions README_en.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,13 @@
# Poseidon

Poseidon is a log searching engine system that can quickly search and retrieve specific strings in hundreds of petabytes and hundreds of trillion lines of log data.
In the past, if we try to find some information in such large set of data, we need to write a Map/Reduce task program and run it in Hadoop platform.
This will cost several hours, maybe more, that greatly restricting the working efficiency.

The Poseidon system can solve this problem, it builds the inverted index directly on HDFS and does not change the storing mode of the original log data which is still stored on HDFS.
That means the Poseidon system does not need to store the original log data.
We can use Poseidon system to do searching and use Hadoop system to run Map/Reduce task in the same time and on the same data set.

That is very different with ElasticSearch which stores the index data and original data in its own system and stores the original data on HDFS if we also need to run Map/Reduce task.
And more importantly, ElasticSearch cannot hold such large scale of data set.

0 comments on commit 83b60d6

Please sign in to comment.