Skip to content
This repository was archived by the owner on May 12, 2021. It is now read-only.

Hdfs2 read#16

Closed
efikalti wants to merge 37 commits intoapache:masterfrom
efikalti:hdfs2_read
Closed

Hdfs2 read#16
efikalti wants to merge 37 commits intoapache:masterfrom
efikalti:hdfs2_read

Conversation

@efikalti
Copy link
Copy Markdown
Contributor

Contains the methods for reading collection of documents and single documents from HDFS, JUnit tests for reading collection of documents.

Retrieving of the hdfs configuration folder path is the old one yet, retrieving it by the system environments as well as the tests run on the user's hdfs setup and not a temp.These are to be changed in the next commit.

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This doesn't actually search subdirectories right?

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

No it doesn't, do we want it to search subdirectories?

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nope, but I think the description is misleading as it is.

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The collection function actually searching all sub directories and finds all the XML documents. Are you doing the same?

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes it does the same, I tested it with the ghcnd folder from the unit tests and works correctly.I will change the comment to be more clear.

@sjaco002
Copy link
Copy Markdown
Contributor

sjaco002 commented Jul 8, 2015

A few comments from me

pom.xml Outdated
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code style should match the document. 4 spaces for each indentation.

@prestoncarman
Copy link
Copy Markdown
Contributor

I added a few comments and will take a closer look once you have made the suggested changes.

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should we just create a separate constructor where the tag is added? Then tag can be final.



----------------------------
for $x in collectionwithtag("hdfs://user/hduser/store","book")/book
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Update example - collection-with-tag.

@prestoncarman
Copy link
Copy Markdown
Contributor

The change has been merged under a different pull request. Nice to have this part of the code base! (Finally!)

@efikalti
Copy link
Copy Markdown
Contributor Author

I am glad to hear and see this into the code! :D Thank you for the final steps!

I am closing this pull

@efikalti efikalti closed this May 23, 2016
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants