Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Improvement] Rename Hdfs*.java to Hadoop*.java to support other Hadoop FS-compatible distributed filesystem #895

Closed
3 tasks done
jiafuzha opened this issue May 23, 2023 · 4 comments · Fixed by #898
Closed
3 tasks done

Comments

@jiafuzha
Copy link
Contributor

Code of Conduct

Search before asking

  • I have searched in the issues and found no similar issues.

What would you like to be improved?

In server and storage modules, there are many classes prefixed with Hdfs which use Hadoop FS API and are thus impl agnostic, not depending on specific Hdfs impl. So, it's better to rename them to Hadoop* so that we can support other Hadoop FS compatible distributed filesystem by extending existing classes. It'll make code look more naturally.

There may be some slight differences among different Hadoop FS impls, like hadoop-daos not have a dedicated thread for reading and writing data, which is different from hdfs. Thus, we don't need to close outputstream at each flush to FS.

How should we improve?

Just rename the class names.

Are you willing to submit PR?

  • Yes I am willing to submit a PR!
@jiafuzha
Copy link
Contributor Author

@advancedxy @jerqi does it look reasonable for you?

@zuston
Copy link
Member

zuston commented May 23, 2023

Sounds good

@jerqi
Copy link
Contributor

jerqi commented May 23, 2023

OK for me.

@advancedxy
Copy link
Contributor

SGTM.

jerqi pushed a commit that referenced this issue May 24, 2023
…r Hadoop FS-compatible distributed filesystem (#898)

### What changes were proposed in this pull request?

In server and storage modules, there are many classes prefixed with Hdfs which use Hadoop FS API and are thus impl agnostic, not depending on specific Hdfs impl. So, it's better to rename them to Hadoop* so that we can support other Hadoop FS compatible distributed filesystem by extending existing classes. It'll make code look more naturally. 

There may be some slight differences among different Hadoop FS impls, like [hadoop-daos](https://github.com/daos-stack/daos/tree/master/src/client/java/hadoop-daos) not have a dedicated thread for reading and writing data, which is different from hdfs. Thus, we don't need to close outputstream at each flush to FS. 

### Why are the changes needed?

Fix: #895 

### Does this PR introduce _any_ user-facing change?

No.

### How was this patch tested?

CI passed.

Signed-off-by: jiafu zhang <jiafu.zhang@intel.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
4 participants