Skip to content

Conversation

@FangYongs
Copy link
Contributor

Currently OrcFileFormat will create new org.apache.hadoop.conf.Configuration instance in methods createReaderFactory and createWriterFactory. The initialization of org.apache.hadoop.conf.Configuration tries to load local file core-site.xml.

This PR will initialize the writerConf and readerConf of org.apache.hadoop.conf.Configuration and reuse them in createReaderFactory and createWriterFactory.

@FangYongs
Copy link
Contributor Author

Hi @JingsongLi
I create this new PR for FLINK-29345. I find the ThreadLocalClassLoaderConfiguration in OrcBulkWriterFactory can't be removed while it is used to avoid classloader leaks and the detail is in the docs of ThreadLocalClassLoaderConfiguration.
I have created writerConf and readerConf of org.apache.hadoop.conf.Configuration and reuse them in createReaderFactory and createWriterFactory

Copy link
Contributor

@JingsongLi JingsongLi left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @zjureel ! Looks good to me!

@JingsongLi JingsongLi merged commit bee050f into apache:master Oct 8, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants