Added BatchSourceContext.setInput(InputFormatProvider) #4512

Merged: chtyim merged 1 commit into develop from feature/etl-remove-hadoop-job-usage on Nov 10, 2015

@chtyim chtyim commented Nov 9, 2015

  • Remove usage of BatchContext.getHadoopJob
  • Also move MapReduce classes under "mapreduce" package
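
To illustrate the direction of this change, here is a minimal, dependency-free sketch of a source handing its input format to the context through a provider rather than configuring a Hadoop job directly. The two interfaces are local stand-ins whose shape is an assumption based only on the names in the PR title; `SimpleInputFormatProvider`, `RecordingContext`, `prepareRun`, the class name `com.example.ExampleInputFormat`, and the key `example.table` are all hypothetical and are not taken from this PR.

```java
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.Map;

public class SetInputSketch {

  /** Local stand-in for the provider type named in the PR title (shape assumed). */
  interface InputFormatProvider {
    String getInputFormatClassName();
    Map<String, String> getInputFormatConfiguration();
  }

  /** Local stand-in for the source context; only the method named in the PR title. */
  interface BatchSourceContext {
    void setInput(InputFormatProvider provider);
  }

  /** Hypothetical provider: an input format class name plus an explicit property map. */
  static final class SimpleInputFormatProvider implements InputFormatProvider {
    private final String className;
    private final Map<String, String> conf;

    SimpleInputFormatProvider(String className, Map<String, String> conf) {
      this.className = className;
      // Defensive copy so later changes by the caller do not leak in.
      this.conf = Collections.unmodifiableMap(new LinkedHashMap<>(conf));
    }

    @Override
    public String getInputFormatClassName() {
      return className;
    }

    @Override
    public Map<String, String> getInputFormatConfiguration() {
      return conf;
    }
  }

  /** Hypothetical context that simply records the provider it was given. */
  static final class RecordingContext implements BatchSourceContext {
    InputFormatProvider input;

    @Override
    public void setInput(InputFormatProvider provider) {
      this.input = provider;
    }
  }

  /** What a source might do at run preparation: describe its input, never touch a Job. */
  static void prepareRun(BatchSourceContext context) {
    Map<String, String> conf = new LinkedHashMap<>();
    conf.put("example.table", "users"); // hypothetical property
    context.setInput(new SimpleInputFormatProvider("com.example.ExampleInputFormat", conf));
  }

  public static void main(String[] args) {
    RecordingContext context = new RecordingContext();
    prepareRun(context);
    System.out.println(context.input.getInputFormatClassName());
    System.out.println(context.input.getInputFormatConfiguration());
  }
}
```

With this shape the source only describes its input (a class name and a property map), and the platform decides how to apply it to the underlying job.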

@chtyim chtyim added the 3.3 label Nov 9, 2015
@chtyim chtyim force-pushed the feature/etl-remove-hadoop-job-usage branch from 0b7277c to 88f7f4c Compare November 9, 2015 18:33
@chtyim chtyim changed the title from "- Added BatchSourceContext.setInput(InputFormatProvider)" to "Added BatchSourceContext.setInput(InputFormatProvider)" Nov 9, 2015
@chtyim chtyim force-pushed the feature/etl-remove-hadoop-job-usage branch from 88f7f4c to 2467d13 Compare November 9, 2015 18:51
config.put(entry.getKey(), entry.getValue());
}
} catch (IOException e) {
// Shouldn't happen
Contributor

Is this "shouldn't happen" because tpfsAvroConfig.schema has already been validated in the configurePipeline method?

Contributor Author

The IOException is from the Job.getInstance() call.

Contributor

Oh ok. 👍

@gokulavasan
Contributor

One question. Rest LGTM 👍

Configuration hConf = job.getConfiguration();
hConf.clear();
Contributor

Why is this needed? Is it so that the hConf passed to SourceInputFormatProvider on line 101 contains nothing other than the DBConfiguration and ETLDBInputFormat properties?

Contributor Author

Otherwise it will contain everything from core-default.xml, which is unnecessary.
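
The effect discussed in this thread can be sketched without Hadoop on the classpath. In the sketch below a plain `Map` stands in for the Hadoop configuration object (an assumption made to keep the example self-contained), the two pre-populated entries are merely illustrative defaults, and `example.db.url` is a hypothetical key. It shows that copying every entry into the provider's map picks up the defaults unless the configuration is cleared first.

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class ClearBeforeCopySketch {

  /** Stand-in for a configuration that arrives pre-populated with default entries. */
  static Map<String, String> newConfWithDefaults() {
    Map<String, String> conf = new LinkedHashMap<>();
    conf.put("io.file.buffer.size", "4096"); // illustrative default entry
    conf.put("fs.defaultFS", "file:///");    // illustrative default entry
    return conf;
  }

  /** Builds the property map a provider would carry, optionally clearing defaults first. */
  static Map<String, String> buildProviderConf(boolean clearFirst) {
    Map<String, String> hConf = newConfWithDefaults();
    if (clearFirst) {
      hConf.clear(); // drop everything that was loaded by default
    }
    hConf.put("example.db.url", "jdbc:example://host/db"); // hypothetical source setting

    // Copy every remaining entry, as the provider's configuration map would receive it.
    Map<String, String> config = new LinkedHashMap<>();
    for (Map.Entry<String, String> entry : hConf.entrySet()) {
      config.put(entry.getKey(), entry.getValue());
    }
    return config;
  }

  public static void main(String[] args) {
    System.out.println("without clear: " + buildProviderConf(false).keySet());
    System.out.println("with clear:    " + buildProviderConf(true).keySet());
  }
}
```

In real Hadoop code a similar result may be achievable by constructing the configuration without loading defaults, but whether that fits the code under review is not something this thread establishes.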

@chtyim chtyim force-pushed the feature/etl-remove-hadoop-job-usage branch from 9c2efc8 to 6a70f8d Compare November 9, 2015 22:46
@chtyim
Contributor Author

chtyim commented Nov 10, 2015

Build passed: http://builds.cask.co/browse/CDAP-DUT3117-3

@chtyim chtyim force-pushed the feature/etl-remove-hadoop-job-usage branch from 6a70f8d to d6f3a56 Compare November 10, 2015 01:39
@chtyim
Contributor Author

chtyim commented Nov 10, 2015

Squashed commits. Merging.

chtyim added a commit that referenced this pull request Nov 10, 2015
…usage

Added BatchSourceContext.setInput(InputFormatProvider)
@chtyim chtyim merged commit 66c64d2 into develop Nov 10, 2015
@chtyim chtyim deleted the feature/etl-remove-hadoop-job-usage branch November 10, 2015 01:39
3 participants