Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

NUTCH-2644 CrawlDbReader -dump ignores filter options #383

Merged

Conversation

sebastian-nagel
Copy link
Contributor

  • need to pass filter options via job configuration into mapper

- need to pass filter options via job configuration into mapper
  (modifying original configuration does not have any effect)
- must set values of command-line options in job configuration
  to pass them to job tasks
- use separate job configuration for separate web graph jobs/steps
- make NodeDumper job/tool to log to stdout
@sebastian-nagel sebastian-nagel merged commit 0ce62e1 into apache:master Oct 7, 2018
@sebastian-nagel sebastian-nagel deleted the NUTCH-2644-crawldb-reader branch October 7, 2018 19:13
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
1 participant