
[STREAMING] SPARK-1581: Allow One Flume Avro RPC Server for Each Worker rather than Just One Worker #495

Closed · wants to merge 1 commit

Conversation

@christopheclc opened this pull request.

@AmplabJenkins

Can one of the admins verify this patch?

@tdas (Contributor) commented Apr 25, 2014

Can you elaborate on what this use case is? Please add more information to the JIRA explaining the problem and the intended solution.

https://issues.apache.org/jira/browse/SPARK-1581

@tdas (Contributor) commented Apr 25, 2014

Also, there have been some updates to the Flume stream code; it may be a good idea to merge from master.

pwendell added a commit to pwendell/spark that referenced this pull request May 12, 2014
Fix graphx Commons Math dependency

`graphx` depends on Commons Math (2.x) in `SVDPlusPlus.scala`, but the module doesn't declare this dependency. It happens to work because the artifact is pulled in transitively by Hadoop artifacts; as of a month or so ago that is no longer the case, and building against recent Hadoop fails. (That's how we noticed.)

The simple fix is to declare the dependency, as it should be. It's also worth noting that `commons-math` is the older 2.x line, while newer 3.x releases live in `commons-math3`: a drop-in replacement, but with a different artifact and package name. Changing the only usage to `commons-math3` works and tests pass, which isn't surprising, so it is probably also worth doing. (A comment in some test code also references `commons-math3`, FWIW.)

It does raise another question, though: `mllib` appears to use the `jblas` `DoubleMatrix` for general-purpose vector/matrix work. Should `graphx` really use Commons Math for this? That's beyond the tiny scope here, but worth asking.
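The declared dependency described above would be roughly the following `pom.xml` fragment (a sketch; the exact version number is an assumption, not taken from the commit):

```xml
<!-- In graphx/pom.xml: declare the dependency explicitly instead of
     relying on it arriving transitively through Hadoop artifacts.
     The version shown here is illustrative. -->
<dependency>
  <groupId>org.apache.commons</groupId>
  <artifactId>commons-math3</artifactId>
  <version>3.2</version>
</dependency>
```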
@tdas (Contributor) commented Jul 30, 2014

@christopheclc Any ideas on elaborating on the use case behind this? If it is no longer relevant, I am inclined to close this PR.

@asfgit closed this in 87738bf on Aug 2, 2014
@DannyGuoHT commented

I don't see how this patch can resolve the issue, because it just changes the host to "0.0.0.0".
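For context on what binding to 0.0.0.0 buys: the wildcard address makes a server reachable on every network interface of whatever host it ends up on, so a receiver scheduled onto an arbitrary worker does not need to know that worker's hostname in advance. A minimal, generic sketch of wildcard binding (plain Python sockets, not Spark or Flume code):

```python
import socket
import threading

# Bind to the wildcard address 0.0.0.0 with port 0 (let the OS pick a
# free port). The server is now reachable on all interfaces of this host.
server = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
server.bind(("0.0.0.0", 0))
server.listen(1)
port = server.getsockname()[1]

def accept_one():
    # Accept a single connection and reply, then close.
    conn, _ = server.accept()
    conn.sendall(b"ok")
    conn.close()

t = threading.Thread(target=accept_one)
t.start()

# A client can reach the server via loopback (or any other interface
# of the host), without the server having hard-coded a hostname.
client = socket.create_connection(("127.0.0.1", port))
reply = client.recv(2)
client.close()
t.join()
server.close()
print(reply.decode())  # -> ok
```

Whether that alone is sufficient for the Flume receiver case is exactly the question raised in this comment; the thread was closed without a detailed answer.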

andrewor14 pushed a commit to andrewor14/spark that referenced this pull request Jan 8, 2015
Fix graphx Commons Math dependency

(commit message identical to the one quoted above)
(cherry picked from commit 3184fac)

Signed-off-by: Patrick Wendell <pwendell@gmail.com>
yifeih added a commit to yifeih/spark that referenced this pull request Feb 25, 2019
yifeih added a commit to yifeih/spark that referenced this pull request Feb 27, 2019
yifeih added a commit to yifeih/spark that referenced this pull request May 8, 2019
…ache#498)

* add initial bypass merge sort shuffle writer benchmarks

* add unsafe shuffle writer benchmarks

* changes in bypassmergesort benchmarks

* cleanup

* add circle script

* add this branch for testing

* fix circle attempt 1

* checkout code

* add some caches?

* why is it not pull caches...

* save as artifact instead of publishing

* mkdir

* typo

* try uploading artifacts again

* try print per iteration to avoid circle erroring out on idle

* blah (apache#495)

* make a PR comment

* actually delete files

* run benchmarks on test build branch

* oops forgot to enable upload

* add sort shuffle writer benchmarks

* add stdev

* cleanup sort a bit

* fix stdev text

* fix sort shuffle

* initial code for read side

* format

* use times and sample stdev

* add assert for at least one iteration

* cleanup shuffle write to use fewer mocks and single base interface

* shuffle read works with transport client... needs lots of cleaning

* test running in circle

* scalastyle

* dont publish results yet

* cleanup writer code

* get only git message

* fix command to get PR number

* add SortshuffleWriterBenchmark

* writer code

* cleanup

* fix benchmark script

* use ArgumentMatchers

* also in shufflewriterbenchmarkbase

* scalastyle

* add apache license

* fix some scale stuff

* fix up tests

* only copy benchmarks we care about

* increase size for reader again

* delete two writers and reader for PR

* SPARK-25299: Add shuffle reader benchmarks (apache#506)

* Revert "SPARK-25299: Add shuffle reader benchmarks (apache#506)"

This reverts commit 9d46fae.

* add -e to bash script

* blah

* enable upload as a PR comment and prevent running benchmarks on this branch

* Revert "enable upload as a PR comment and prevent running benchmarks on this branch"

This reverts commit 13703fa.

* try machine execution

* try uploading benchmarks (apache#498)

* only upload results when merging into the feature branch

* lock down machine image

* don't write input data to disk

* run benchmark test

* stop creating file cleanup threads for every block manager

* use alphanumeric again

* use a new random every time

* close the writers -__________-

* delete branch and publish results as comment

* close in finally
bzhaoopenstack pushed a commit to bzhaoopenstack/spark that referenced this pull request Sep 11, 2019
You can use the config below to enable gcc-7:
```
  roles:
    - role: config-gcc
      gcc_version: 7
```

Close-issue: theopenlab/openlab#239