Skip to content

Commit

Permalink
update
Browse files Browse the repository at this point in the history
  • Loading branch information
sunisdown committed Jan 3, 2019
1 parent c81da93 commit 6f197f8
Showing 1 changed file with 1 addition and 11 deletions.
12 changes: 1 addition & 11 deletions distributed_system/mpc.rst
Original file line number Diff line number Diff line change
@@ -1,17 +1,7 @@
分布式计算值 MPC
分布式计算之 MPC
===============================================


现代的数据分析通常是跑在大规模的 shared-nothing 集群上。

join的性能是大数据处理系统里面一个最重要的指标。两种join方法,hash join
和 sort join。hash
join在大多数的数据集中都可以通过增加设备来做到线性扩展。但是当数据倾斜比较严重时,hash
join 性能就会比较差,比如大多数

都被hash到同一个节点上。sort join
对于数据倾斜这种情况反而能处理的很好,但是在排序又带来额额外的消耗。

MPC
---

Expand Down

0 comments on commit 6f197f8

Please sign in to comment.