"Blocks read inconsistent" happened when shuffle read #76

Augus-smile · 2022-02-09T03:15:33Z

你好，我在执行spark 作业时，在shuffle read阶段，显示blocks read inconsistent错误，请问是什么原因呢？

server.conf:

rss.server.flush.thread.alive=2
rss.server.flush.threadPool.size=4
rss.server.buffer.capacity=6g
rss.server.read.buffer.capacity=3g
rss.server.disk.capacity=180g

server_rss_env.sh:
....
XMX_SIZE="12g"
....

colinmjj · 2022-02-09T03:38:40Z

When shuffle read, after process all blocks, it will check if all expected blocks are processed. "Blocks read inconsistent" will be thrown when some blocks are lost. It may be caused by writing shuffle data failed.

Augus-smile · 2022-02-09T06:19:58Z

Thank you for your reply, Does the LOCALFILE_AND_HDFS mode write data to the HDFS and Shuffle server at the same time?

colinmjj · 2022-02-09T09:09:30Z

@Augus-smile please try release 0.2.0 which has a lot of improvements and bug fix. You can refer readme for detail configuration. In 0.2.0, MEMORY_LOCALFILE_HDFS is introduced for multiple storages. We will publish related doc soon.

Augus-smile · 2022-02-10T09:28:34Z

Shuffle server process was killed due to OOM, related server conf : physical memory : 16g, XMX_SIZE: 12g. However, during the Spark task execution, the Shuffle Server process occupies nearly 16g memory. What are the memory consumption components of the Shuffle Server? Is there a parameter to restrict the memory usage of shuffle Server?

colinmjj · 2022-02-10T09:35:44Z

set xmx_size in rss-env.sh
in server.conf:
rss.server.buffer.capacity # memory cache for write and read
rss.server.read.buffer.capacity # memory cache for read
BTW, please left extra 5G memory for shuffle server

jerqi closed this as completed Feb 25, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

"Blocks read inconsistent" happened when shuffle read #76

"Blocks read inconsistent" happened when shuffle read #76

Augus-smile commented Feb 9, 2022

colinmjj commented Feb 9, 2022 •

edited

Loading

Augus-smile commented Feb 9, 2022

colinmjj commented Feb 9, 2022

Augus-smile commented Feb 10, 2022

colinmjj commented Feb 10, 2022 •

edited

Loading

"Blocks read inconsistent" happened when shuffle read #76

"Blocks read inconsistent" happened when shuffle read #76

Comments

Augus-smile commented Feb 9, 2022

colinmjj commented Feb 9, 2022 • edited Loading

Augus-smile commented Feb 9, 2022

colinmjj commented Feb 9, 2022

Augus-smile commented Feb 10, 2022

colinmjj commented Feb 10, 2022 • edited Loading

colinmjj commented Feb 9, 2022 •

edited

Loading

colinmjj commented Feb 10, 2022 •

edited

Loading