bug: Tianmu column storage engine - performs HashJoin queries - runs out of disk space #558

Open
1 task done
adofsauron opened this issue Sep 23, 2022 · 2 comments
Labels
A-bug Something isn't working B-performance prio: low Low priority

Comments

@adofsauron
Collaborator

Describe the problem

Abstract:

The Tianmu column storage engine for MySQL runs out of disk space while executing a HashJoin query. This issue records the problem.
Data file backup: https://download.csdn.net/download/adofsauron/86662384

SQL:

 
CREATE TABLE `data02` (
  `id` int(11) DEFAULT NULL,
  `uname` varchar(20) DEFAULT NULL,
  `regdate` datetime DEFAULT NULL,
  `logindate` datetime DEFAULT NULL,
  `phone` varchar(255) DEFAULT NULL,
  `encryption` varchar(20) DEFAULT NULL,
  `number` varchar(20) DEFAULT NULL,
  `accuracy` varchar(50) DEFAULT NULL,
  `azimuth` varchar(50) DEFAULT NULL,
  `age` int(11) DEFAULT NULL
) ENGINE=TIANMU DEFAULT CHARSET=utf8;
 
select a.id,b.uname from data02 a,data02 b where a.uname=b.uname;
ERROR 1105 (HY000): Tianmu other specific error
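
For context on why this query can grow large: an equality self-join on `uname` emits, for each distinct non-NULL `uname` value, the square of the number of rows carrying that value. The sketch below only illustrates that arithmetic. The helper name `self_join_rows` and the sample values are hypothetical and are not taken from the attached data file, and whether duplicate `uname` values are what drives the disk usage here is an assumption, not something the issue confirms.

```python
from collections import Counter


def self_join_rows(keys):
    """Count rows produced by joining a column with itself on equality.

    NULL keys (None) never satisfy `a.uname = b.uname`, so they are skipped.
    """
    counts = Counter(k for k in keys if k is not None)
    # Each distinct key with c rows matches itself c * c times.
    return sum(c * c for c in counts.values())


# Hypothetical sample values, not from the issue's data set.
sample = ["alice", "bob", "alice", None, "alice"]
print(self_join_rows(sample))  # 3*3 + 1*1 = 10
```

With 1,000,000 rows, the output ranges from 1,000,000 rows when every `uname` is unique to 10^12 rows when all rows share one value.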

Exceptions during query:

Error log:

[2022-09-23 14:32:32.901310] [1967019776] Begin traversed with 1 threads with none type.
[2022-09-23 14:32:32.904042] [713017088] Establishing hash table need 57MB
[2022-09-23 14:32:32.928696] [1967019776] End traversed 1280/1000000 rows.
[2022-09-23 14:32:32.928767] [1967019776] Too many hash conflicts: restarting join.
[2022-09-23 14:32:32.928912] [1967019776] Begin traversed with 1 threads with none type.
[2022-09-23 14:32:32.931694] [201324288] Establishing hash table need 57MB
[2022-09-23 14:32:51.923140] [1967019776] End traversed 1000000/1000000 rows.
[2022-09-23 14:32:51.923374] [1967019776] Begin match dim of 1000000 rows, spliting into 4 threads with packs type.
[2022-09-23 14:38:22.773741] [1967019776] Error: Parallel hash join failed.

Temporary files pile up in the /stonedb57/install/data/cache directory:

root@htap06:/stonedb57/install/data/cache# ls -ltr
total 16777260
-rw-r----- 1 mysql mysql 1073741824 Sep 23 14:33 JW_INT000000770x7f29e00329e0.tianmu_tmp
-rw-r----- 1 mysql mysql 1073741824 Sep 23 14:33 JW_INT000000580x7f29e0032ef0.tianmu_tmp
-rw-r----- 1 mysql mysql 1073741824 Sep 23 14:33 JW_INT000000050x7f29e0032770.tianmu_tmp
-rw-r----- 1 mysql mysql 1073741824 Sep 23 14:33 JW_INT000000720x7f29e0033160.tianmu_tmp
-rw-r----- 1 mysql mysql 1073741824 Sep 23 14:33 JW_INT000000270x7f29e00343e0.tianmu_tmp
-rw-r----- 1 mysql mysql 1073741824 Sep 23 14:33 JW_INT000000460x7f29e00345e0.tianmu_tmp
-rw-r----- 1 mysql mysql 1073741824 Sep 23 14:33 JW_INT000000310x7f29e001aba0.tianmu_tmp
-rw-r----- 1 mysql mysql 1073741824 Sep 23 14:33 JW_INT000000650x7f29e001b6b0.tianmu_tmp
-rw-r----- 1 mysql mysql 1073741824 Sep 23 14:34 JW_INT000001770x7f29e00329e0.tianmu_tmp
-rw-r----- 1 mysql mysql 1073741824 Sep 23 14:34 JW_INT000001580x7f29e0032ef0.tianmu_tmp
-rw-r----- 1 mysql mysql 1073741824 Sep 23 14:34 JW_INT000001050x7f29e0032770.tianmu_tmp
-rw-r----- 1 mysql mysql 1073741824 Sep 23 14:34 JW_INT000001720x7f29e0033160.tianmu_tmp
-rw-r----- 1 mysql mysql 1073741824 Sep 23 14:34 JW_INT000001270x7f29e00343e0.tianmu_tmp
-rw-r----- 1 mysql mysql 1073741824 Sep 23 14:34 JW_INT000001460x7f29e00345e0.tianmu_tmp
-rw-r----- 1 mysql mysql 1073741824 Sep 23 14:34 JW_INT000001310x7f29e001aba0.tianmu_tmp
-rw-r----- 1 mysql mysql 1073741824 Sep 23 14:34 JW_INT000001650x7f29e001b6b0.tianmu_tmp
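
The listing shows 16 files of 1073741824 bytes (1 GiB) each, that is 16 GiB written within about two minutes. A minimal sketch for tracking that growth while the query runs is given below. The helper name `tmp_usage` is hypothetical, and the `*.tianmu_tmp` pattern is inferred from the file names in the listing rather than from the engine's documentation.

```python
from pathlib import Path


def tmp_usage(cache_dir, pattern="*.tianmu_tmp"):
    """Return (file_count, total_bytes) for temp files directly inside cache_dir."""
    files = [p for p in Path(cache_dir).glob(pattern) if p.is_file()]
    return len(files), sum(p.stat().st_size for p in files)


# Arithmetic for the listing above: 16 files of 1 GiB each.
FILES_IN_LISTING = 16
BYTES_PER_FILE = 1073741824
print(FILES_IN_LISTING * BYTES_PER_FILE / 2**30)  # 16.0 (GiB)
```

Calling `tmp_usage("/stonedb57/install/data/cache")` on the affected host at intervals would show how quickly the join's temporary files accumulate before the query fails.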

Expected behavior

No response

How To Reproduce

No response

Environment

No response

Are you interested in submitting a PR to solve the problem?

  • Yes, I will!
@adofsauron adofsauron added the A-bug Something isn't working label Sep 23, 2022
@adofsauron adofsauron added this to the stonedb_5.7_v1.0.2 milestone Sep 23, 2022
@adofsauron adofsauron self-assigned this Sep 23, 2022
@wisehead
Collaborator

The result is correct, but the query takes too much disk space. We will fix it in v1.0.3.

@adofsauron
Collaborator Author

A similar scenario: #1110

@wisehead wisehead added B-performance B-storage data type, data storage, insert,update,delete, transactions labels Jan 9, 2023
@hustjieke hustjieke added prio: low Low priority and removed B-storage data type, data storage, insert,update,delete, transactions labels Jan 30, 2023
@adofsauron adofsauron removed their assignment Apr 27, 2023