[Bug]: [tke regression] tpcc 100-1000 test oom. #16379

Closed
1 task done
Ariznawlll opened this issue May 24, 2024 · 11 comments
Labels: duplicate (This issue or pull request already exists), kind/bug (Something isn't working), phase/testing, severity/s0 (Extreme impact: Cause the application to break down and seriously affect the use)

Comments

@Ariznawlll
Contributor

Is there an existing issue for the same bug?

  • I have checked the existing issues.

Branch Name

main

Commit ID

e4c504a

Other Environment Information

- Hardware parameters:
- OS type:
- Others:

Actual Behavior

job url: https://github.com/matrixorigin/mo-nightly-regression/actions/runs/9211125528/job/25347041190
[screenshots]

log:https://grafana.ci.matrixorigin.cn/explore?panes=%7B%22JM6%22:%7B%22datasource%22:%22loki%22,%22queries%22:%5B%7B%22refId%22:%22A%22,%22expr%22:%22%7Bnamespace%3D%5C%22mo-nightly-regression-20240523%5C%22%7D%20%7C%3D%20%60%60%22,%22queryType%22:%22range%22,%22datasource%22:%7B%22type%22:%22loki%22,%22uid%22:%22loki%22%7D,%22editorMode%22:%22builder%22%7D%5D,%22range%22:%7B%22from%22:%22now-12h%22,%22to%22:%22now%22%7D%7D%7D&schemaVersion=1&orgId=1

Profile: will be captured when the issue is reproduced today.

Expected Behavior

No response

Steps to Reproduce

trigger tke test

Additional information

No response

@Ariznawlll Ariznawlll added kind/bug Something isn't working needs-triage severity/s0 Extreme impact: Cause the application to break down and seriously affect the use labels May 24, 2024
@Ariznawlll Ariznawlll added this to the 1.2.1 milestone May 24, 2024
@Ariznawlll
Contributor Author

Ariznawlll commented May 24, 2024

job url:https://github.com/matrixorigin/mo-nightly-regression/actions/runs/9211125528/job/25364260136

image

Reproduced, but the profile is too large to attach here. Please contact me and I will send it to you privately.

@jensenojs
Contributor

jensenojs commented May 24, 2024

ReadBloomFilter appears to be using a lot of memory. @XuPeng-SH, could you please find someone to take a look?

alloc
image

in use
image

@jensenojs jensenojs assigned XuPeng-SH and unassigned matrix-meow May 24, 2024
@jensenojs
Contributor

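XuPeng confirmed that the bloom filter (bf) memory usage is small, and nothing unusual shows up in the heap profile. We need to bisect to find this bug. Assign it to me for now; once the commit that introduced it is found, I will hand it over to the responsible colleague.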
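
For reference, the bisection proposed above can be sketched as a binary search over an ordered list of commits. This is only an illustration, not the procedure actually used in this thread: `first_bad_commit` and the `is_bad` callback are hypothetical names, and the sketch assumes that once a commit reproduces the OOM, every later commit does too. In practice `is_bad` would stand for building a commit and running the tpcc test against it.

```python
from typing import Callable, Optional, Sequence


def first_bad_commit(
    commits: Sequence[str], is_bad: Callable[[str], bool]
) -> Optional[str]:
    """Return the earliest commit for which is_bad() is True.

    commits must be ordered oldest to newest. Assumes that once a commit
    is bad, every later commit is bad as well. Returns None when the list
    is empty or the newest commit is good (nothing to find).
    """
    if not commits or not is_bad(commits[-1]):
        return None
    lo, hi = 0, len(commits) - 1  # invariant: commits[hi] is bad
    while lo < hi:
        mid = (lo + hi) // 2
        if is_bad(commits[mid]):
            hi = mid  # the culprit is mid or an earlier commit
        else:
            lo = mid + 1  # the culprit is after mid
    return commits[lo]
```

If the result is the oldest commit in the list, the regression may predate the searched range, and the window has to be widened before the culprit can be identified.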

@XuPeng-SH
Contributor

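In the profile, the bloom filter data is 2 GB, which is the size of the metadata cache. This is expected: the daily runs have always shown 2 GB and that has never changed. The CN memory limit is 55 GB, so this is not the cause.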
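
To put the two figures quoted above side by side, here is a small sketch of the arithmetic. The helper name `share_of_limit` is hypothetical, and the only inputs are the 2 GB and 55 GB values from this comment.

```python
def share_of_limit(used_gb: float, limit_gb: float) -> float:
    """Return used memory as a fraction of the memory limit."""
    if limit_gb <= 0:
        raise ValueError("limit_gb must be positive")
    return used_gb / limit_gb


bloom_filter_gb = 2.0  # bloom filter / metadata cache size seen in the profile
cn_limit_gb = 55.0     # CN memory limit

# Print the bloom filter's share of the CN limit as a percentage.
print(f"{share_of_limit(bloom_filter_gb, cn_limit_gb):.1%}")
```

The bloom filter accounts for well under a tenth of the limit, which is consistent with the conclusion that it is not what drives the CN out of memory.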

@XuPeng-SH XuPeng-SH assigned jensenojs and unassigned gouhongshen May 24, 2024
sukki37 pushed a commit that referenced this issue May 25, 2024
#16401)

…16359)"

This reverts commit 918a356.

## What type of PR is this?

- [ ] API-change
- [ ] BUG
- [ ] Improvement
- [ ] Documentation
- [ ] Feature
- [ ] Test and CI
- [ ] Code Refactoring

## Which issue(s) this PR fixes:

issue ##16379

## What this PR does / why we need it:
revert to avoid oom
sukki37 pushed a commit that referenced this issue May 25, 2024
)

This reverts commit e4c504a.

## What type of PR is this?

- [ ] API-change
- [ ] BUG
- [ ] Improvement
- [ ] Documentation
- [ ] Feature
- [ ] Test and CI
- [ ] Code Refactoring

## Which issue(s) this PR fixes:
issue ##16379


## What this PR does / why we need it:
revert to avoid oom
@XuPeng-SH
Contributor

[screenshot] Bisected back to #16292 from May 21: running tpcc on it also hits the OOM, so the OOM was already present before that.

@aressu1985 aressu1985 removed this from the 1.2.1 milestone May 29, 2024
@jensenojs
Contributor

@jensenojs jensenojs added the duplicate This issue or pull request already exists label Jun 24, 2024
@jensenojs
Contributor

not working on it

@jensenojs
Contributor

This issue should be resolved after #16741. Could you please verify it when you have time?

@aressu1985

@Ariznawlll
Contributor Author

This has not occurred again recently, so closing for now.

6 participants