bluefs _allocate unable to allocate 0x90000 on bdev 1 #9885

Closed
osaffer opened this issue Mar 10, 2022 · 10 comments

osaffer commented Mar 10, 2022

ceph-version: 16.2.6-0
rook-version: v1.7.4

Hi,

One morning all of my OSD pods had crashed, except one per node.
When I check the OSD logs I can see:
debug -7> 2022-03-10T09:59:05.158+0000 7ff21a9db080 4 rocksdb: EVENT_LOG_v1 {"time_micros": 1646906345165107, "job": 1, "event": "recovery_started", "log_files": [17870]}
debug -6> 2022-03-10T09:59:05.158+0000 7ff21a9db080 4 rocksdb: [db_impl/db_impl_open.cc:760] Recovering log #17870 mode 2
debug -5> 2022-03-10T09:59:05.430+0000 7ff21a9db080 3 rocksdb: [le/block_based/filter_policy.cc:584] Using legacy Bloom filter with high (20) bits/key. Dramatic filter space and/or accuracy improvement is available with format_version>=5.
debug -4> 2022-03-10T09:59:05.434+0000 7ff21a9db080 1 bluefs _allocate unable to allocate 0x90000 on bdev 1, allocator name block, allocator type hybrid, capacity 0x4ffc00000, block size 0x10000, free 0x0, fragmentation 0, allocated 0x0
debug -3> 2022-03-10T09:59:05.434+0000 7ff21a9db080 -1 bluefs _allocate allocation failed, needed 0x80cbb
debug -2> 2022-03-10T09:59:05.434+0000 7ff21a9db080 -1 bluefs _flush_range allocated: 0x0 offset: 0x0 length: 0x80cbb
debug -1> 2022-03-10T09:59:05.442+0000 7ff21a9db080 -1 /home/jenkins-build/build/workspace/ceph-build/ARCH/x86_64/AVAILABLE_ARCH/x86_64/AVAILABLE_DIST/centos8/DIST/centos8/MACHINE_SIZE/gigantic/release/16.2.6/rpm/el8/BUILD/ceph-16.2.6/src/os/bluestore/BlueFS.cc: In function 'int BlueFS::_flush_range(BlueFS::FileWriter*, uint64_t, uint64_t)' thread 7ff21a9db080 time 2022-03-10T09:59:05.440116+0000
/home/jenkins-build/build/workspace/ceph-build/ARCH/x86_64/AVAILABLE_ARCH/x86_64/AVAILABLE_DIST/centos8/DIST/centos8/MACHINE_SIZE/gigantic/release/16.2.6/rpm/el8/BUILD/ceph-16.2.6/src/os/bluestore/BlueFS.cc: 2768: ceph_abort_msg("bluefs enospc")

After some research, I noticed that some people have hit more or less the same issue.
They mention a workaround:
[osd]
bluestore_allocator = bitmap

Can you tell me where I can set this parameter?

I have also added some new disks on each node.

Thank you very much
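
For readers hitting the same trace: the key part of the log is "free 0x0", i.e. BlueFS could not find any free space to allocate on the main device. A minimal sketch for checking what BlueFS sees on a stopped OSD, assuming the data path is /var/lib/ceph/osd/ceph-0 (adjust the id and, in Rook, run it inside the OSD pod):

# Print the device sizes as understood by BlueFS (run only while the OSD is stopped).
ceph-bluestore-tool bluefs-bdev-sizes --path /var/lib/ceph/osd/ceph-0

# Check the size of the backing logical volume on the host.
lvs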

osaffer added the bug label Mar 10, 2022

osaffer commented Mar 10, 2022

OK, I found how to change the allocator type, but it did not solve the problem:
ceph config set osd.0 bluestore_allocator bitmap
So I put hybrid back.


travisn commented Mar 10, 2022

Yes, that command in the toolbox should work. Another way to set it is in the ceph.conf overrides.
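
For reference, a sketch of both options, assuming the Rook defaults (the rook-ceph namespace, a toolbox deployment named rook-ceph-tools, and the rook-config-override ConfigMap); adjust the names if your install differs:

# Option 1: run the command from the toolbox.
kubectl -n rook-ceph exec -it deploy/rook-ceph-tools -- ceph config set osd bluestore_allocator bitmap

# Option 2: add it to the ceph.conf overrides, then restart the OSD pods so they pick it up.
kubectl -n rook-ceph edit configmap rook-config-override
# and under data.config add:
#   [osd]
#   bluestore_allocator = bitmap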


osaffer commented Mar 11, 2022

Hi,

It has been fixed.

  1. Extend the underlying disk
  2. Resize the physical volume: pvresize /dev/sdX
  3. Run pvdisplay -m to find the logical volume associated with that disk
  4. lvextend -L +10G /dev/mapper/ceph--eda319c5--cce0--4a33--90d1--9ecf950676f5-osd--data--08884bac--6bac--4ae5--8a28--cdc43de1b85e

When done, put a sleep inside the pod:
Edit the OSD deployment and change the ceph-osd command to

command:

  • sh
  • -c
  • sleep 10000

Then:

  5. Open a session inside the OSD pod
  6. ceph-bluestore-tool bluefs-bdev-expand --path /var/lib/ceph/osd/ceph-X
  7. Check the result with parted
  8. Edit the OSD deployment again and put back ceph-osd

Then the OSD should be green again (a consolidated sketch of these commands is below).
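
Putting the steps above together, a consolidated sketch; osd id 0, the rook-ceph namespace, the Rook default deployment names (rook-ceph-osd-<id>, rook-ceph-operator) and /dev/sdX are placeholders to adapt:

# 0. The Rook operator may revert manual edits to the OSD deployment,
#    so scaling it down first is a common precaution.
kubectl -n rook-ceph scale deployment rook-ceph-operator --replicas=0

# 1. Grow the LVM stack after extending the disk.
pvresize /dev/sdX
pvdisplay -m                                   # find the ceph LV sitting on this PV
lvextend -L +10G /dev/mapper/<ceph-osd-lv>

# 2. Edit the OSD deployment and replace the ceph-osd command with: sh -c "sleep 10000"
kubectl -n rook-ceph edit deployment rook-ceph-osd-0

# 3. From inside the (now idle) OSD pod, let BlueFS claim the new space.
kubectl -n rook-ceph exec -it deploy/rook-ceph-osd-0 -- ceph-bluestore-tool bluefs-bdev-expand --path /var/lib/ceph/osd/ceph-0

# 4. Edit the deployment again to restore the ceph-osd command, then scale the operator back up.
kubectl -n rook-ceph scale deployment rook-ceph-operator --replicas=1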

BlaineEXE (Member) commented

Closing this since it seems resolved.


harrykas commented Sep 21, 2022

@osaffer I've encountered the same issue and your plan works, thank you very much!
Just want to ask: what is the exact reason for this, and how can it be prevented from happening again?


osaffer commented Sep 21, 2022

Some OSDs were full, if I remember correctly... I would say: monitor your OSDs :D
I have enabled Prometheus and I monitor with Grafana.
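
For reference, a quick way to keep an eye on OSD fullness from the toolbox (same rook-ceph-tools assumption as above); Prometheus/Grafana dashboards typically chart the same numbers:

# Per-OSD utilization (%USE column) laid out over the CRUSH tree.
kubectl -n rook-ceph exec -it deploy/rook-ceph-tools -- ceph osd df tree

# Cluster-wide usage, and nearfull/full warnings if any.
kubectl -n rook-ceph exec -it deploy/rook-ceph-tools -- ceph df
kubectl -n rook-ceph exec -it deploy/rook-ceph-tools -- ceph health detail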

harrykas commented

@osaffer hehe, you're right, that Ceph instance was full. It is for development and another team supports it, so I have no monitoring for it. Yet :)
Thank you!


osaffer commented Sep 22, 2022

@osaffer hehe, you're right, that ceph instance was full. It is for development and another team supports it so I have no monitoring for it. Yet :) Thank you!

You are welcome... my environment is also a development one, so I had not configured any monitoring.
But thanks to these problems, we gain good knowledge :D


thenamehasbeentake commented Oct 13, 2023

We encountered the same problem (see https://tracker.ceph.com/issues/53466). Adding bluefs_shared_alloc_size = 4096 to our ceph.conf for the OSDs lets the OSDs be restored temporarily.
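
In a Rook cluster this temporary workaround can go through the same ceph.conf override mechanism as above (a sketch assuming the default names; the affected OSDs must be restarted to pick it up):

kubectl -n rook-ceph edit configmap rook-config-override
# under data.config add:
#   [osd]
#   bluefs_shared_alloc_size = 4096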


microyahoo commented Nov 3, 2023

The issue is fixed by ceph/ceph#48854; related trackers: https://tracker.ceph.com/issues/53899 and https://tracker.ceph.com/issues/53466
