bluetooth: host: Split cmd complete & cmd status to separate pool #74645

LingaoM · 2024-06-21T07:30:52Z

The default main branch, cmd complete & cmd status use common normal net_buf pool, will be increase dead-lock, so split it to separate.

This always ensures that the event flow is not blocked.

subsys/bluetooth/common/Kconfig

The default main branch, cmd complete & cmd status use common normal net_buf pool, will be increase dead-lock, so split it to separate. Signed-off-by: Lingao Meng <menglingao@xiaomi.com>

Thalley

I'm not yet sure whether this is a good change or not.

Do you have any logs or (even better) some tests that show the issue you are trying to solve, which shows that the issue is solved by this PR?

LingaoM · 2024-06-22T09:04:07Z

I'm not yet sure whether this is a good change or not.

Do you have any logs or (even better) some tests that show the issue you are trying to solve, which shows that the issue is solved by this PR?

#74287 (comment)

Maybe this answer by @alwa-nordic better.

Thalley · 2024-06-22T09:31:33Z

subsys/bluetooth/common/Kconfig

Is the range and default value here due to the fallback mechanism you have defined?

If we did not fallback to the event buffers in case of error, could this be reduced? Do we have any events larger than ~32 without ext_adv?

I have also thought of a similar approach, but considering that almost all configurations now use extended advertising as a default option, the value of this optimization is not high.

The current design will add about 140 bytes of RAM overhead, but it ensures relative safety, so I think it is worth it and change this way will not be noticeable to users.

Thalley · 2024-06-22T09:32:07Z

subsys/bluetooth/host/buf.c

Is that valid behavior of the controller?

there is the command complete for NOP opcode. This is legal to send on boot, and we have a quirk in the host to wait for it, as it is somewhat useful over H4.

Also: #64158

Currently normal event pool will also use by extended advertising event report, this is a huge activity, Also other events such as periodic advertising report. So for command complete and command status also use same normal pool not a good way.

Thalley

The change looks good to me, but it looks to me that we can get some significant optimization from the default size of BT_BUF_CMD_COMPLETE_EVT_SIZE

subsys/bluetooth/common/Kconfig

jori-nordic · 2024-06-25T06:26:39Z

subsys/bluetooth/common/Kconfig

To me this reads as "if you use a split build (ie hci on another chip), then you better use 255".
So I propose you also add default 255 if !BT_CTLR. Same remark for the other kconfig.

After zephyrproject-rtos#72135 there are no-used at all, so deleted. Signed-off-by: Lingao Meng <menglingao@xiaomi.com>

alwa-nordic · 2024-06-28T08:11:48Z

Do we need a pool for Command Complete? I think we don't need the new pool and can save the memory by putting Command Complete events into the single-buffer sync_evt_pool.

The current implementation only needs one event buffer, since we copy out the data immediately in hci_cmd_done. But even with the changes proposed in #74287, #74287 (comment) says we will never block the buffer.

LingaoM · 2024-06-28T10:54:58Z

Do we need a pool for Command Complete? I think we don't need the new pool and can save the memory by putting Command Complete events into the single-buffer sync_evt_pool.

Three reasons to keep a separate pool for command complete events:

Please do not couple command complete event and bluetooth connection, since sync_evt_pool only used with connection, but command will use in any scenes. and also the default size of SYNC_EVT_SIZE less than 68, this does not save memory, but it is more likely to cause dead-lock anyway.
Currently normal event pool will also use by extended advertising event report, this is a huge activity, Also other events such as periodic advertising report. So for command complete and command status also use same normal pool not a good way.
The current design will add about 140 bytes of RAM overhead, but it ensures relative safety, so I think it is worth it and change this way will not be noticeable to users.

LingaoM · 2024-06-28T11:00:35Z

The current implementation only needs one event buffer, since we copy out the data immediately in hci_cmd_done. But even with the changes proposed in #74287, #74287 (comment) says we will never block the buffer.

I fully understand that currently only one is supported, but the current configuration is two:

config BT_BUF_CMD_TX_COUNT
	int "Number of HCI command buffers"
	default 2
	range 2 64
	help
	  Number of buffers available for outgoing HCI commands from the Host.

Another advantage of using two here is that we can always ensure that there is always a free buffer for command complete event even in some extreme scenarios, if the Controller reports a message inexplicably, we can still tolerate the error., so I use K_NO_WAIT instead of timeout, to avoiding dead-lock.

jori-nordic · 2024-07-01T08:08:37Z

but it is more likely to cause dead-lock anyway

What deadlock are you hinting at?

jori-nordic · 2024-07-01T08:10:01Z

even in some extreme scenarios, if the Controller reports a message inexplicably, we can still tolerate the error., so I use K_NO_WAIT instead of timeout, to avoiding dead-lock

Can you elaborate on this deadlock scenario? What sequence of events/data would get us into this state. Have you seen it in manual testing?

I don't understand the issue this patch is supposed to address.

LingaoM · 2024-07-04T09:27:08Z

even in some extreme scenarios, if the Controller reports a message inexplicably, we can still tolerate the error., so I use K_NO_WAIT instead of timeout, to avoiding dead-lock

Can you elaborate on this deadlock scenario? What sequence of events/data would get us into this state. Have you seen it in manual testing?

Please see:#68008 (comment)
In this PR, allocate for command complete events will not block at all, because use k_NO_WAIT.

LingaoM · 2024-07-04T10:38:30Z

but it is more likely to cause dead-lock anyway

What deadlock are you hinting at?

#70153

zephyrbot added area: Bluetooth area: Bluetooth Host Bluetooth Host (excluding BR/EDR) labels Jun 21, 2024

zephyrbot requested review from Thalley, Vudentz, alwa-nordic, asbjornsabo, hermabe, jhedberg, jori-nordic, sjanc and theob-pro June 21, 2024 07:31

zephyrbot assigned jhedberg Jun 21, 2024

LingaoM force-pushed the split_pool branch 2 times, most recently from a3ca302 to cd33a8b Compare June 21, 2024 07:39

LingaoM added the Enhancement Changes/Updates/Additions to existing features label Jun 21, 2024

LingaoM mentioned this pull request Jun 21, 2024

Bluetooth: Host: Fixed where bt_send returns an error but is actually… #74287

Closed

LingaoM force-pushed the split_pool branch 3 times, most recently from f49cf94 to 37d5879 Compare June 21, 2024 08:31

Thalley reviewed Jun 21, 2024

View reviewed changes

subsys/bluetooth/common/Kconfig Outdated Show resolved Hide resolved

LingaoM force-pushed the split_pool branch 3 times, most recently from 96db9d2 to bdec445 Compare June 21, 2024 09:18

bluetooth: host: Split cmd complete & cmd status to separate pool

81c4841

The default main branch, cmd complete & cmd status use common normal net_buf pool, will be increase dead-lock, so split it to separate. Signed-off-by: Lingao Meng <menglingao@xiaomi.com>

LingaoM force-pushed the split_pool branch from 94b354d to 7d05f85 Compare June 21, 2024 09:38

Thalley reviewed Jun 22, 2024

View reviewed changes

Thalley reviewed Jun 23, 2024

View reviewed changes

subsys/bluetooth/common/Kconfig Outdated Show resolved Hide resolved

jhedberg previously approved these changes Jun 24, 2024

View reviewed changes

LingaoM requested a review from Thalley June 25, 2024 01:00

jori-nordic reviewed Jun 25, 2024

View reviewed changes

bluetooth: host: remove unused ztest

5ceeb2b

After zephyrproject-rtos#72135 there are no-used at all, so deleted. Signed-off-by: Lingao Meng <menglingao@xiaomi.com>

LingaoM dismissed jhedberg’s stale review via 5ceeb2b June 25, 2024 11:41

LingaoM force-pushed the split_pool branch from 7d05f85 to 5ceeb2b Compare June 25, 2024 11:41

LingaoM requested review from jhedberg and jori-nordic June 25, 2024 11:41

Thalley approved these changes Jun 25, 2024

View reviewed changes

jori-nordic previously approved these changes Jun 25, 2024

View reviewed changes

jhedberg approved these changes Jun 25, 2024

View reviewed changes

aescolar added this to the v4.0.0 milestone Jun 26, 2024

LingaoM closed this Jul 5, 2024

bluetooth: host: Split cmd complete & cmd status to separate pool #74645

bluetooth: host: Split cmd complete & cmd status to separate pool #74645

Uh oh!

Conversation

LingaoM commented Jun 21, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Thalley left a comment

Choose a reason for hiding this comment

Uh oh!

LingaoM commented Jun 22, 2024

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

LingaoM Jun 22, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Thalley left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

alwa-nordic commented Jun 28, 2024

Uh oh!

LingaoM commented Jun 28, 2024

Uh oh!

LingaoM commented Jun 28, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jori-nordic commented Jul 1, 2024

Uh oh!

jori-nordic commented Jul 1, 2024

Uh oh!

LingaoM commented Jul 4, 2024

Uh oh!

LingaoM commented Jul 4, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

LingaoM commented Jun 21, 2024 •

edited

Loading

LingaoM Jun 22, 2024 •

edited

Loading

LingaoM commented Jun 28, 2024 •

edited

Loading