Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

why send flow control error when connection window is 0 #13676

Open
iJIAJIA opened this issue Jan 18, 2024 · 6 comments
Open

why send flow control error when connection window is 0 #13676

iJIAJIA opened this issue Jan 18, 2024 · 6 comments
Labels
component/sdk Related with apache/dubbo help wanted Everything needs help from contributors type/enhancement Everything related with code enhancement or performance

Comments

@iJIAJIA
Copy link

iJIAJIA commented Jan 18, 2024

对于dubbo自定义的http2 流控窗口逻辑有点疑惑.

复现步骤:
dubbo version: 3.2.x

  1. 调整peer initialize window size from 8M to 64k(65535) 越小越容易复现.
  2. provider实现一个大数据返参接口.(如 listUser)

TriHttp2RemoteFlowController

// FlowState
int writeAllocatedBytes(int allocated) {
            ...
            try {
                ...
            } finally {
                ...
                // 当connection window为0时, 此时会直接发送flow control error, 阻断连接. 
                if(monitor.isOverFlowControl()){
                    logger.info(String.format("[HTTP2] connection window throttling, stream:%d overflow flow controlling", stream.id()));
//                    cause = new Throwable();
//                    cancel(FLOW_CONTROL_ERROR,cause);
                }
            }
            return writtenBytes;
        }

provider报:

Caused by: io.netty.handler.codec.http2.Http2Exception$StreamException: TotalPendingBytes size overflow for stream: 3
at io.netty.handler.codec.http2.Http2Exception.streamError(Http2Exception.java:153)
at org.apache.dubbo.rpc.protocol.tri.TriHttp2RemoteFlowController$ListenerWritabilityMonitor.checkConnectionThenStreamWritabilityChanged(TriHttp2RemoteFlowController.java:794)
at org.apache.dubbo.rpc.protocol.tri.TriHttp2RemoteFlowController$ListenerWritabilityMonitor.stateCancelled(TriHttp2RemoteFlowController.java:755)
... 35 common frames omitted

这里很好奇, http2 官方协议并没有规定, 当 connection window为0 时 直接发送 flow_control_error. 常规做法不应该是暂停发送, 等待
窗口更新后 继续传输么?

@iJIAJIA iJIAJIA added the type/discussion Everything related with code discussion or question label Jan 18, 2024
@iJIAJIA
Copy link
Author

iJIAJIA commented Jan 18, 2024

对于dubbo自定义的http2 流控窗口逻辑有点疑惑.

复现步骤: dubbo version: 3.2.x

  1. 调整peer initialize window size from 8M to 64k(65535) 越小越容易复现.
  2. provider实现一个大数据返参接口.(如 listUser)

TriHttp2RemoteFlowController

// FlowState
int writeAllocatedBytes(int allocated) {
            ...
            try {
                ...
            } finally {
                ...
                // 当connection window为0时, 此时会直接发送flow control error, 阻断连接. 
                if(monitor.isOverFlowControl()){
                    logger.info(String.format("[HTTP2] connection window throttling, stream:%d overflow flow controlling", stream.id()));
//                    cause = new Throwable();
//                    cancel(FLOW_CONTROL_ERROR,cause);
                }
            }
            return writtenBytes;
        }

provider报:

Caused by: io.netty.handler.codec.http2.Http2Exception$StreamException: TotalPendingBytes size overflow for stream: 3
at io.netty.handler.codec.http2.Http2Exception.streamError(Http2Exception.java:153)
at org.apache.dubbo.rpc.protocol.tri.TriHttp2RemoteFlowController$ListenerWritabilityMonitor.checkConnectionThenStreamWritabilityChanged(TriHttp2RemoteFlowController.java:794)
at org.apache.dubbo.rpc.protocol.tri.TriHttp2RemoteFlowController$ListenerWritabilityMonitor.stateCancelled(TriHttp2RemoteFlowController.java:755)
... 35 common frames omitted

这里很好奇, http2 官方协议并没有规定, 当 connection window为0 时 直接发送 flow_control_error. 常规做法不应该是暂停发送, 等待 窗口更新后 继续传输么?

@AlbumenJ 能否帮忙联系下作者回复下? java to java因为默认窗口设置的8M, 所以暂时未触发此异常. 但我们跨语言调用碰到了. golang那边的初始化窗口大小只有64k. 这里的逻辑, 给不到远端做窗口更新的机会.

@AlbumenJ
Copy link
Member

#11029

@AlbumenJ
Copy link
Member

@EarthChen PTAL

@EarthChen
Copy link
Member

这里其实是反压逻辑没做彻底,为了保护让consumer 中断,从而不再发请求

@iJIAJIA
Copy link
Author

iJIAJIA commented Jan 19, 2024

这里其实是反压逻辑没做彻底,为了保护让consumer 中断,从而不再发请求

那我先将我们内部版本调整回netty的默认流控策略了. 现在在跨语言调用下, 实际体验并不好.

@EarthChen
Copy link
Member

这里其实是反压逻辑没做彻底,为了保护让consumer 中断,从而不再发请求

那我先将我们内部版本调整回netty的默认流控策略了. 现在在跨语言调用下, 实际体验并不好.

默认策略是放入队列等待,当 consumer 速率大于 provider 时,可能导致 oom

@AlbumenJ AlbumenJ added help wanted Everything needs help from contributors type/enhancement Everything related with code enhancement or performance component/sdk Related with apache/dubbo and removed type/discussion Everything related with code discussion or question labels Mar 11, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
component/sdk Related with apache/dubbo help wanted Everything needs help from contributors type/enhancement Everything related with code enhancement or performance
Projects
Status: No status
Development

No branches or pull requests

3 participants