[Question]What does the service parameter max_tokens_in_paged_kv_cache mean?

I have run through the entire process of llama2 and want to stress test and see the benchmark indicators.

Regarding ```max_tokens_in_paged_kv_cache```, I may not understand it well

Is it similar to the [max_num_batched_tokens](https://github.com/vllm-project/vllm/blob/main/vllm/engine/arg_utils.py#L151) parameter of vllm?

Thanks~