Skip to content

fix(lazy_vlm): update HKVCacheFast and Qwen2.5VL model implementations#441

Merged
chenghuaWang merged 3 commits intoUbiquitousLearning:v2from
chenghuaWang:v2
Sep 12, 2025
Merged

fix(lazy_vlm): update HKVCacheFast and Qwen2.5VL model implementations#441
chenghuaWang merged 3 commits intoUbiquitousLearning:v2from
chenghuaWang:v2

Conversation

@chenghuaWang
Copy link
Copy Markdown
Collaborator

  • Update HKVCacheFast::updateKVCache to use memcpy for faster cache updates
  • Modify Qwen2_5VLForCausalLM and Qwen2_5VLText classes to handle position IDs and window indexing
  • Update MultimodalRoPEOp to support different input shapes
  • Adjust PagedAttnTest to include new test cases for query shapes

- Create C.hpp as the header file for MLLM FFI C API
- Create PackedArgs.hpp for defining PackedArgs structure
… executables

- Added `lazy_vlm_qwen2vl_fast` and `lazy_vlm_qwen2_5vl_fast` executables.
- Updated `CMakeLists.txt` to include the new executables.
- Modified `HKVCacheFast.cpp` and `HKVCacheFast.hpp` to add new methods for manual cache length update, visit hidden state cache, and get current sequence count.
- Updated `LazyVLMQwen2_5VLFast.cpp` with the main function and necessary includes.
- Updated `modeling_qwen2_5vl_fast.hpp` to include `MultimodalRoPEOp` and modify the `makeWindowIndex` function.
- Updated `run_remote_android.py` to push the new executables to the remote Android device.
- Updated `MultimodalRoPEOp.cpp`, `MultimodalRoPEOp.hpp`, and `MultimodalRoPE.cpp` to support different input types for the `MultimodalRoPEOp`.
- Updated `MultimodalRoPEOp.hpp` and `MultimodalRoPE.hpp` to include the new input type enum and constructor parameters.
- Update HKVCacheFast::updateKVCache to use memcpy for faster cache updates
- Modify Qwen2_5VLForCausalLM and Qwen2_5VLText classes to handle position IDs and window indexing
- Update MultimodalRoPEOp to support different input shapes
- Adjust PagedAttnTest to include new test cases for query shapes
@chenghuaWang chenghuaWang merged commit f6ae390 into UbiquitousLearning:v2 Sep 12, 2025
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant