Add CAST+ and SOSC chunking strategies#194

Merged
m1rl0k merged 1 commit into test from chunk on Jan 24, 2026
m1rl0k (Collaborator) commented on Jan 24, 2026

Introduces CAST+ hybrid chunker and Search-Optimized Semantic Chunker (SOSC) for concept-aware code chunking. Adds universal concept extraction via language mappings, updates configuration and documentation to support new chunking strategies, and integrates chunker selection in the pipeline. Includes 34 language mapping modules for tree-sitter-based extraction.
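
The diff itself is not reproduced in this conversation, so the following is only a minimal sketch of how name-based chunker selection in a pipeline might look. Every identifier in it (`CHUNKERS`, `register`, `select_chunker`, the `chunker` config key) is hypothetical, and the two registered strategies are trivial stand-ins, not CAST+ or SOSC.

```python
from typing import Callable, Dict, List

# A chunker takes source text and returns a list of chunk strings.
Chunker = Callable[[str], List[str]]

# Hypothetical registry mapping a strategy name to its implementation.
CHUNKERS: Dict[str, Chunker] = {}


def register(name: str) -> Callable[[Chunker], Chunker]:
    """Decorator that records a chunker under a configuration name."""
    def decorator(fn: Chunker) -> Chunker:
        CHUNKERS[name] = fn
        return fn
    return decorator


@register("lines")
def chunk_by_lines(source: str, size: int = 2) -> List[str]:
    # Stand-in strategy: fixed-size windows of lines.
    lines = source.splitlines()
    return ["\n".join(lines[i:i + size]) for i in range(0, len(lines), size)]


@register("blank_line")
def chunk_by_blank_lines(source: str) -> List[str]:
    # Stand-in strategy: split wherever a blank line separates blocks.
    return [block.strip() for block in source.split("\n\n") if block.strip()]


def select_chunker(config: Dict[str, str], default: str = "lines") -> Chunker:
    """Look up the chunker named in the config, falling back to a default."""
    name = config.get("chunker", default)
    try:
        return CHUNKERS[name]
    except KeyError:
        raise ValueError(
            f"unknown chunker {name!r}; choose from {sorted(CHUNKERS)}"
        ) from None
```

With a registry like this, the pipeline would call `select_chunker(config)(source)` and new strategies could be added without touching the selection logic; whether the merged code is organized this way is an assumption.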

m1rl0k merged commit cecb223 into test on Jan 24, 2026
1 check passed