v1.0.4 #61
chonknick announced in Announcements
✨ Highlights
- `CodeChunker`! `CodeChunker` is specialized to handle code files and can gain structural understanding of the code before chunking each portion separately~ Supports 100+ programming languages! Let's check out the usage~ Firstly, install the code chunker dependencies via `pip install "chonkie[code]"` and then simply run it like any other chunker~
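To give a feel for what "structural understanding before chunking" can mean, here is a minimal sketch using only Python's standard `ast` module. It is not Chonkie's implementation or API: the helper name `chunk_python_source` is hypothetical, it only handles Python source, and it ignores chunk-size limits entirely.

```python
import ast


def chunk_python_source(source: str) -> list[str]:
    """Split Python source into one chunk per top-level statement.

    Hypothetical helper for illustration only; not part of Chonkie.
    """
    tree = ast.parse(source)
    lines = source.splitlines()
    chunks = []
    for node in tree.body:
        # Decorators sit above the `def`/`class` line, so start from the
        # earliest line that belongs to this node.
        decorators = getattr(node, "decorator_list", [])
        start = min([node.lineno] + [d.lineno for d in decorators])
        end = node.end_lineno  # available on Python 3.8+
        chunks.append("\n".join(lines[start - 1:end]))
    return chunks


sample = '''import os

def hello_world():
    print("Hello!")

class Greeter:
    def greet(self):
        return "hi"
'''

chunks = chunk_python_source(sample)
for chunk in chunks:
    print(chunk)
    print("-----")
```

Each function or class stays intact in its own chunk instead of being cut at an arbitrary character offset, which is the general motivation for chunking code by structure rather than by length alone.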
- Jina AI embeddings with `JinaEmbeddings`, allowing for use with `SemanticChunker` and `SDPMChunker`! Install it via `pip install "chonkie[jina]"` and use it with either chunker~
- `OverlapRefinery`, which allows you to add overlap context to your chunks~ It's available in the default install and can be used with any chunker. Just chunk with a chunker and pass your chunks through the refinery!
- `EmbeddingsRefinery`, which allows you to run the chunks through an embedding model and have the embeddings available for downstream loading into a vector database. Similar to the `OverlapRefinery`, just pass the `chunks` from a `Chunker` into an `EmbeddingsRefinery` object loaded with the appropriate embedding model; each `Chunk` will then carry an `.embedding` value that can be used downstream.

What's Changed
- Add initial support for chunking code via `CodeChunker` by @chonknick in #53
- Add Initial support for code chunking via `CodeChunker` by @chonknick in #54
- Fix: Paths for the Chonkie Cloud chunkers; `No module named 'chonkie.cloud.chunkers'` error by @chonknick in #58
- Fix: Attempt fixing `chonkie.cloud` path bug + update README.md to have integrations by @chonknick in #60

New Contributors
Full Changelog: v1.0.3...v1.0.4
This discussion was created from the release v1.0.4.