Skip to content

Commit

Permalink
Add longer chunking example
Browse files Browse the repository at this point in the history
  • Loading branch information
jncraton committed Apr 21, 2024
1 parent 529a8c7 commit 9fc78cf
Showing 1 changed file with 12 additions and 0 deletions.
12 changes: 12 additions & 0 deletions languagemodels/embeddings.py
Expand Up @@ -100,6 +100,18 @@ def chunk_doc(doc, name="", chunk_size=64, chunk_overlap=8):
>>> chunk_doc("")
[]
>>> chunk_doc(""
... "It was the best of times, it was the worst of times, it was the age "
... "of wisdom, it was the age of foolishness, it was the epoch of belief, "
... "it was the epoch of incredulity, it was the season of Light, it was "
... "the season of Darkness, it was the spring of hope, it was the winter "
... "of despair, we had everything before us, we had nothing before us, we "
... "were all going direct to Heaven, we were all going direct the other "
... "way—in short, the period was so far like the present period, that "
... "some of its noisiest authorities insisted on its being received, for "
... "good or for evil, in the superlative degree of comparison only.")
['It was the best of times...']
>>> chunk_doc("Hello")
['Hello']
Expand Down

0 comments on commit 9fc78cf

Please sign in to comment.