Releases: OpenVoiceOS/quebra_frases
Release 0.3.7
Changes in this Release
Changelog
V0.3.7a2 (2023-08-15)
V0.3.7a1 (2023-08-15)
Implemented enhancements:
Fixed bugs:
Merged pull requests:
v0.2.0 (2021-04-15)
* This Changelog was automatically generated by github_changelog_generator
Release 0.3.7a2
Changes in this Release
Changelog
V0.3.7a1 (2023-08-15)
Implemented enhancements:
Fixed bugs:
Merged pull requests:
v0.2.0 (2021-04-15)
Release 0.3.7a1
Changes in this Release
Changelog
Unreleased
Implemented enhancements:
Fixed bugs:
Merged pull requests:
v0.2.0 (2021-04-15)
v0.2.0 - Sentence Breaker
Quebra Frases
quebra_frases chunks strings into byte-sized pieces.
import quebra_frases
samples = ["what do you dream about",
           "what did you dream about",
           "what are your dreams about"]
print(quebra_frases.get_common_chunks(samples))
# ['what', 'about']
delimiters = ["mycroft"]
sentence = "sometimes i develop stuff for mycroft, mycroft is FOSS!"
print(quebra_frases.chunk(sentence, delimiters))
# ['sometimes i develop stuff for', 'mycroft', ',', 'mycroft', 'is FOSS!']
print(quebra_frases.word_tokenize(sentence))
# ['sometimes', 'i', 'develop', 'stuff', 'for', 'mycroft', ',',
# 'mycroft', 'is', 'FOSS', '!']
print(quebra_frases.char_indexed_word_tokenize(sentence))
# [(0, 'sometimes'), (10, 'i'), (12, 'develop'), (20, 'stuff'),
# (26, 'for'), (30, 'mycroft'), (37, ','), (39, 'mycroft'), (47, 'is'),
# (50, 'FOSS'), (54, '!')]
print(quebra_frases.span_indexed_word_tokenize(sentence))
# [(0, 9, 'sometimes'), (10, 11, 'i'), (12, 19, 'develop'),
# (20, 25, 'stuff'), (26, 29, 'for'), (30, 37, 'mycroft'),
# (37, 38, ','), (39, 46, 'mycroft'), (47, 49, 'is'),
# (50, 54, 'FOSS'), (54, 55, '!')]
print(quebra_frases.sentence_tokenize(
    "Mr. Smith bought cheapsite.com for 1.5 million dollars, i.e. he paid a lot for it. Did he mind? Adam Jones Jr. thinks he didn't. In any case, this isn't true... Well, with a probability of .9 it isn't."))
# ['Mr. Smith bought cheapsite.com for 1.5 million dollars, i.e. he paid a lot for it.',
#  'Did he mind?',
#  "Adam Jones Jr. thinks he didn't.",
#  "In any case, this isn't true...",
#  "Well, with a probability of .9 it isn't."]
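
The span tuples printed by span_indexed_word_tokenize above read as (start, end, token) with an exclusive end index, so slicing the original string with a span should give back the token. The sketch below illustrates that property without calling quebra_frases at all: `span_tokens` is a hypothetical helper built on the standard `re` module, and its tokenization rule (runs of word characters, or single punctuation marks) is an assumption chosen to match the sample output, not the library's actual algorithm.

```python
import re


def span_tokens(text):
    """Hypothetical helper (not part of quebra_frases).

    Returns (start, end, token) tuples, where end is exclusive.
    Assumed rule: a token is a run of word characters or one
    non-space punctuation character.
    """
    return [(m.start(), m.end(), m.group())
            for m in re.finditer(r"\w+|[^\w\s]", text)]


sentence = "sometimes i develop stuff for mycroft, mycroft is FOSS!"
spans = span_tokens(sentence)

# Each span slices back to its own token in the original string.
for start, end, token in spans:
    assert sentence[start:end] == token

print(spans[5:8])
# [(30, 37, 'mycroft'), (37, 38, ','), (39, 46, 'mycroft')]
```

Because the end index is exclusive, spans can be passed straight to Python slicing, which is handy when highlighting or replacing tokens in the source string.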