Skip to content

Conversation

@kyteinsky
Copy link
Contributor

It was used as the default fallback for nc_texttotext provider under the hood before.
The tokenizer is pre-downloaded in the init step.

Signed-off-by: Anupam Kumar <kyteinsky@gmail.com>
@kyteinsky kyteinsky requested a review from marcelklehr as a code owner August 8, 2025 08:28
Signed-off-by: Anupam Kumar <kyteinsky@gmail.com>
@kyteinsky kyteinsky force-pushed the fix/suppress-tokenizer-warn branch from ffc5448 to 10c332d Compare August 13, 2025 06:56
@kyteinsky kyteinsky force-pushed the fix/suppress-tokenizer-warn branch 4 times, most recently from 443ed7b to 3545c35 Compare August 27, 2025 07:46
@kyteinsky
Copy link
Contributor Author

seems like we're running out of disk space since the larger embedding model wasn't downloaded but the smaller gpt2 tokenizer was, which are downloaded in that order. Let's see if switching to ubuntu-22.04 as the base image helps.

Signed-off-by: Anupam Kumar <kyteinsky@gmail.com>
@kyteinsky kyteinsky force-pushed the fix/suppress-tokenizer-warn branch from 3545c35 to 310d89a Compare August 27, 2025 10:55
@kyteinsky
Copy link
Contributor Author

It was an issue with nc_py_api not downloading the model in the latest version. Fixed its version to the previous release: 0.20.2.

@kyteinsky kyteinsky merged commit ebae8ce into master Aug 27, 2025
10 of 12 checks passed
@kyteinsky kyteinsky deleted the fix/suppress-tokenizer-warn branch August 27, 2025 12:18
@kyteinsky kyteinsky mentioned this pull request Aug 29, 2025
kyteinsky added a commit that referenced this pull request Aug 29, 2025
## 4.5.0-beta.0 - 2025-08-29

### Fixed
- suppress tokenizer max context length warning (#200) @kyteinsky
- fix nc_py_api to 0.20.2 (#200) @kyteinsky

### Changed
- store worker logs and present them better (#200) @kyteinsky

### Added
- add app version to file logs (#201) @kyteinsky

Signed-off-by: Anupam Kumar <kyteinsky@gmail.com>
@kyteinsky kyteinsky mentioned this pull request Sep 23, 2025
kyteinsky added a commit that referenced this pull request Sep 23, 2025
## 4.5.0 - 2025-09-23

### Changed
- stable release of the 4.5.0-beta series
- bump max NC version to 33
- store worker logs and present them better (#200) @kyteinsky

### Fixed
- better parsing of version string for repairs (#209) @kyteinsky
- suppress tokenizer max context length warning (#200) @kyteinsky
- fix nc_py_api to 0.20.2 (#200) @kyteinsky

### Added
- add app version to file logs (#201) @kyteinsky

Signed-off-by: Anupam Kumar <kyteinsky@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants