Skip to content

Conversation

@kyteinsky
Copy link
Contributor

This gpt2 tokenizer is used to estimate the tokens a query takes up.

This gpt2 tokenizer is used to estimate the tokens
a query takes up.

Signed-off-by: Anupam Kumar <[email protected]>
@kyteinsky kyteinsky requested a review from julien-nc June 18, 2025 19:22
@kyteinsky kyteinsky requested a review from marcelklehr as a code owner June 18, 2025 19:22
@kyteinsky kyteinsky merged commit 64f0f52 into master Jun 19, 2025
9 checks passed
@kyteinsky kyteinsky deleted the chore/pre-download-tokenizer branch June 19, 2025 11:13
@kyteinsky kyteinsky mentioned this pull request Jul 21, 2025
kyteinsky added a commit that referenced this pull request Jul 21, 2025
## 4.4.0 - 2025-07-21

### Fixed
- improve source tracking so no file stat is lost (#190) @kyteinsky
- improve OCS signing error messages (#189) @kyteinsky
- handle encrypted pdf decryption error (#195) @kyteinsky

### Changed
- maintenance update (#184) @kyteinsky
- update issue template to attach logs (#193) @lukasdotcom
- bump llama_cpp_python (#196) @kyteinsky

### Added
- add doc search endpoint (#185) @kyteinsky
- pre download the tokenizer instead of mid operation (#191) @kyteinsky
- add endpoint for downloading logs (#192) @lukasdotcom
- use supervisord to manage the processes (#194) @kyteinsky

Signed-off-by: Anupam Kumar <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants