Skip to content
This repository has been archived by the owner on Sep 30, 2023. It is now read-only.

Releases: ravenscroftj/turbopilot

v0.2.1

29 Aug 09:20
eaeb52f
Compare
Choose a tag to compare

What's Changed

Full Changelog: v0.2.0...v0.2.1

v0.2.0

26 Aug 16:03
86f0774
Compare
Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.1.0...v0.2.0

v0.1.0

05 Aug 09:10
66270d3
Compare
Choose a tag to compare

What's Changed

  • Refactored codebase - now a single unified turbopilot binary that provides support for codegen and starcoder style models.
  • Support for starcoder, wizardcoder and santacoder models
  • Support for CUDA 11 and 12

Full Changelog: v0.0.5...v0.1.0

Version 0.0.5

13 Jun 21:13
Compare
Choose a tag to compare

What's Changed

  • Turbopilot now supports CUDA which significantly accelerates inference for long prompts on machines with NVIDIA cards builds by @ravenscroftj in #27
  • Turbopilot now comes with prebuilt windows binaries by @ravenscroftj in #29
  • Added links to download pre-converted and pre-quantized CodeGen mono models. by @virtualramblas and @CRD716 in #18 and #21

New Contributors

Full Changelog: v0.0.4...v0.0.5

Version 0.0.4

15 Apr 12:55
Compare
Choose a tag to compare
  • Added multi-threaded server support which should prevent health checks aimed at GET / from failing during prediction.
  • Separated autocomplete lambda into a separate C++ function so that it can be bound to /v1/completions, /v1/engines/copilot-codex/completions and /v1/engines/codegen/completions
  • Removed model from completion input as required param which stops the official copilot plugin from freaking out
  • Integrate latest changes from upstream ggml including some fixes for ARM NEON processor
  • Added Mac "universal binary" builds as part of CI
  • Support for fork of vscode-fauxpilot with a progress indicator is now available (PR is open upstream, please react/vote for it) vscode-fauxcode now supports progress indication

V0.0.3

14 Apr 09:35
Compare
Choose a tag to compare
  • Added 350M parameter codegen model to Google Drive folder
  • Added multi-arch docker images so that users can now directly run on Apple silicon and even raspberry pi 4
  • Now support pre-tokenized inputs passed into the API from a Python tokenizer (thanks to @thakkarparth007 for their PR - ravenscroftj/ggml#2)

0.0.2

12 Apr 20:43
Compare
Choose a tag to compare

What's Changed

  • Project now builds on Mac OS (Thanks to @Dimitrije-V for their PR ravenscroftj/ggml#1 and @dabdine for contributing some clearer Mac build instructions)
  • Fix inability to load vocab.json on converting the 16B model due to encoding of the file not being set by @swanserquack in #5
  • Improve performance of model by incorporating changes to GGML library from @ggerganov

New Contributors

Full Changelog: 0.0.1...0.0.2

Version 0.0.1

10 Apr 09:08
826e298
Compare
Choose a tag to compare
Version 0.0.1 Pre-release
Pre-release
  • Initial release of Turbo Pilot