Huggingface downloader & Simpler log message & InterruptMixin #2
This pull request brings several usability enhancements and code refactorings. The primary changes are:
- **Automatic Model Downloader:** Previously, the `model_path` attribute in `model_definitions.py` required the actual filename of a model. It now also accepts the name of a HuggingFace repository, and the specified model is downloaded automatically when needed. For example, if you define `TheBloke/NewHope-GPTQ` as the `model_path`, the necessary files are downloaded into `models/gptq/thebloke_newhope_gptq`. This works the same way for GGML models.
- **Simpler Log Messages:** Log messages for the Completions, Chat Completions, and Embeddings endpoints are now more concise. Each log line shows the essentials: elapsed time, token usage, and tokens generated per second.
- **Improved Responsiveness for Job Cancellation:** The `Event` object in the `SyncManager` now sends an interrupt signal to worker processes. Each worker checks the `is_interrupted` property at the lowest accessible level and attempts to cancel the operation.

These changes make the application more intuitive to use and more responsive overall. Model handling is streamlined by allowing automatic downloads from a repository rather than relying on specific filenames. Job cancellation is now more reactive, potentially saving compute and time when a process must be halted. Finally, the log messages are cleaner and more informative, providing the essential data for monitoring performance and usage.
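The repository-name resolution described above could be sketched roughly like this. This is a minimal illustration, not the PR's actual code: `resolve_model_path` and `repo_to_folder` are hypothetical helper names, and it assumes the `huggingface_hub` package is used for downloading.

```python
from pathlib import Path


def repo_to_folder(repo_id: str) -> str:
    # "TheBloke/NewHope-GPTQ" -> "thebloke_newhope_gptq"
    return repo_id.replace("/", "_").replace("-", "_").lower()


def resolve_model_path(model_path: str, models_dir: str = "models/gptq") -> Path:
    """Return a local path for model_path, downloading from the
    HuggingFace Hub first when it names a repository rather than a file."""
    local = Path(model_path)
    if local.exists():
        return local
    target = Path(models_dir) / repo_to_folder(model_path)
    if not target.exists():
        # Lazy import, so already-downloaded models work without the hub package.
        from huggingface_hub import snapshot_download
        snapshot_download(repo_id=model_path, local_dir=str(target))
    return target
```

With this shape, `resolve_model_path("TheBloke/NewHope-GPTQ")` would populate `models/gptq/thebloke_newhope_gptq` on first use and reuse it afterwards.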
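The condensed log line (elapsed time, token usage, tokens per second) can be sketched as a small formatter. `format_completion_log` is a hypothetical name and the exact layout is an assumption:

```python
def format_completion_log(prompt_tokens: int, completion_tokens: int,
                          elapsed: float) -> str:
    """One-line summary of a Completions / Chat Completions / Embeddings call."""
    total = prompt_tokens + completion_tokens
    # Guard against division by zero for instantaneous (cached) responses.
    tps = completion_tokens / elapsed if elapsed > 0 else 0.0
    return (f"elapsed: {elapsed:.2f}s | "
            f"tokens: {prompt_tokens} prompt + {completion_tokens} completion "
            f"= {total} | {tps:.1f} tokens/s")
```

For example, `format_completion_log(10, 20, 2.0)` yields `elapsed: 2.00s | tokens: 10 prompt + 20 completion = 30 | 10.0 tokens/s`.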
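The cancellation flow can be illustrated with a `multiprocessing` `SyncManager` `Event` shared with a worker process. The `InterruptMixin` name comes from this PR's title, but its interface here, along with `worker`, is an assumption for illustration only:

```python
import multiprocessing as mp
import time


class InterruptMixin:
    """Exposes an is_interrupted property backed by a shared Event."""

    def __init__(self, interrupt_event):
        self._interrupt_event = interrupt_event

    @property
    def is_interrupted(self) -> bool:
        return self._interrupt_event.is_set()


def worker(interrupt_event, result_queue):
    job = InterruptMixin(interrupt_event)
    for step in range(1000):
        # Check at the innermost loop so cancellation takes effect promptly.
        if job.is_interrupted:
            result_queue.put(("cancelled", step))
            return
        time.sleep(0.01)  # simulated unit of work
    result_queue.put(("done", 1000))


if __name__ == "__main__":
    with mp.Manager() as manager:  # manager is a SyncManager
        event = manager.Event()
        results = manager.Queue()
        p = mp.Process(target=worker, args=(event, results))
        p.start()
        time.sleep(0.1)
        event.set()  # request cancellation from the parent
        p.join(timeout=5)
        status, step = results.get()
        print(status)  # "cancelled"
```

Because the check sits inside the tightest loop, the worker reacts within one unit of work instead of finishing the whole job after an interrupt is requested.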