Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Ship Nitro as a Binary #9

Closed
3 of 5 tasks
tikikun opened this issue Sep 12, 2023 · 2 comments
Closed
3 of 5 tasks

Ship Nitro as a Binary #9

tikikun opened this issue Sep 12, 2023 · 2 comments
Assignees
Labels
P0: critical Mission critical type: epic A major feature or initiative

Comments

@tikikun
Copy link
Collaborator

tikikun commented Sep 12, 2023

Nitro should be statically built and distribution as a binary

Tasks

  • Build drogon with llama cpp
  • Spin up Mac VM for testing
  • Target mac os, x86, metal supported binary
  • (clarify?) We have llm endpoint using ggml
  • Server can be configured using a config file

Success criteria

  • Nitro is multi platform binary
  • Runs a Drogon C++ server
  • Serves llama-cpp for Metal or CPU only modes
  • Include encoding / decoding
  • An architecture diagram showings whats up
@tikikun tikikun self-assigned this Sep 12, 2023
@0xSage
Copy link
Contributor

0xSage commented Sep 12, 2023

@tikikun can you elaborate a bit more on your issues please.

Template inspo: https://github.com/pytorch/pytorch/issues/new/choose

@0xSage 0xSage added the status: needs info This doesn't seem right label Sep 12, 2023
@tikikun
Copy link
Collaborator Author

tikikun commented Sep 12, 2023

Outcome:

  • Have static build and single binary file (or multiple files) to run nitro

Problem:

  • Currently nitro needs to be shipped with a container, which is huge in size (2.89gb)
  • Nitro can be much lighter with just a single binary, but the way we code it now does not allow for static build.

@0xSage 0xSage changed the title should have static build option Ship Nitro as a Binary Sep 18, 2023
@0xSage 0xSage added P0: critical Mission critical type: feature request A new feature and removed status: needs info This doesn't seem right labels Sep 18, 2023
@0xSage 0xSage added type: epic A major feature or initiative and removed type: feature request A new feature labels Sep 26, 2023
@tikikun tikikun closed this as completed Sep 26, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
P0: critical Mission critical type: epic A major feature or initiative
Projects
None yet
Development

No branches or pull requests

2 participants