[LLM] Add code llama example #3050

Merged: 52 commits merged into master from codellama on Feb 1, 2024
Conversation

@Michaelvll (Collaborator) commented Jan 30, 2024

After #3048 is fixed, we should be able to make the model path an env var.
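A minimal sketch of what that could look like, assuming a SkyPilot task YAML; the file name `endpoint.yaml`, the `MODEL_SIZE` variable, and the vLLM serving command are illustrative, not taken from this PR:

```yaml
# endpoint.yaml (hypothetical name). Launch with, e.g.:
#   sky launch -c code-llama endpoint.yaml --env MODEL_SIZE=34b
envs:
  MODEL_SIZE: 70b  # default; overridable at launch time via --env

resources:
  accelerators: {A100:8, A100-80GB:4}  # candidate GPUs; the optimizer picks one

run: |
  python3 -m vllm.entrypoints.openai.api_server \
    --model codellama/CodeLlama-${MODEL_SIZE}-Instruct-hf \
    --host 0.0.0.0
```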

Tested (run the relevant ones):

  • Code formatting: bash format.sh
  • Any manual or new tests for this PR (please specify below)
    • Two commands in the readme
  • All smoke tests: pytest tests/test_smoke.py
  • Relevant individual smoke tests: pytest tests/test_smoke.py::test_fill_in_the_name
  • Backward compatibility tests: bash tests/backward_compatibility_tests.sh

@romilbhardwaj (Collaborator) left a comment

Nice! Thanks @Michaelvll!! Left some comments. Tried out the sky launch bit, works nicely. Trying out with sky serve now.

llm/codellama/README.md:
As shown, the service is now backed by 2 replicas, one on Azure and one on GCP, and the accelerator
type is chosen to be **the cheapest available one** on the clouds. This maximizes the
availability of the service while minimizing the cost.
@MaoZiming (Collaborator) commented Jan 31, 2024

What does it mean by 'maximize availability' here? Does it mean finding the instance faster?

@Michaelvll (Collaborator, Author):

More like it increases the chance of getting the resources: with candidate resources spread across multiple resource pools, provisioning can fall back to whichever pool has capacity.
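A hedged sketch of that idea, assuming a SkyServe YAML that pins no cloud and lists several candidate accelerators (all names and values below are illustrative):

```yaml
# serve.yaml (hypothetical name). Deploy with, e.g.:
#   sky serve up serve.yaml
service:
  readiness_probe: /v1/models
  replicas: 2

resources:
  # Several candidate resource pools: any cloud, any of these GPUs.
  # Provisioning can fall back across pools, so the chance that at least
  # one pool has capacity is much higher than with a single fixed choice.
  accelerators: {L4:8, A10G:8, A100:4, A100-80GB:2}
  ports: 8000
```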

Michaelvll and others added 12 commits January 31, 2024 01:30
Co-authored-by: Romil Bhardwaj <romil.bhardwaj@berkeley.edu>
Co-authored-by: Romil Bhardwaj <romil.bhardwaj@berkeley.edu>
Co-authored-by: Ziming Mao <ziming.mao@yale.edu>
Co-authored-by: Ziming Mao <ziming.mao@yale.edu>
Co-authored-by: Ziming Mao <ziming.mao@yale.edu>
Co-authored-by: Ziming Mao <ziming.mao@yale.edu>
@romilbhardwaj (Collaborator) left a comment

LGTM, thanks @Michaelvll! This is very cool!

* No one else sees your chat history


![Code Llama GUI](https://imgur.com/Dor1MoE.gif "Chatting with Code Llama")
Collaborator:

Should we put this GIF at the top, say at L2, to catch the eye?

@@ -38,7 +38,6 @@ run: |
   python3 -m fastchat.serve.controller --host 0.0.0.0 --port ${CONTROLLER_PORT} > ~/controller.log 2>&1 &
-  cd FastChat
Collaborator:

I am assuming this is intended : )

@Michaelvll (Collaborator, Author):

Yep, it seems we forgot to remove this line earlier, when we changed the dependency.
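For context, a hedged sketch of why the line is dead: assuming the task now installs FastChat from PyPI instead of a source checkout (the extras below are illustrative), there is no `FastChat/` directory to `cd` into:

```yaml
setup: |
  # FastChat installed as a package: no FastChat/ source directory exists
  # on the cluster, so `cd FastChat` in the run section was a leftover
  # from the old clone-and-install-from-source dependency.
  pip install "fschat[model_worker]"
```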

Co-authored-by: Romil Bhardwaj <romil.bhardwaj@berkeley.edu>
@Michaelvll (Collaborator, Author) commented:

Added the example for Tabby. PTAL @romilbhardwaj @MaoZiming : )
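For readers unfamiliar with Tabby, one plausible shape for such an example, assuming a SkyPilot task that serves a Code Llama model via Tabby's model registry (the model name, GPU, and file name are assumptions, not necessarily the PR's exact YAML):

```yaml
# tabby.yaml (hypothetical name). Launch with, e.g.:
#   sky launch -c tabby tabby.yaml
resources:
  accelerators: L4:1
  ports: 8080  # Tabby's default HTTP port

run: |
  # Serve a Code Llama model from Tabby's registry on the GPU.
  tabby serve --device cuda --model TabbyML/CodeLlama-7B
```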

@Michaelvll Michaelvll merged commit b25471c into master Feb 1, 2024
19 checks passed
@Michaelvll Michaelvll deleted the codellama branch February 1, 2024 16:21