@heiruwu heiruwu commented Apr 25, 2024

Because

  • writing the metadata response by hand is a burden for users

This commit

  • adds predefined constructors for LLM-related tasks, improving the developer experience when implementing custom models
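
To illustrate the idea, here is a minimal sketch of what a predefined metadata constructor for one LLM task could look like. Every name in it (`TensorSpec`, `MetadataResponse`, `construct_text_generation_metadata`, and the input/output field names) is hypothetical and chosen for illustration only; none of it is taken from the actual changes in this PR.

```python
from dataclasses import dataclass, field
from typing import List

# NOTE: all classes, functions, and field names here are hypothetical.
# They only illustrate the "predefined constructor" pattern and are not
# the SDK's real API.


@dataclass
class TensorSpec:
    """Describes one input or output tensor of a model."""
    name: str
    datatype: str
    shape: List[int]


@dataclass
class MetadataResponse:
    """A simplified stand-in for a model metadata response."""
    name: str
    versions: List[str]
    inputs: List[TensorSpec] = field(default_factory=list)
    outputs: List[TensorSpec] = field(default_factory=list)


def construct_text_generation_metadata(
    model_name: str, version: str = "1"
) -> MetadataResponse:
    """Build the metadata for a text-generation task in one call.

    The task fixes the input/output layout, so the caller only has to
    supply the model name (and optionally a version).
    """
    return MetadataResponse(
        name=model_name,
        versions=[version],
        inputs=[
            TensorSpec("prompt", "BYTES", [1]),
            TensorSpec("max_new_tokens", "UINT32", [1]),
            TensorSpec("temperature", "FP32", [1]),
        ],
        outputs=[TensorSpec("text", "BYTES", [-1, -1])],
    )


# Usage: a custom model returns the prebuilt response instead of
# spelling out every tensor by hand.
metadata = construct_text_generation_metadata("my-llm")
```

The point of the pattern is that the task type determines the tensor layout, so a custom model only needs to pass in what actually varies.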

linear bot commented Apr 25, 2024

@heiruwu heiruwu merged commit be122d1 into main Apr 25, 2024
@heiruwu heiruwu deleted the heiru/INS-4336 branch April 25, 2024 18:46
heiruwu pushed a commit that referenced this pull request Apr 25, 2024
🤖 I have created a release *beep* *boop*
---


## [0.8.0](v0.7.1...v0.8.0) (2024-04-25)


### Features

* **deps:** upgrade ray version
([cc61b85](cc61b85))
* **ray:** adapt to native docker client instead of docker sdk
([#138](#138))
([7d19ccb](7d19ccb))
* **ray:** add accelerator and custom resource support
([#118](#118))
([f974f98](f974f98))
* **ray:** add llava 13b to predeploy list
([3fd5914](3fd5914))
* **ray:** add metadata and infer constructor for llm tasks
([#137](#137))
([be122d1](be122d1))
* **ray:** generate sha256 as tag if not presented
([#120](#120))
([6abb538](6abb538))
* **ray:** inject accelerator type at runtime
([#121](#121))
([f78a2d0](f78a2d0))
* **ray:** support containerized model serving
([#116](#116))
([ad0f250](ad0f250))
* **ray:** support custom accelerator type
([#134](#134))
([ae6c139](ae6c139))
* **ray:** use env for resource and deprecate deploy/undeploy
([#124](#124))
([a58bc50](a58bc50))
* **ray:** use tmp folder for image building
([#122](#122))
([9512cec](9512cec))


### Bug Fixes

* **deps:** downgrade ray to avoid grpc servicer issue
([#128](#128))
([9ead421](9ead421))
* **dockerfile:** avoid build hang at ARG statement
([#130](#130))
([f02a27c](f02a27c))
* **ray:** fix entrypoint module not found
([#126](#126))
([f1ed83d](f1ed83d))
* **ray:** fix missing default resource value
([#129](#129))
([b2f564a](b2f564a))
* **ray:** fix multi-platform build stage
([6f358fd](6f358fd))
* **ray:** support target platform for image building
([#127](#127))
([f4825fc](f4825fc))

---
This PR was generated with [Release
Please](https://github.com/googleapis/release-please). See
[documentation](https://github.com/googleapis/release-please#release-please).