Skip to content

Conversation

@heiruwu
Copy link
Contributor

@heiruwu heiruwu commented Jan 26, 2024

Because

  • we want to dynamically adjust the memory value for each model

This commit

  • determine ram usage by file size

@linear
Copy link

linear bot commented Jan 26, 2024

@heiruwu heiruwu merged commit 7a0023d into main Jan 26, 2024
@heiruwu heiruwu deleted the INS-3529 branch January 26, 2024 11:16
heiruwu pushed a commit that referenced this pull request Jan 30, 2024
🤖 I have created a release *beep* *boop*
---


##
[0.7.0](v0.6.0...v0.7.0)
(2024-01-30)


### Features

* **ray:** determine ram usage by file size
([#89](#89))
([7a0023d](7a0023d))
* **ray:** determine vram usage by file size
([#87](#87))
([71e84e6](71e84e6))


### Bug Fixes

* **ray:** add vram ceiling and override list
([#94](#94))
([5804e4a](5804e4a))
* **ray:** fix application name
([f2bb563](f2bb563))
* **ray:** fix gpu resource > 1
([#91](#91))
([b121f56](b121f56))
* **ray:** fix ray autoscaling
([#95](#95))
([5bd8c2a](5bd8c2a))

---
This PR was generated with [Release
Please](https://github.com/googleapis/release-please). See
[documentation](https://github.com/googleapis/release-please#release-please).
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

No open projects
Status: 👋 Done

Development

Successfully merging this pull request may close these issues.

3 participants