Release 0.9.4 #865

Merged
merged 22 commits into from
Mar 13, 2024
22 commits
bfcd86b
minor copy update (#838)
philipkiely-baseten Feb 26, 2024
8b1bf5d
Refactor `truss.server` out of templates to run as normal python prog…
bolasim Feb 26, 2024
daab03a
Clean old templates, training code, deprecated servers, and examples…
bolasim Feb 26, 2024
f097ab4
Improve `truss watch`: respect `.truss_ignore` when calculating patch…
helenlyang Feb 28, 2024
92d1eed
Fix integration tests (#843)
helenlyang Feb 28, 2024
a1eda26
Fix os.PathLike type annotation
helenlyang Feb 28, 2024
66c794b
Fall back to default .truss_ignore in truss upload
helenlyang Feb 28, 2024
7271f87
Move build configuration script to truss (#847)
bolasim Mar 2, 2024
d169f37
Truss request validation return error in response, not log (#850)
squidarth Mar 6, 2024
9d9b658
Update external package docs (#849)
bolasim Mar 6, 2024
f5cab3c
Release 0.10.0-rc.0 (#851)
basetenbot Mar 6, 2024
228fa7e
Merge branch 'release'
basetenbot Mar 6, 2024
57d4b74
Stop intercepting prints and adding new lines (#852)
bolasim Mar 6, 2024
09b3f5c
Merge branch 'release'
basetenbot Mar 7, 2024
b911be8
Bola/revert everything (#857)
bolasim Mar 7, 2024
a119df0
Fix integration tests. (#858)
squidarth Mar 8, 2024
695e522
Pydantic v1/v2 compatibility fixes. (#859)
marius-baseten Mar 8, 2024
70b1c59
Fix missing dep in CLI and Builder (#860)
bolasim Mar 11, 2024
9e609ed
Fix multi-byte unicode bug in TRT-LLM postprocessor (#841)
aspctu Mar 12, 2024
c8e58a4
Tensor parallel fix (#863)
aspctu Mar 13, 2024
6ff9cf8
Fix issue with python executable (#864)
bolasim Mar 13, 2024
721fd36
Bump version to 0.9.4
basetenbot Mar 13, 2024
104 changes: 0 additions & 104 deletions .test_durations

This file was deleted.

5 changes: 3 additions & 2 deletions CONTRIBUTING.md
@@ -20,13 +20,14 @@ We use `asdf` to manage Python binaries and `poetry` to manage Python dependenci

For development in a macOS environment, we use `brew` to manage system packages.

```
```bash
# Install asdf (or use another method https://asdf-vm.com/)
brew install asdf

# Install `asdf` managed python and poetry
asdf plugin add python
asdf plugin add poetry
asdf install

# Install poetry dependencies
poetry install
@@ -37,7 +38,7 @@ poetry run pre-commit install

Then to run the entire test suite

```
```bash
poetry run pytest truss/tests
```

2 changes: 1 addition & 1 deletion README.md
@@ -9,7 +9,7 @@

* **Write once, run anywhere:** Package and test model code, weights, and dependencies with a model server that behaves the same in development and production.
* **Fast developer loop:** Implement your model with fast feedback from a live reload server, and skip Docker and Kubernetes configuration with a batteries-included model serving environment.
-* **Support for all Python frameworks**: From `transformers` and `diffusors` to `PyTorch` and `Tensorflow` to `XGBoost` and `sklearn`, Truss supports models created with any framework, even entirely custom models.
+* **Support for all Python frameworks**: From `transformers` and `diffusers` to `PyTorch` and `TensorFlow` to `TensorRT` and `Triton`, Truss supports models created and served with any framework.

See Trusses for popular models including:

8 changes: 3 additions & 5 deletions bin/generate_base_images.py
@@ -87,8 +87,6 @@ def _build(

if job_type == "server":
reqs_copy_from = templates_path / "server" / "requirements.txt"
-elif job_type == "training":
-reqs_copy_from = templates_path / "training" / "requirements.txt"
else:
raise ValueError(f"Unknown job type {job_type}")

@@ -128,7 +126,7 @@ def _build_all(
dry_run: bool = False,
):
if job_types is None:
-job_types = ["server", "training"]
+job_types = ["server"]

if python_versions is None:
python_versions = SUPPORTED_PYTHON_VERSIONS
@@ -173,7 +171,7 @@ def _build_all(
"--job-type",
nargs="?",
default="all",
-choices=["server", "training", "all"],
+choices=["server", "all"],
help="Create images for server",
)
parser.add_argument(
@@ -204,7 +202,7 @@ def _build_all(
python_versions = {args.python_version}

if args.job_type == "all":
-job_types = ["server", "training"]
+job_types = ["server"]
else:
job_types = [args.job_type]
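
With `training` removed, the job-type handling in this script collapses to a single supported value. The following is a minimal, self-contained sketch of that resulting logic, not the script's actual code: `SUPPORTED_JOB_TYPES`, `resolve_job_types`, and `make_parser` are hypothetical names introduced here for illustration, while the `--job-type` argument settings are taken from the hunks above.

```python
import argparse
from typing import List

# Hypothetical constant: the real script inlines this list in several places.
SUPPORTED_JOB_TYPES = ["server"]


def resolve_job_types(job_type: str) -> List[str]:
    """Expand a --job-type value into the list of job types to build."""
    if job_type == "all":
        return list(SUPPORTED_JOB_TYPES)
    if job_type not in SUPPORTED_JOB_TYPES:
        # Mirrors the ValueError raised in _build for unknown job types.
        raise ValueError(f"Unknown job type {job_type}")
    return [job_type]


def make_parser() -> argparse.ArgumentParser:
    """Build a parser with the --job-type argument as it looks after this PR."""
    parser = argparse.ArgumentParser()
    parser.add_argument(
        "--job-type",
        nargs="?",
        default="all",
        choices=SUPPORTED_JOB_TYPES + ["all"],
        help="Create images for server",
    )
    return parser


if __name__ == "__main__":
    args = make_parser().parse_args()
    print(resolve_job_types(args.job_type))
```

Under these assumptions, `all` and `server` now resolve to the same list, and passing `training` is rejected by `argparse` before any build logic runs.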

26 changes: 0 additions & 26 deletions docker/base_images/vllm.Dockerfile

This file was deleted.

25 changes: 0 additions & 25 deletions docs/_snippets/config-params.mdx
@@ -299,31 +299,6 @@ Not required.
### `build`

The `build` section is used to define options for custom servers.
-The two main model servers we support are `TGI` an `vLLM`. These are
-highly optimized servers that are built to support specific LLMs.
-
-See the following examples for how to use each of these:
-* [TGI](/examples/07-high-performance-tgi)
-* [vLLM](/examples/08-high-performance-vllm)
-
-Example configuration for TGI, running Falcon-7B:
-
-```yaml config.yaml
-build:
-arguments:
-endpoint: generate_stream
-model_id: tiiuae/falcon-7b
-model_server: TGI
-```
-
-#### `build.model_server`
-
-Either `VLLM` for vLLM, or `TGI` for TGI.
-
-#### `build.arguments`
-
-The arguments for the model server. This includes information such as which model you intend to load, and
-which endpoin from the server you'd like to use.

### `model_cache`

2 changes: 1 addition & 1 deletion docs/contribute/base-images.mdx
@@ -5,7 +5,7 @@ description: "Add a custom base image to Truss"

Base images are used to create Docker images for containers. Truss uses customized base images to:

-* Reduce Docker image build times for model serving and training
+* Reduce Docker image build times for model serving
* Reduce container startup time
* Re-use common parts of containers

102 changes: 0 additions & 102 deletions docs/examples/07-high-performance-tgi.mdx

This file was deleted.