Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DM-42220: Incorporate ModelPackage Butler datasets into Prompt Processing #125

Merged
merged 6 commits into from
Feb 21, 2024

Conversation

kfindeisen
Copy link
Member

This PR adds support for machine learning models (specifically, the pretrainedModelPackage dataset type) to prep_butler. Applicable models have already been added to /repo/embargo and our integration testing repos.

python/activator/middleware_interface.py Outdated Show resolved Hide resolved
Previously standard collection names were represented partly by
constants, and partly by methods. Making them all
attributes/properties, with the same naming convention, provides a more
uniform API.
A model is quite large (hundreds of MB), but because the same model is
used regardless of data ID, it should be a one-time overhead for
the pod.

This commit also chains the dummy model to the test repo's
DECam/defaults, something that should have been done when it was added.
Now that the model is loaded in prep_butler (in both unit and
integration tests), we can build a graph for the complete pipeline
without getting missing dataset errors.
@kfindeisen kfindeisen merged commit 6ef6df2 into main Feb 21, 2024
6 checks passed
@kfindeisen kfindeisen deleted the tickets/DM-42220 branch February 21, 2024 01:18
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
2 participants