A high-performance inference system for large language models, designed for production environments.
-
Updated
Jun 8, 2024 - C++
A high-performance inference system for large language models, designed for production environments.
Multi-Branch CI/CD (2 Branches, 2 Pipelines) allows developers to efficiently refine and test applications before deployment to live servers. By leveraging AWS CodeDeploy, CodePipeline, and GitHub - developers seamlessly integrate and deliver their code through CI/CD pipelines.
Tgi public docs about anything and everything
Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.
Free, open-source time keeping for live events
Easily creation of pre-production and production scripts to automate your deployment
An overview of the possibilities offered by artificial intelligence (AI) to serve as a technical basis for a digital product offering: from understanding, personalization, design of machine learning models and its deployment through an API built with FastAPI into the Cloud
A browser extension that lets you know when you're connected to production by giving you a clear visual warning.
📦 Create Projects with Webpack, TypeScript, Preact, Redux-Zero and Babel
Open Source, Google Zanzibar-inspired permissions database to enable fine-grained authorization for customer applications
A ProseMirror plugin for adding user-defined 'elements' containing arbitrary fields to a document.
Add a description, image, and links to the production topic page so that developers can more easily learn about it.
To associate your repository with the production topic, visit your repo's landing page and select "manage topics."