Skip to content

Actions: huggingface/text-generation-inference

Build and push docker image to internal registry

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
2,342 workflow runs
2,342 workflow runs
Event

Filter by event

Loading
Status

Filter by status

Loading
Branch
Actor

Filter by actor

Loading
Add FP8 KVCache support
Build and push docker image to internal registry #2980: Pull request #2028 synchronize by mht-sharma
June 24, 2024 15:22 59m 9s fp8_kvcache
June 24, 2024 15:22 59m 9s
Add FP8 KVCache support
Build and push docker image to internal registry #2979: Pull request #2028 synchronize by mht-sharma
June 24, 2024 15:20 1m 34s fp8_kvcache
June 24, 2024 15:20 1m 34s
Add FP8 KVCache support
Build and push docker image to internal registry #2978: Pull request #2028 synchronize by mht-sharma
June 24, 2024 15:09 11m 53s fp8_kvcache
June 24, 2024 15:09 11m 53s
Add FP8 KVCache support
Build and push docker image to internal registry #2977: Pull request #2028 synchronize by mht-sharma
June 24, 2024 15:06 2m 49s fp8_kvcache
June 24, 2024 15:06 2m 49s
Add FP8 KVCache support
Build and push docker image to internal registry #2976: Pull request #2028 synchronize by mht-sharma
June 24, 2024 14:38 28m 53s fp8_kvcache
June 24, 2024 14:38 28m 53s
Add FP8 KVCache support
Build and push docker image to internal registry #2975: Pull request #2028 synchronize by mht-sharma
June 24, 2024 14:31 8m 0s fp8_kvcache
June 24, 2024 14:31 8m 0s
Use GPTQ-Marlin for supported GPTQ configurations
Build and push docker image to internal registry #2974: Pull request #2111 opened by danieldk
June 24, 2024 13:22 1h 4m 3s feature/use-gptq-marlin-for-gptq
June 24, 2024 13:22 1h 4m 3s
Add FP8 KVCache support
Build and push docker image to internal registry #2973: Pull request #2028 synchronize by mht-sharma
June 24, 2024 11:38 1h 14m 16s fp8_kvcache
June 24, 2024 11:38 1h 14m 16s
feat: sort cuda graphs in descending order (#2104)
Build and push docker image to internal registry #2971: Commit 811a938 pushed by drbh
June 21, 2024 18:28 54m 6s main
June 21, 2024 18:28 54m 6s
feat: sort cuda graphs in descending order
Build and push docker image to internal registry #2970: Pull request #2104 opened by drbh
June 21, 2024 16:16 1h 0m 20s descending-cuda-graphs
June 21, 2024 16:16 1h 0m 20s
Fix text-generation-server quantize (#2103)
Build and push docker image to internal registry #2969: Commit 197c47a pushed by danieldk
June 21, 2024 13:28 39m 48s main
June 21, 2024 13:28 39m 48s
Fix text-generation-server quantize
Build and push docker image to internal registry #2968: Pull request #2103 opened by danieldk
June 21, 2024 12:28 59m 57s bugfix/server-quantize
June 21, 2024 12:28 59m 57s
Add support for Marlin 2:4 sparsity
Build and push docker image to internal registry #2967: Pull request #2102 opened by danieldk
June 21, 2024 12:17 1h 8m 31s feature/marlin-24
June 21, 2024 12:17 1h 8m 31s
feat: add simple tests for weights
Build and push docker image to internal registry #2964: Pull request #2092 synchronize by drbh
June 21, 2024 03:25 1h 1m 28s add-weights-tests
June 21, 2024 03:25 1h 1m 28s
feat: add simple tests for weights
Build and push docker image to internal registry #2963: Pull request #2092 synchronize by drbh
June 20, 2024 20:49 29m 59s add-weights-tests
June 20, 2024 20:49 29m 59s
feat: add simple tests for weights
Build and push docker image to internal registry #2962: Pull request #2092 synchronize by drbh
June 20, 2024 19:28 44m 49s add-weights-tests
June 20, 2024 19:28 44m 49s
feat: add simple tests for weights
Build and push docker image to internal registry #2961: Pull request #2092 synchronize by drbh
June 20, 2024 18:57 32m 2s add-weights-tests
June 20, 2024 18:57 32m 2s
Fix nccl regression on PyTorch 2.3 upgrade
Build and push docker image to internal registry #2960: Pull request #2099 synchronize by fxmarty
June 20, 2024 18:35 1h 27m 36s fix-nccl-regression
June 20, 2024 18:35 1h 27m 36s
Fix nccl regression on PyTorch 2.3 upgrade
Build and push docker image to internal registry #2959: Pull request #2099 synchronize by fxmarty
June 20, 2024 18:12 22m 44s fix-nccl-regression
June 20, 2024 18:12 22m 44s
Fix nccl regression on PyTorch 2.3 upgrade
Build and push docker image to internal registry #2958: Pull request #2099 synchronize by fxmarty
June 20, 2024 18:03 9m 50s fix-nccl-regression
June 20, 2024 18:03 9m 50s
Fix nccl regression on PyTorch 2.3 upgrade
Build and push docker image to internal registry #2957: Pull request #2099 opened by fxmarty
June 20, 2024 18:01 2m 0s fix-nccl-regression
June 20, 2024 18:01 2m 0s
Fix LLaVA-NeXT handling of non-square images
Build and push docker image to internal registry #2956: Pull request #2097 opened by danieldk
June 20, 2024 13:43 1h 1m 20s bugfix/llava-unpad
June 20, 2024 13:43 1h 1m 20s
Factor out sharding of packed tensors (#2059)
Build and push docker image to internal registry #2954: Commit bcb3faa pushed by danieldk
June 20, 2024 07:56 39m 31s main
June 20, 2024 07:56 39m 31s
Idefics2: sync added image tokens with transformers
Build and push docker image to internal registry #2953: Pull request #2080 synchronize by danieldk
June 20, 2024 07:22 1h 1m 5s bugfix/idefics2-no-image-splitting
June 20, 2024 07:22 1h 1m 5s
Factor out sharding of packed tensors
Build and push docker image to internal registry #2952: Pull request #2059 synchronize by danieldk
June 20, 2024 06:57 58m 12s maintenance/packed-sharded-refactor
June 20, 2024 06:57 58m 12s