Skip to content

Actions: huggingface/text-generation-inference

Automatic Documentation for Launcher

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
251 workflow runs
251 workflow runs
Event

Filter by event

Loading
Status

Filter by status

Loading
Branch
Actor

Filter by actor

Loading
Move quantized weight handling out of the Weights class
Automatic Documentation for Launcher #260: Pull request #2194 synchronize by danieldk
July 8, 2024 11:26 6m 13s refactor/quantizer-weights
July 8, 2024 11:26 6m 13s
Fixing AMD CI
Automatic Documentation for Launcher #259: Pull request #2109 synchronize by fxmarty
July 8, 2024 11:10 6m 12s ci_amd3
July 8, 2024 11:10 6m 12s
Fixing AMD CI
Automatic Documentation for Launcher #258: Pull request #2109 synchronize by fxmarty
July 8, 2024 11:07 6m 25s ci_amd3
July 8, 2024 11:07 6m 25s
Falcon/DBRX: get correct number of key-value heads
Automatic Documentation for Launcher #257: Pull request #2205 synchronize by danieldk
July 8, 2024 11:04 6m 10s bugfix/falcon-num-kv-heads
July 8, 2024 11:04 6m 10s
Falcon/DBRX: get correct number of key-value heads
Automatic Documentation for Launcher #256: Pull request #2205 synchronize by danieldk
July 8, 2024 09:56 6m 25s bugfix/falcon-num-kv-heads
July 8, 2024 09:56 6m 25s
Falcon/DBRX: get correct number of key-value heads
Automatic Documentation for Launcher #255: Pull request #2205 opened by danieldk
July 8, 2024 09:36 6m 6s bugfix/falcon-num-kv-heads
July 8, 2024 09:36 6m 6s
fp8 marlin kernel
Automatic Documentation for Launcher #254: Pull request #2204 opened by flozi00
July 8, 2024 09:28 Action required flozi00:fp8marlin
July 8, 2024 09:28 Action required
Fix incorrect cache allocation with multi-query
Automatic Documentation for Launcher #253: Pull request #2203 opened by danieldk
July 8, 2024 08:45 6m 28s bugfix/sharded-1kv-head
July 8, 2024 08:45 6m 28s
Move quantized weight handling out of the Weights class
Automatic Documentation for Launcher #252: Pull request #2194 synchronize by danieldk
July 8, 2024 07:56 6m 21s refactor/quantizer-weights
July 8, 2024 07:56 6m 21s
hotfix: Fix number of KV heads
Automatic Documentation for Launcher #251: Pull request #2202 opened by danieldk
July 8, 2024 07:48 6m 11s bugfix/num-kv-heads
July 8, 2024 07:48 6m 11s
Move quantized weight handling out of the Weights class
Automatic Documentation for Launcher #250: Pull request #2194 synchronize by danieldk
July 8, 2024 07:10 6m 18s refactor/quantizer-weights
July 8, 2024 07:10 6m 18s
Fixed README ToC
Automatic Documentation for Launcher #248: Pull request #2196 opened by vinkamath
July 5, 2024 22:07 Action required vinkamath:fix-readme-toc
July 5, 2024 22:07 Action required
Move quantized weight handling out of the Weights class
Automatic Documentation for Launcher #247: Pull request #2194 opened by danieldk
July 5, 2024 16:13 6m 29s refactor/quantizer-weights
July 5, 2024 16:13 6m 29s
fix: refactor adapter weight loading and mapping
Automatic Documentation for Launcher #246: Pull request #2193 opened by drbh
July 5, 2024 15:05 6m 19s simplify-lora-adapter-layer-loading
July 5, 2024 15:05 6m 19s
Consistently take prefix in model constructors
Automatic Documentation for Launcher #245: Pull request #2191 synchronize by danieldk
July 5, 2024 14:07 6m 14s maintenance/accept-prefix
July 5, 2024 14:07 6m 14s
Consistently take prefix in model constructors
Automatic Documentation for Launcher #244: Pull request #2191 synchronize by danieldk
July 5, 2024 12:34 6m 27s maintenance/accept-prefix
July 5, 2024 12:34 6m 27s
Consistently take prefix in model constructors
Automatic Documentation for Launcher #243: Pull request #2191 synchronize by danieldk
July 5, 2024 12:12 6m 51s maintenance/accept-prefix
July 5, 2024 12:12 6m 51s
Consistently take prefix in model constructors
Automatic Documentation for Launcher #242: Pull request #2191 opened by danieldk
July 5, 2024 12:00 6m 23s maintenance/accept-prefix
July 5, 2024 12:00 6m 23s
update to metrics 0.23.0 or could work with metrics-exporter-promethe…
Automatic Documentation for Launcher #241: Pull request #2190 opened by sywangyi
July 5, 2024 10:58 Action required sywangyi:metrics_fix
July 5, 2024 10:58 Action required
Fix Starcoder2 after refactor
Automatic Documentation for Launcher #240: Pull request #2189 opened by danieldk
July 5, 2024 10:17 6m 16s bugfix/starcoder2
July 5, 2024 10:17 6m 16s
Use symmetric quantization in the quantize subcommand
Automatic Documentation for Launcher #239: Pull request #2120 synchronize by danieldk
July 5, 2024 07:19 6m 23s bugfix/quantize-use-sym
July 5, 2024 07:19 6m 23s
misc: update vllm dependency to support attention size 160
Automatic Documentation for Launcher #238: Pull request #2187 opened by PaoloAlbano
July 4, 2024 15:25 Action required igeniusai:misc_support_attention_160
July 4, 2024 15:25 Action required
Refactor dead code - Removing all flash_xxx.py files.
Automatic Documentation for Launcher #237: Pull request #2166 synchronize by Narsil
July 4, 2024 15:18 6m 16s refactor_dead_code
July 4, 2024 15:18 6m 16s
Refactor dead code - Removing all flash_xxx.py files.
Automatic Documentation for Launcher #236: Pull request #2166 synchronize by Narsil
July 4, 2024 15:18 6m 14s refactor_dead_code
July 4, 2024 15:18 6m 14s
Use symmetric quantization in the quantize subcommand
Automatic Documentation for Launcher #235: Pull request #2120 synchronize by danieldk
July 4, 2024 15:11 6m 36s bugfix/quantize-use-sym
July 4, 2024 15:11 6m 36s