Skip to content

Actions: huggingface/text-generation-inference

Build documentation

Actions

Loading...

Show workflow options

Create status badge

99 workflow runs
99 workflow runs
Event

Filter by event

Status

Filter by status

Branch
Actor

Filter by actor

Adding architecture document (#2044)
Build documentation #99: Commit 445f313 pushed by drbh
June 14, 2024 13:28 40s main
June 14, 2024 13:28 40s
Update the link for qwen2 (#2068)
Build documentation #98: Commit 96b7b40 pushed by danieldk
June 14, 2024 09:59 37s main
June 14, 2024 09:59 37s
Add support for Marlin-quantized models
Build documentation #97: Commit 4594e6f pushed by danieldk
June 6, 2024 11:16 43s main
June 6, 2024 11:16 43s
fix: update triton implementation reference (#2002)
Build documentation #96: Commit fec0167 pushed by Narsil
June 4, 2024 12:26 47s main
June 4, 2024 12:26 47s
single char ` addition for docs (#1989)
Build documentation #95: Commit 08b3eac pushed by Narsil
May 31, 2024 16:42 40s main
May 31, 2024 16:42 40s
Update documentation version to 2.0.4 (#1980)
Build documentation #94: Commit 659bd67 pushed by Narsil
May 31, 2024 14:03 37s main
May 31, 2024 14:03 37s
Gemma GPTQ checks: skip logprob checks
Build documentation #93: Commit 967ced2 pushed by danieldk
May 30, 2024 09:28 38s main
May 30, 2024 09:28 38s
Build documentation
Build documentation #92: by Narsil
May 28, 2024 12:52 54s main
May 28, 2024 12:52 54s
fix small typo and broken link (#1958)
Build documentation #91: Commit b7ffa28 pushed by drbh
May 27, 2024 15:31 37s main
May 27, 2024 15:31 37s
feat: add train medusa head tutorial (#1934)
Build documentation #90: Commit a103e3e pushed by Narsil
May 23, 2024 09:34 37s main
May 23, 2024 09:34 37s
Creating doc automatically for supported models. (#1929)
Build documentation #89: Commit 2f243a1 pushed by Narsil
May 22, 2024 14:23 37s main
May 22, 2024 14:23 37s
docs: Fix grafana dashboard url (#1925)
Build documentation #88: Commit 904ff36 pushed by drbh
May 21, 2024 17:12 54s main
May 21, 2024 17:12 54s
ROCm: make CK FA2 default instead of Triton (#1924)
Build documentation #87: Commit 293b812 pushed by fxmarty
May 20, 2024 00:44 52s main
May 20, 2024 00:44 52s
Fix TunableOp bug (#1920)
Build documentation #86: Commit b5f1c9d pushed by Narsil
May 17, 2024 16:21 36s main
May 17, 2024 16:21 36s
Add TGI monitoring guide through Grafana and Prometheus (#1908)
Build documentation #85: Commit c4cf8b4 pushed by drbh
May 17, 2024 14:34 39s main
May 17, 2024 14:34 39s
MI300 compatibility (#1764)
Build documentation #84: Commit 232e8d5 pushed by Narsil
May 17, 2024 13:30 36s main
May 17, 2024 13:30 36s
Add GPT-2 with flash attention (#1889)
Build documentation #83: Commit b5bc6e5 pushed by Narsil
May 15, 2024 11:31 43s main
May 15, 2024 11:31 43s
Correct 'using guidance' link (#1892)
Build documentation #82: Commit 92f1338 pushed by drbh
May 14, 2024 18:23 41s main
May 14, 2024 18:23 41s
feat: prefer huggingface_hub in docs and show image api (#1844)
Build documentation #81: Commit 65539b7 pushed by Narsil
May 2, 2024 14:56 42s main
May 2, 2024 14:56 42s
fix: split docs and start conceptual page (#1836)
Build documentation #80: Commit 6073ece pushed by Narsil
May 1, 2024 07:03 42s main
May 1, 2024 07:03 42s
feat: add vlm docs and simple examples (#1812)
Build documentation #79: Commit b2c9827 pushed by Narsil
April 30, 2024 10:14 36s main
April 30, 2024 10:14 36s
feat: add how it works section (#1773)
Build documentation #78: Commit f661508 pushed by Narsil
April 30, 2024 09:45 36s main
April 30, 2024 09:45 36s
Changing the waiting_served_ratio default (stack more aggressively by…
Build documentation #77: Commit 007d5e5 pushed by Narsil
April 28, 2024 15:54 2m 52s main
April 28, 2024 15:54 2m 52s
Update guidance docs to reflect grammar support in API (#1775)
Build documentation #76: Commit eb08b9f pushed by drbh
April 25, 2024 17:11 3m 18s main
April 25, 2024 17:11 3m 18s
Idefics2. (#1756)
Build documentation #75: Commit bfddfa5 pushed by Narsil
April 23, 2024 21:04 36m 57s main
April 23, 2024 21:04 36m 57s