Skip to content

Actions: predibase/lorax

Release Charts

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
292 workflow runs
292 workflow runs

Filter by Event

Loading

Filter by Status

Loading

Filter by Branch

Loading

Filter by Actor

Loading
Tokenize inputs in router (#548)
Release Charts #292: Commit 452ac73 pushed by tgaddair
July 19, 2024 21:51 13s main
July 19, 2024 21:51 13s
Fix : compile bug causing models to error with 'lora' key not found (…
Release Charts #291: Commit 1adc076 pushed by ajtejankar
July 19, 2024 19:26 15s main
July 19, 2024 19:26 15s
Move kv cache allocation to router to ensure correct block allocation…
Release Charts #290: Commit 5a7a1be pushed by tgaddair
July 19, 2024 17:10 17s main
July 19, 2024 17:10 17s
Preload adapters during init (#543)
Release Charts #289: Commit 5c25e26 pushed by tgaddair
July 17, 2024 22:30 18s main
July 17, 2024 22:30 18s
no warm up (#540)
Release Charts #288: Commit 2dd5277 pushed by magdyksaleh
July 15, 2024 23:31 17s main
July 15, 2024 23:31 17s
Fix gemma2 (#539)
Release Charts #287: Commit 35f666a pushed by Infernaught
July 12, 2024 20:42 14s main
July 12, 2024 20:42 14s
Lorax NER (#531)
Release Charts #286: Commit a3ad209 pushed by magdyksaleh
July 9, 2024 13:14 13s main
July 9, 2024 13:14 13s
Infer dtype from model config when not explicitly specified (#534)
Release Charts #285: Commit 24cb494 pushed by arnavgarg1
July 3, 2024 22:48 14s main
July 3, 2024 22:48 14s
bug : fix Qwen-2 sliding_window config bug (#532)
Release Charts #284: Commit ecbe9ea pushed by ajtejankar
July 1, 2024 22:39 12s main
July 1, 2024 22:39 12s
Added Gemma2 (#530)
Release Charts #283: Commit c88fa9e pushed by tgaddair
July 1, 2024 21:12 17s main
July 1, 2024 21:12 17s
bug : fix the type checking errors thrown by new ruff version (#533)
Release Charts #282: Commit 2731478 pushed by ajtejankar
July 1, 2024 17:50 15s main
July 1, 2024 17:50 15s
Bug fix for illegal memory access error caused when running medusa lo…
Release Charts #281: Commit f3a67bb pushed by ajtejankar
June 26, 2024 07:45 17s main
June 26, 2024 07:45 17s
Update development env
Release Charts #280: Commit 3247ef6 pushed by tgaddair
June 24, 2024 23:24 18s main
June 24, 2024 23:24 18s
Added eager prefill option (#524)
Release Charts #279: Commit ee5b7fe pushed by tgaddair
June 24, 2024 18:21 14s main
June 24, 2024 18:21 14s
Disable fp8 kv cache for lovelace (#520)
Release Charts #278: Commit 49bb52f pushed by tgaddair
June 18, 2024 23:20 13s main
June 18, 2024 23:20 13s
docs: update development_env.md (#515)
Release Charts #277: Commit 559fc3b pushed by tgaddair
June 18, 2024 19:01 19s main
June 18, 2024 19:01 19s
try out an integration test workflow (#516)
Release Charts #276: Commit cfc1e19 pushed by noyoshi
June 14, 2024 17:21 14s main
June 14, 2024 17:21 14s
Fix issue with GQA initialization for Qwen2 (#514)
Release Charts #275: Commit 9bed4da pushed by arnavgarg1
June 13, 2024 19:37 14s main
June 13, 2024 19:37 14s
fix batching bug (#513)
Release Charts #274: Commit 835d19c pushed by tgaddair
June 12, 2024 21:38 14s main
June 12, 2024 21:38 14s
Fixed case where loaded lora adapter has no segments (#510)
Release Charts #273: Commit 432be6e pushed by tgaddair
June 12, 2024 04:03 14s main
June 12, 2024 04:03 14s
feat: return usage in ChatCompletionStreamResponse (#506)
Release Charts #272: Commit 4187cab pushed by tgaddair
June 11, 2024 16:33 16s main
June 11, 2024 16:33 16s
Add distilbert (#508)
Release Charts #271: Commit 84fb56d pushed by magdyksaleh
June 10, 2024 22:09 15s main
June 10, 2024 22:09 15s
Bert to gpu (#507)
Release Charts #270: Commit f5e71bd pushed by magdyksaleh
June 10, 2024 21:31 14s main
June 10, 2024 21:31 14s
Add support for batching to embedder models (#503)
Release Charts #269: Commit e8f3d33 pushed by tgaddair
June 8, 2024 05:34 13s main
June 8, 2024 05:34 13s
hqq upgrades (#491)
Release Charts #268: Commit 1b528e0 pushed by tgaddair
June 6, 2024 16:25 14s main
June 6, 2024 16:25 14s