Tags: IBM/vllm
🐛 change fork to spawn Signed-off-by: Joe Runde <joe@joerun.de>
🐛 fix guided decoding Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
add install of libsodium (#42) To support encryption of model weights with tensorizer Signed-off-by: Travis Johnson <tsjohnso@us.ibm.com>
⚗️ turn it way down and wait Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
🐛 fix prometheus metric labels (#27) This fixes a mistake: I had seen usages of `.labels` that unpack a dictionary into kwargs with `**`, and I accidentally passed a raw dictionary as a value instead of using keyword arguments 🤦. This caused metrics to show e.g. `method="{'method':'prefill'}"` instead of `method=prefill`. Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
🐛 set paths for wheel built Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
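
The label bug described in the prometheus metric labels fix (#27) can be sketched roughly as follows. This is a minimal illustration, not the code from the commit: `FakeMetric` is a hypothetical stand-in written for this example, and it assumes only what the commit message implies, namely that `labels` accepts values either positionally or as keyword arguments and stringifies each one.

```python
class FakeMetric:
    """Hypothetical stand-in for a labelled metric; not the real client library."""

    def __init__(self, labelnames):
        self.labelnames = tuple(labelnames)

    def labels(self, *values, **kwargs):
        # Accept label values positionally or by keyword, never both.
        if values and kwargs:
            raise ValueError("pass positional or keyword labels, not both")
        if kwargs:
            if sorted(kwargs) != sorted(self.labelnames):
                raise ValueError("incorrect label names")
            values = tuple(kwargs[name] for name in self.labelnames)
        if len(values) != len(self.labelnames):
            raise ValueError("incorrect label count")
        # Every value is stringified, so a dict silently becomes its repr.
        return dict(zip(self.labelnames, (str(v) for v in values)))


metric = FakeMetric(["method"])
label_dict = {"method": "prefill"}

buggy = metric.labels(label_dict)    # dict passed as a single positional value
fixed = metric.labels(**label_dict)  # dict unpacked into keyword arguments

print(buggy)
print(fixed)
```

In the buggy call the whole dictionary becomes the value of the `method` label, which matches the symptom quoted in the commit message. Unpacking with `**` maps each key to its own label instead.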