v1.6.2

Latest

mitya52 released this 27 May 14:33

Refact.ai Self-hosted

Model Support: We've added support for gated models and the new Llama 3 model
Even More Models: GPT-4o and GPT-4 Turbo are now available

Refact.ai Enterprise

vLLM Speed Improvement: Our optimized vLLM now delivers faster processing times
vLLM LoRA-less Mode: When LoRA is not set up, vLLM now runs 20% faster thanks to the new LoRA-less mode
Empty Prompt and OOM Handling: We've fixed vLLM issues with empty prompts and out-of-memory (OOM) conditions that caused broken generations
