Skip to content

v1.6.2

Latest
Compare
Choose a tag to compare
@mitya52 mitya52 released this 27 May 14:33

Refact.ai Self-hosted

  • Models Support: We've introduced support for gated models and the new llama3 model
  • Even More Models: GPT4o and GPT4-turbo models are now available

Refact.ai Enterprise

  • VLLM Speed Improvement: You are now able to experience faster processing times with our optimized VLLM
  • VLLM LoRa-Less Mode: In cases where LoRa is not set up, VLLM will now operate 20% faster due to the new LoRa-less mode
  • Empty Prompt and OOM Handling: We've addressed issues in VLLM that caused broken generations