
Best Practices & ###### results problem #77

Closed
MyraBaba opened this issue Jan 2, 2024 · 2 comments

@MyraBaba

MyraBaba commented Jan 2, 2024

Hi @snexus

Thanks for your efforts and this beautiful project.

1. Would you mind giving more examples for Hugging Face / llama.cpp models? i.e., what would give the most accurate results for multilingual use, or for languages other than English?

2. What would be the choice for summarizing documents?

3. Sometimes the result adds many ##### and then stops.

4. Could you give example configs for Mistral, Mixtral, and Dolphin 2?

5. Do you know Haystack? It looks like a commercialized version.

6. What will be the future of enterprise AI search for internal documents? Is it worth investing in? Can we talk?

@snexus
Owner

snexus commented Jan 3, 2024

Hi @MyraBaba

Would you mind giving more examples for Hugging Face / llama.cpp models? i.e., what would give the most accurate results for multilingual use, or for languages other than English?

Unfortunately, it is hard to answer this question since it depends on the model and the task. The goal of the package is to enable users to choose the model they like without forcing any specific model, since "best" or "accurate" are subjective. A good place to look for new models or ask for the best model for the specific task would be https://www.reddit.com/r/LocalLLaMA/

What would be the choice for summarizing documents?

Summarization is not in the scope of this project; it was built for question answering.

Sometimes the result adds many ##### and then stops.

This is not coming from the package but from the model itself: some models require a different prompt template for asking the question. You can consult the model card (e.g. on Hugging Face) and update the prompt template in the config, as specified here -
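For illustration, a prompt-template override in the config might look roughly like the sketch below. This is an assumption about the config layout, not the project's exact schema: the `llm`, `params`, and `prompt_template` keys and the `{context}`/`{question}` placeholders are hypothetical names here, and the Alpaca-style `### Instruction:` markers are just one common template style. Check the project's sample config and the model card for the real field names and the template the model was trained with.

```yaml
# Hypothetical config fragment: override the prompt template so it matches
# what the model expects (wrong templates often cause runaway ##### output).
llm:
  params:
    prompt_template: |
      ### Instruction:
      Use the following context to answer the question.

      {context}

      ### Question:
      {question}

      ### Response:
```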

Could you give example configs for Mistral, Mixtral, and Dolphin 2?

As long as the model is supported by llama.cpp, the setup should be similar: download the model (e.g. in GGUF format) and specify its path in the config. Some models work better with non-default parameters; you can configure these in config.yaml, similar to here -
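A minimal sketch of what such a config.yaml fragment could look like, assuming a llama.cpp-backed model section. The key names (`llm`, `type`, `model_path`, `model_kwargs`) and the model filename are hypothetical placeholders; the inner parameters (`n_ctx`, `n_gpu_layers`, `temperature`) follow common llama-cpp-python conventions but should be checked against the project's documented config:

```yaml
# Hypothetical config fragment for a llama.cpp model in GGUF format.
llm:
  type: llamacpp
  params:
    # Path to the downloaded GGUF file (filename is illustrative).
    model_path: /models/mistral-7b-instruct.Q4_K_M.gguf
    model_kwargs:
      n_ctx: 4096        # context window size
      n_gpu_layers: 35   # layers offloaded to GPU, if available
      temperature: 0.1   # lower values for more deterministic answers
```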

Do you know Haystack? It looks like a commercialized version.

Can you clarify the question, please?

What will be the future of enterprise AI search for internal documents? Is it worth investing in? Can we talk?

Quite hard to answer - some big players are entering the space; for example, Microsoft already offers services for enterprise-grade document question answering, where models can be deployed to an organization's private network and data doesn't leave the perimeter. I think there will be space for open-source projects like this one for small organisations and privacy-conscious users, but it is hard to tell whether they can be monetized.

@MyraBaba
Author

MyraBaba commented Jan 3, 2024

@snexus
Haystack is https://haystack.deepset.ai/

@snexus snexus closed this as completed Jan 6, 2024