关于 XAgentLlama 使用资源的疑问 #248

CQxiaocaimi · 2023-11-24T08:13:37Z

我看到你们推出了两个版本的XAgentLlama，34B和7B对GPU要求是多少，它是要下载到本地调用对吧，

Umpire2018 · 2023-11-24T10:50:32Z

非常抱歉，尽管我们最近发布了 XAgentLLaMa ，但在用户指南和其他相关文档方面未能提供充分指导，给您带来了不便。我们深感歉意，并承诺会尽快完善这些资料，以确保您能够顺畅、高效地使用我们提供的大模型。同时，我们欢迎任何反馈和建议，以帮助我们改进服务。再次为给您造成的任何困扰表示歉意，并感谢您的理解与支持。

当前可以分享的信息

模型还没进行量化，34B 在进行推理时使用了80G的显存.
请将模型下载到本地并确保遵循 codellama 的指引.

We sincerely apologize that despite the recent launch of XAgentLLaMa, we failed to provide sufficient guidance in the user manual and other related documentation, causing inconvenience to you. We are deeply sorry and commit to improving these materials as soon as possible, ensuring you can smoothly and efficiently utilize the large model we offer. We welcome any feedback and suggestions to help us improve our services. Once again, we apologize for any trouble caused and appreciate your understanding and support.

Current Information to Share:

The model has not yet been quantized, and using 80GB of VRAM for 34B during inference.
Please download the model to your local environment and ensure you follow the guidelines from codellama.

wangjiainchinatelecom · 2023-11-29T06:25:04Z

XAgentLlama 34B我现在下来了发现没有tokenizer相关文件是要把codellama/CodeLlama-34b-Instruct-hf的tokenizer放到里面吗?

AL-377 · 2023-11-29T07:09:34Z

感谢您的提醒，现已上传tokenizer相关文件，您可再次下载尝试！

Umpire2018 · 2023-11-30T13:50:04Z

我们预计会在未来两周内完善 #248 (comment) 提及内容。

samuelchen2015 · 2023-12-12T02:40:47Z

希望可以提供XAgentLlama 的llamaccp版和相应的fastapi 修改，个人还是用不起啊

lileishitou · 2024-04-22T07:29:11Z

xagentllama-34b-preview 缺少 pytorch_model.bin.index.json, 使用trasformers 库加载报错

Umpire2018 added the help wanted Extra attention is needed label Nov 24, 2023

Umpire2018 changed the title ~~XAgentLlama~~ 关于 XAgentLlama 使用资源的疑问 Nov 24, 2023

Umpire2018 pinned this issue Nov 29, 2023

Umpire2018 assigned AL-377 Nov 29, 2023

AL-377 closed this as completed Nov 29, 2023

Umpire2018 reopened this Nov 29, 2023

thunlp-zp unpinned this issue Nov 29, 2023

Umpire2018 mentioned this issue Dec 21, 2023

XAgentGen: XAgentLlaMa-34B-preview能否通过多卡直接推理 #355

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

关于 XAgentLlama 使用资源的疑问 #248

关于 XAgentLlama 使用资源的疑问 #248

CQxiaocaimi commented Nov 24, 2023

Umpire2018 commented Nov 24, 2023

wangjiainchinatelecom commented Nov 29, 2023

AL-377 commented Nov 29, 2023

Umpire2018 commented Nov 30, 2023

samuelchen2015 commented Dec 12, 2023

lileishitou commented Apr 22, 2024

关于 XAgentLlama 使用资源的疑问 #248

关于 XAgentLlama 使用资源的疑问 #248

Comments

CQxiaocaimi commented Nov 24, 2023

Umpire2018 commented Nov 24, 2023

wangjiainchinatelecom commented Nov 29, 2023

AL-377 commented Nov 29, 2023

Umpire2018 commented Nov 30, 2023

samuelchen2015 commented Dec 12, 2023

lileishitou commented Apr 22, 2024