@JoeStech
Contributor

  • Sized down the model to 70B and the instances to 4xl to avoid segfaults and make things easier for learners.
  • Removed the llama-server section: at 10 minutes per inference, it's unusable.

@pareenaverma pareenaverma merged commit 941ef70 into ArmDeveloperEcosystem:main Aug 20, 2025
1 check passed