Skip to content

Setup and tested OLMo architecture adapters#1174

Merged
jlarson4 merged 1 commit intodev-3.xfrom
feature/OLMo-architecture-adapter
Feb 14, 2026
Merged

Setup and tested OLMo architecture adapters#1174
jlarson4 merged 1 commit intodev-3.xfrom
feature/OLMo-architecture-adapter

Conversation

@jlarson4
Copy link
Collaborator

Added an Architecture Adapter for the OLMo family of architectures. They were all quite similar and easy to implement as a group thanks to the work already done on HookedTransformer for OLMo. The base OLMo model has an issue where it appends a "EOS" token to inputs, which was causing static generation. This bug was resolved in bridge.py and transformers.py to make sure that token is not added unexpectedly.

  • New feature (non-breaking change which adds functionality)

Checklist:

  • I have commented my code, particularly in hard-to-understand areas
  • I have made corresponding changes to the documentation
  • My changes generate no new warnings
  • I have added tests that prove my fix is effective or that my feature works
  • New and existing unit tests pass locally with my changes
  • I have not rewritten tests relating to key interfaces which would affect backward compatibility

@jlarson4 jlarson4 merged commit 86be110 into dev-3.x Feb 14, 2026
53 of 60 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant