Skip to content

Better mechanism for getting values to attention processors #6601

@spezialspezial

Description

@spezialspezial

I think it would be nice to generally have some better infrastructure/mechanism for getting values to attention processors.

At the moment the main means of transportation seems to be cross_attention_kwargs but they are finally expanded as function arguments and have to match ALL function signatures that receive them. That's why there is a lot of dict gymnastics before the call. Just further passing the dict via reference where every processor may pick what it needs not bothering with other items would be more flexible imo.

In my mind pipeline and processors should be the malleable 'loose ends' to modify and swap with infrastructure in between rarely in need of adjustment.

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions