In [1]:
from dotenv import load_dotenv
from langchain_openai import ChatOpenAI
from IPython.display import Markdown, display

load_dotenv()

True

In [2]:
class Configs:
    GPT_MODEL = "gpt-4o"

settings = Configs()

In [3]:
llm = ChatOpenAI(model=settings.GPT_MODEL, temperature=0.3)

In [4]:
prompt = "How can I use SDPA with BERT in transformers?  Be short and precise."
response = llm.invoke(prompt)

display(Markdown(response.content))

To use Scaled Dot-Product Attention (SDPA) with BERT in the Hugging Face Transformers library, follow these steps:

1. **Install the Transformers library**:
   ```bash
   pip install transformers
   ```

2. **Load a BERT model with SDPA**:
   ```python
   from transformers import BertModel, BertTokenizer

   # Load the tokenizer
   tokenizer = BertTokenizer.from_pretrained('bert-base-uncased')

   # Load the BERT model with the SDPA attention implementation
   model = BertModel.from_pretrained('bert-base-uncased', attn_implementation='sdpa')
   ```

3. **Tokenize your input text**:
   ```python
   text = "Your input text here"
   inputs = tokenizer(text, return_tensors='pt')
   ```

4. **Pass the inputs through the model**:
   ```python
   outputs = model(**inputs)
   ```

This runs BERT with SDPA in the Transformers library. Availability depends on the installed versions: SDPA relies on `torch.nn.functional.scaled_dot_product_attention`, which was introduced in PyTorch 2.0, and the installed Transformers release must include SDPA support for BERT. Always refer to the latest [Hugging Face Transformers documentation](https://huggingface.co/transformers/) for updates and detailed usage.
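
In PyTorch and Transformers, SDPA stands for scaled dot-product attention, which computes `softmax(Q K^T / sqrt(d)) V`. The following is a minimal pure-Python sketch of that computation, meant only to illustrate what the fused kernel evaluates. The function names and the toy inputs are illustrative assumptions, not part of the Transformers API, and the real implementation additionally handles batching, multiple heads, masking, and dropout.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def scaled_dot_product_attention(queries, keys, values):
    """Reference (unoptimized) attention: softmax(Q K^T / sqrt(d)) V.

    Each argument is a list of vectors (one per token) for a single head.
    """
    d = len(keys[0])  # head dimension used for the 1/sqrt(d) scaling
    outputs = []
    for q in queries:
        # Scaled similarity between this query and every key.
        scores = [
            sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d)
            for k in keys
        ]
        weights = softmax(scores)
        # Weighted average of the value vectors.
        outputs.append([
            sum(w * v[j] for w, v in zip(weights, values))
            for j in range(len(values[0]))
        ])
    return outputs


# Toy example (hypothetical data): two tokens, head dimension 2.
q = [[1.0, 0.0], [0.0, 1.0]]
k = [[1.0, 0.0], [0.0, 1.0]]
v = [[1.0, 2.0], [3.0, 4.0]]
out = scaled_dot_product_attention(q, k, v)
```

Each output row is a convex combination of the value vectors, weighted toward the value whose key best matches the query. Selecting the SDPA attention implementation in Transformers is intended to leave this math unchanged and only switch to PyTorch's fused kernel for it.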