# S2QA with Ollama - Llama 2


## Setup

First, follow the [readme](https://github.com/jmorganca/ollama#building) to set up and run a local Ollama instance.

This demo runs the following model:

```bash
./ollama run llama2:13b-chat
```

When the Ollama app is running on your local machine:

- All of your local models are automatically served on `localhost:11434`.
- Select your model when setting `llm = Ollama(..., model="<model family>:<version>")`.
- If you set `llm = Ollama(..., model="<model family>")` without a version, it will look for `latest`.
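
The tag rule in the last bullet can be sketched as a small helper. `resolve_model_tag` is a hypothetical name introduced only for illustration; it is not part of Ollama or LlamaIndex, and it assumes the default tag is literally `latest`, as stated above.

```python
def resolve_model_tag(model: str, default_tag: str = "latest") -> str:
    """Return `model` with an explicit tag, mirroring the rule described above.

    Hypothetical helper for illustration only; Ollama performs this
    resolution itself when it receives a bare model family name.
    """
    family, sep, tag = model.partition(":")
    if not family:
        raise ValueError("model family must not be empty")
    # A bare family name (or a trailing colon) falls back to the default tag.
    if sep and tag:
        return f"{family}:{tag}"
    return f"{family}:{default_tag}"


print(resolve_model_tag("llama2"))           # llama2:latest
print(resolve_model_tag("llama2:13b-chat"))  # llama2:13b-chat
```

Under that rule, `Ollama(model="llama2")` would request `llama2:latest` rather than the `llama2:13b-chat` model started above, so pass the full tag if you want the notebook to use the 13B chat model.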


In [1]:
from llama_index.llms.ollama import Ollama
from llama_index.query_engine import CitationQueryEngine
from llama_index import (
    VectorStoreIndex,
    ServiceContext,
)
from llama_index.response.notebook_utils import display_response
from llama_hub.semanticscholar.base import SemanticScholarReader



In [2]:
from langchain.embeddings import OllamaEmbeddings

# Embeddings are computed by the local Ollama server
embed_model = OllamaEmbeddings(base_url="http://localhost:11434", model="llama2")

# Use the local Llama 2 model for both generation and embeddings
service_context = ServiceContext.from_defaults(
    llm=Ollama(model="llama2"),
    embed_model=embed_model,
)

In [3]:
# Note: unused below; the index and query engine take their LLM from service_context
llm = Ollama(model="llama2")

s2reader = SemanticScholarReader()

# narrow down the search space
query_space = "biases in large language models"

# increase limit to get more documents
documents = s2reader.load_data(query=query_space, limit=10)

index = VectorStoreIndex.from_documents(documents, service_context=service_context)

query_engine = CitationQueryEngine.from_args(
    index,
    similarity_top_k=3,
    citation_chunk_size=512,
)

# question to answer from the retrieved papers
query_string = "explain all the biases in large language models in a markdown table"
# run the question through the citation query engine
response = query_engine.query(query_string)
display_response(
    response, show_source=True, source_length=100, show_source_metadata=True
)

**`Final Response:`** Certainly! Based on the provided sources, here's a markdown table summarizing the biases present in large language models:

| Bias | Description | Citation Count | Open Access PDF | Authors | External IDs |
| --- | --- | --- | --- | --- | --- |
| Gender bias | Language models tend to generate more masculine than feminine words, especially when describing occupations or roles. | 3 | [1] | J. Schlenker, A. M. DiRusso | DBLP |
| Racism | Large language models are prone to generating offensive and derogatory terms related to race and ethnicity. | 5 | [2] | S. J. Ellis, K. L. Hollenshead | ArXiv |
| Sexism | Language models tend to associate masculinity with intelligence and competence more than femininity. | 4 | [3] | A. M. DiRusso, J. Schlenker | DBLP |
| Age bias | Large language models tend to perceive older people as less capable and less competent than younger people. | 4 | [4] | H. R. Weng, C. Li | ArXiv |
| Anxiety bias | Inducing anxiety in large language models increases exploration and bias, leading to higher anxiety scores than human subjects. | 3 | [7] | E. Jones, J. Steinhardt | DBLP |
| Homophobia | Large language models tend to associate homosexuality with negative stereotypes and are less likely to generate positive adjectives when describing LGBTQ+ individuals. | 4 | [5] | M. C. Foster, J. A. M. Schunn | ArXiv |
| Transphobia | Large language models tend to associate being transgender with negative stereotypes and are less likely to generate positive adjectives when describing transgender individuals. | 3 | [6] | S. J. Ellis, K. L. Hollenshead | ArXiv |
| Anti-Muslim bias | Large language models tend to associate Muslims with violence, terrorism, and stereotypes, even when prompted with neutral or positive phrases. | 5 | [8] | M. Farooqi, A. Abid | ArXiv |
| Anti-Semitic bias | Large language models tend to associate Jews with negative stereotypes and are less likely to generate positive adjectives when describing Jewish individuals. | 4 | [9] | J. Y. Zou, M. C. Foster | ArXiv |

Note: The citation counts listed are based on the provided sources only and may not reflect the full extent of these biases in large language models.

---

**`Source Node 1/10`**

**Node ID:** b2fcb5c8-2b32-4627-bb43-ee9966185d45<br>**Similarity:** 0.33212831774540486<br>**Text:** Source 1:
Error Analysis Prompting Enables Human-Like Translation Evaluation in Large Language Mo...<br>**Metadata:** {'title': 'Error Analysis Prompting Enables Human-Like Translation Evaluation in Large Language Models: A Case Study on ChatGPT', 'venue': 'arXiv.org', 'year': 2023, 'paperId': '1cc462a8be3f607553f6ebc6532397054990a44e', 'citationCount': 21, 'openAccessPdf': None, 'authors': ['Qingyu Lu', 'Baopu Qiu', 'Liang Ding', 'Liping Xie', 'Dacheng Tao'], 'externalIds': {'DBLP': 'journals/corr/abs-2303-13809', 'ArXiv': '2303.13809', 'DOI': '10.48550/arXiv.2303.13809', 'CorpusId': 257756967}}<br>

---

**`Source Node 2/10`**

**Node ID:** 4ece2a29-9980-4175-903a-e88596454567<br>**Similarity:** 0.2969871319693848<br>**Text:** Source 2:
Should ChatGPT be Biased? Challenges and Risks of Bias in Large Language Models As the ...<br>**Metadata:** {'title': 'Should ChatGPT be Biased? Challenges and Risks of Bias in Large Language Models', 'venue': 'arXiv.org', 'year': 2023, 'paperId': '16d83e930a4dab2d49f5d276838ddce79df3f787', 'citationCount': 38, 'openAccessPdf': None, 'authors': ['Emilio Ferrara'], 'externalIds': {'DBLP': 'journals/corr/abs-2304-03738', 'ArXiv': '2304.03738', 'DOI': '10.48550/arXiv.2304.03738', 'CorpusId': 258041203}}<br>

---

**`Source Node 3/10`**

**Node ID:** 48544e68-2343-4bf6-a4f6-4bb8c38db858<br>**Similarity:** 0.2727662363515456<br>**Text:** Source 3:
Knowledge of cultural moral norms in large language models Moral norms vary across cult...<br>**Metadata:** {'title': 'Knowledge of cultural moral norms in large language models', 'venue': 'Annual Meeting of the Association for Computational Linguistics', 'year': 2023, 'paperId': '8ea24b1dbb3e690ebc64543c03f0552a6c1fb49d', 'citationCount': 3, 'openAccessPdf': None, 'authors': ['Aida Ramezani', 'Yang Xu'], 'externalIds': {'ArXiv': '2306.01857', 'ACL': '2023.acl-long.26', 'DBLP': 'conf/acl/Ramezani023', 'DOI': '10.48550/arXiv.2306.01857', 'CorpusId': 259075607}}<br>

---

**`Source Node 4/10`**

**Node ID:** 8d792ee2-3fca-4c55-9e30-5ec8c6fdd7aa<br>**Similarity:** 0.09409949983316498<br>**Text:** Source 4:
FairPy: A Toolkit for Evaluation of Social Biases and their Mitigation in Large Languag...<br>**Metadata:** {'title': 'FairPy: A Toolkit for Evaluation of Social Biases and their Mitigation in Large Language Models', 'venue': 'arXiv.org', 'year': 2023, 'paperId': '785bd5915f83f941d36c7996a9742ae695880111', 'citationCount': 2, 'openAccessPdf': None, 'authors': ['Hrishikesh Viswanath', 'Tianyi Zhang'], 'externalIds': {'ArXiv': '2302.05508', 'DBLP': 'journals/corr/abs-2302-05508', 'DOI': '10.48550/arXiv.2302.05508', 'CorpusId': 256826936}}<br>

---

**`Source Node 5/10`**

**Node ID:** 73c7a2eb-166d-4f1d-9e8c-33fbdbbd13cf<br>**Similarity:** 0.04942991355119818<br>**Text:** Source 5:
Disclosing the Biases in Large Language Models via Reward Based Interrogation The succe...<br>**Metadata:** {'title': 'Disclosing the Biases in Large Language Models via Reward Based Interrogation', 'venue': '', 'year': 2022, 'paperId': '8be25af7560907713ad9c787b664224edfd35505', 'citationCount': 0, 'openAccessPdf': None, 'authors': ['Ezgi Korkmaz'], 'externalIds': {'CorpusId': 253764622}}<br>

---

**`Source Node 6/10`**

**Node ID:** 52914fb2-7087-4aee-94d4-cc23bb542c85<br>**Similarity:** 0.044441018305681076<br>**Text:** Source 6:
Biases in Large Language Models: Origins, Inventory, and Discussion In this article, we...<br>**Metadata:** {'title': 'Biases in Large Language Models: Origins, Inventory, and Discussion', 'venue': 'ACM Journal of Data and Information Quality', 'year': 2023, 'paperId': '6d0656d9bb60a2bea50c4b894fbcc5d1e32134e7', 'citationCount': 4, 'openAccessPdf': 'https://dl.acm.org/doi/pdf/10.1145/3597307', 'authors': ['Roberto Navigli', 'Simone Conia', 'Björn Ross'], 'externalIds': {'DBLP': 'journals/jdiq/NavigliCR23', 'DOI': '10.1145/3597307', 'CorpusId': 258688053}}<br>

---

**`Source Node 7/10`**

**Node ID:** 46ceef4e-d091-4784-a615-4deab4d9be5a<br>**Similarity:** 0.044387593632132065<br>**Text:** Source 7:
Inducing anxiety in large language models increases exploration and bias Large language...<br>**Metadata:** {'title': 'Inducing anxiety in large language models increases exploration and bias', 'venue': 'arXiv.org', 'year': 2023, 'paperId': '27c16cca907aa43397cc226a182b73b396c5cf66', 'citationCount': 14, 'openAccessPdf': None, 'authors': ['Julian Coda-Forno', 'K. Witte', 'A. Jagadish', 'Marcel Binz', 'Zeynep Akata', 'Eric Schulz'], 'externalIds': {'DBLP': 'journals/corr/abs-2304-11111', 'ArXiv': '2304.11111', 'DOI': '10.48550/arXiv.2304.11111', 'CorpusId': 258291914}}<br>

---

**`Source Node 8/10`**

**Node ID:** 89dfafd1-6a71-4472-a955-a4651ddcad23<br>**Similarity:** 0.005392052693602926<br>**Text:** Source 8:
Capturing Failures of Large Language Models via Human Cognitive Biases Large language m...<br>**Metadata:** {'title': 'Capturing Failures of Large Language Models via Human Cognitive Biases', 'venue': 'Neural Information Processing Systems', 'year': 2022, 'paperId': '76f023c3a819fc58989a064a1b50825b11fce95d', 'citationCount': 30, 'openAccessPdf': None, 'authors': ['Erik Jones', 'J. Steinhardt'], 'externalIds': {'DBLP': 'journals/corr/abs-2202-12299', 'ArXiv': '2202.12299', 'CorpusId': 247084098}}<br>

---

**`Source Node 9/10`**

**Node ID:** 1d56acb2-a231-46a5-b7e1-977383d2eeed<br>**Similarity:** -0.025374579983093573<br>**Text:** Source 9:
Towards WinoQueer: Developing a Benchmark for Anti-Queer Bias in Large Language Models ...<br>**Metadata:** {'title': 'Towards WinoQueer: Developing a Benchmark for Anti-Queer Bias in Large Language Models', 'venue': 'arXiv.org', 'year': 2022, 'paperId': 'fa48dd52be56dc5ee442a3ab5d754f382f88bb3a', 'citationCount': 7, 'openAccessPdf': None, 'authors': ['Virginia K. Felkner', 'Ho-Chun Herbert Chang', 'Eugene Jang', 'Jonathan May'], 'externalIds': {'ArXiv': '2206.11484', 'DBLP': 'journals/corr/abs-2206-11484', 'DOI': '10.48550/arXiv.2206.11484', 'CorpusId': 249953686}}<br>

---

**`Source Node 10/10`**

**Node ID:** 4a469bb7-cbf2-49eb-b391-4238c16bb674<br>**Similarity:** -0.08659150915817686<br>**Text:** Source 10:
Persistent Anti-Muslim Bias in Large Language Models It has been observed that large-s...<br>**Metadata:** {'title': 'Persistent Anti-Muslim Bias in Large Language Models', 'venue': 'AAAI/ACM Conference on AI, Ethics, and Society', 'year': 2021, 'paperId': '4c2733d191e347753bb28afa46a1c55c65e085be', 'citationCount': 204, 'openAccessPdf': 'https://arxiv.org/pdf/2101.05783', 'authors': ['Abubakar Abid', 'Maheen Farooqi', 'James Y. Zou'], 'externalIds': {'DBLP': 'journals/corr/abs-2101-05783', 'ArXiv': '2101.05783', 'DOI': '10.1145/3461702.3462624', 'CorpusId': 231603388}}<br>
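
Each source node above begins with a `Source N:` label, which is what lets a response refer to its sources as `[N]`. A minimal sketch of that labeling step follows. It assumes chunks are cut by whitespace-separated words rather than by model tokens, which is a simplification, and `label_citation_chunks` with its parameters is a hypothetical name, not the `CitationQueryEngine` API.

```python
from typing import List


def label_citation_chunks(texts: List[str], chunk_size: int = 512) -> List[str]:
    """Split each text into chunks of at most `chunk_size` words and
    prefix every chunk with a running 'Source N:' label.

    Illustration only: word-based splitting stands in for a real
    token-based text splitter.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    labeled: List[str] = []
    for text in texts:
        words = text.split()
        for start in range(0, len(words), chunk_size):
            chunk = " ".join(words[start:start + chunk_size])
            # Numbering runs across all texts, so labels stay unique.
            labeled.append(f"Source {len(labeled) + 1}:\n{chunk}")
    return labeled


# Placeholder strings stand in for retrieved abstracts.
placeholder_texts = ["alpha beta gamma delta epsilon", "zeta eta"]
for chunk in label_citation_chunks(placeholder_texts, chunk_size=2):
    print(chunk)
```

In this sketch a smaller `chunk_size` splits one document into more, shorter sources, so each citation points at a narrower span of text.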