# Download Models

```bash
# Model weights are stored with Git LFS, so enable it before cloning
git lfs install
git clone https://huggingface.co/facebook/galactica-30b
git clone https://huggingface.co/facebook/galactica-120b
```

## Web References

### [Meta’s Galactica AI can write your essay in a minute](https://dataconomy.com/2022/11/meta-galactica-ai-demo-model-how-use-paper/)

This introductory article covers what the Galactica AI model promises. It is a good starting point for understanding the model and what it can do.


### [How to access Scientific Knowledge with Galactica](https://towardsai.net/p/l/how-to-access-scientific-knowledge-with-galactica)

This article provides various prompt engineering examples:

#### Papers and Surveys

This is particularly useful when you need a comprehensive survey of a topic. Write the prompt as `Title: TOPIC, A Survey` and Galactica will generate one for you.

```python
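import galai as gal  # assumes the galai package is installed

model = gal.load_model("standard")  # assumed model size; swap in your own

# generate paper document
input_text = "Title: Self-Supervised Learning, A Survey\n\nAuthors: John Smith\n\n"

generated_text = model.generate(input_text, new_doc=True)
print(generated_text)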
```

#### Lecture Notes and Wikipedia Articles

For Wikipedia-style articles or lecture notes, begin the prompt with `#`.

```python
# generate wiki style articles
input_text = "# Multi-Head Attention\n\n"

generated_text = model.generate(input_text, new_doc=True)
print(generated_text)
```
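The two document-style prompt formats above can be wrapped in small helpers so the layout stays consistent across calls. This is a minimal sketch: `survey_prompt` and `wiki_prompt` are hypothetical names introduced here, not part of any Galactica library, and they only build strings in the formats shown above.

```python
from typing import Optional


def survey_prompt(topic: str, authors: Optional[str] = None) -> str:
    """Build a 'Title: TOPIC, A Survey' prompt, optionally with an Authors line."""
    prompt = f"Title: {topic}, A Survey\n\n"
    if authors:
        prompt += f"Authors: {authors}\n\n"
    return prompt


def wiki_prompt(title: str) -> str:
    """Build a Wikipedia-style or lecture-note prompt that starts with '#'."""
    return f"# {title}\n\n"


if __name__ == "__main__":
    # Show the raw strings, including the trailing blank lines
    print(repr(survey_prompt("Self-Supervised Learning", "John Smith")))
    print(repr(wiki_prompt("Multi-Head Attention")))
```

The returned string can be passed as `input_text` to `model.generate(..., new_doc=True)` in the same way as the hand-written prompts.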

#### Citation Prediction

To search for a paper on a given topic, use a prompt of the form `PAPER TOPIC [START_REF]`. Because the `[START_REF]` token is appended to the end of the input context, Galactica treats it as the beginning of a citation and completes the rest.

```python
# search citation
input_text = "An NLP paper that compares different ways of encoding positions in Transformer-based architectures "

generated_text = model.generate(input_text + "[START_REF]")
print(generated_text)
```

Given a prompt such as `TEXT [START_REF]`, Galactica can automatically suggest citations relevant to the `TEXT`.

```python
# predict citation
input_text = """Recurrent neural networks, long short-term memory and gated recurrent neural networks in particular, have been firmly established as state of the art approaches in sequence modeling and transduction problems such as language modeling and machine translation """

generated_text = model.generate(input_text + "[START_REF]")
print(generated_text)
```
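Both citation prompts share the same shape, so the suffix handling and the parsing of the result can be factored out. The sketch below rests on one assumption not stated above: that a completed citation is closed by an `[END_REF]` token. The helper names are hypothetical, and the sample string is hand-written for illustration, not real model output.

```python
import re
from typing import List

START_REF = "[START_REF]"
END_REF = "[END_REF]"  # assumption: citations are closed with this token


def citation_prompt(text: str) -> str:
    """Append [START_REF] to the text, separated by exactly one space."""
    return text.rstrip() + " " + START_REF


def extract_citations(generated: str) -> List[str]:
    """Return the text found between each [START_REF] ... [END_REF] pair."""
    pattern = re.escape(START_REF) + r"(.*?)" + re.escape(END_REF)
    return [match.strip() for match in re.findall(pattern, generated, flags=re.DOTALL)]


if __name__ == "__main__":
    # Hand-written placeholder, not actual Galactica output
    sample = "Some context [START_REF] Some Paper Title, Some Author[END_REF] more text"
    print(citation_prompt("A paper about position encodings "))
    print(extract_citations(sample))
```

If the generated text contains no closing token, `extract_citations` returns an empty list, so callers can tell a missing citation apart from an empty one.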

The remainder of the article discusses _Downstream NLP Tasks_ such as:

- **Summarization** - simply append `'\n\nTLDR:'` to the input text.
- **Entity Extraction**
- **Question Answering** - `input_text = f"Question: {query}\n\nAnswer:"`
- **Multi-modal Tasks** - process modalities other than unstructured text, such as LaTeX equations, code, SMILES formulas, DNA sequences, and amino acid sequences.
- **Math** - Mathematical equations are represented as LaTeX, which Galactica wraps in brackets: `\[ EQUATION \]`. For example, `generated_text = model.generate(input_text + "\\[")`. Galactica can also convert mathematical equations into code.
- **Code**
- **Chemical Understanding**
- **Biological Understanding**
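
The prompt suffixes listed for summarization, question answering, and math can be collected into a few helpers. This is a sketch only: the function names are hypothetical, and the code does nothing beyond building the strings described in the list above.

```python
def summarization_prompt(text: str) -> str:
    """Append the TLDR marker that asks for a summary of the text."""
    return text + "\n\nTLDR:"


def question_prompt(query: str) -> str:
    """Wrap a query in the Question/Answer format."""
    return f"Question: {query}\n\nAnswer:"


def math_prompt(text: str) -> str:
    """Append an opening LaTeX bracket so the continuation is an equation."""
    # The backslash is escaped; a bare "\[" is an invalid escape sequence
    # in a Python string literal and triggers a warning.
    return text + "\\["


if __name__ == "__main__":
    print(repr(summarization_prompt("Some abstract text.")))
    print(repr(question_prompt("What is the Schwarzschild radius?")))
    print(repr(math_prompt("The Schwarzschild radius is defined as: ")))
```

Each helper returns a plain string that can be passed to `model.generate` like the earlier prompts.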