# Instructions for Evaluating Rosie Corpus Documents

Welcome to the active learning evaluation notebook. It guides you through labeling documents to improve our classifier. Please follow the steps below.

## Task Orientation

You will evaluate document passages from the Rosie Corpus. Review each passage and annotate it as "in domain" or "not in domain" based on the following criteria:
- **In Domain:** Documents that are health-relevant, even if they are not specifically about maternal or infant health.
- **Not In Domain:** Documents with no meaningful content, such as those made up only of references, location information, or bare tables of contents.

## Steps to Follow

1. **Review the Document:**
    - Each document will be presented with a unique `Document ID` and `Document Content`.
2. **Provide a Label:**
    - You will be prompted to provide a label for each document.
    - Enter `0` if the document is **not in domain**, or `1` if the document is **in domain**.
    - If you enter anything else, an error message appears in red. Enter either `0` or `1` to continue.
3. **Separator Line:**
    - After each document, a magenta separator line marks the end of the current document and the start of the next one.
4. **Completion Message:**
    - Once the active learning loop is completed, a completion message will be displayed in large, green font.
    
<div style="border: 2px solid #4CAF50; padding: 10px; border-radius: 5px; background-color: #f9f9f9;">
  <strong>Important!</strong> If you are unsure whether a document is In Domain, label it as In Domain (enter <code>1</code>).
</div>
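
The labeling steps above can be sketched as a small validation loop. This is only an illustration of the described behavior, not the actual `Blade` implementation: the names `parse_label` and `ask_for_label`, the prompt text, and the ANSI color codes are all assumptions made for this example.

```python
# Hypothetical sketch of the prompt-and-validate behavior described above.
# None of these names come from the blade module.
from typing import Callable, Optional

RED = "\033[91m"      # assumed color for error messages
MAGENTA = "\033[95m"  # assumed color for the separator line
RESET = "\033[0m"


def parse_label(raw: str) -> Optional[int]:
    """Return 0 or 1 for a valid entry, or None for anything else."""
    cleaned = raw.strip()
    if cleaned in ("0", "1"):
        return int(cleaned)
    return None


def ask_for_label(doc_id: str, content: str,
                  read: Callable[[str], str] = input) -> int:
    """Show one document and re-prompt until the annotator enters 0 or 1."""
    print(f"Document ID: {doc_id}")
    print(f"Document Content: {content}")
    while True:
        label = parse_label(read("Label (0 = not in domain, 1 = in domain): "))
        if label is not None:
            # Separator marking the end of this document.
            print(f"{MAGENTA}{'-' * 60}{RESET}")
            return label
        print(f"{RED}Invalid input. Please enter either 0 or 1.{RESET}")
```

Passing the `read` function as a parameter keeps the sketch testable without a live prompt; in a notebook the default `input` would be used.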

Please proceed by running the following cells. From the top menu, select <em>Run > Run All Cells</em>. You will then be prompted to enter your annotations for the documents.

In [4]:
import sys

# Add the parent directory to the import path so that `blade` can be imported.
sys.path.append("../")
from blade import Blade

# Configure Blade with the model, the source passages, and the saved state file.
blade = Blade(
    model_path="/home/jovyan/work/data/models/filtering/rosie_1_10",
    source_path="/home/jovyan/work/data/source/corpus_rosie/passages/translated/df_1.parquet",
    lang="EN",
    blade_state_path="/home/jovyan/work/data/models/filtering/rosie_1_10/blade_state_EN.pkl",
)

In [None]:
# Start the interactive labeling loop (0.1 minutes = 6 seconds).
blade.active_learning_loop(max_duration_minutes=0.1)