teachrag provides RAG (Retrieval-Augmented Generation) for course materials. It parses teaching materials (slides, scripts, syllabus), builds a DuckDB-backed ragnar store with embeddings, and provides Q&A via Ollama (local) or Claude (API).
Before installing or running teachrag, ensure you have:
- R packages: dplyr (>= 1.2.0) and others (see DESCRIPTION). Run `teachrag::check_dependencies()` to verify, or `teachrag::ensure_dependencies()` to install any that are missing.
- Ollama with the models `qwen2.5:3b` and `nomic-embed-text` (for local Q&A and embeddings)
- Claude API (optional): if you prefer using Claude via the API, set `ANTHROPIC_API_KEY` in your environment to use it instead of a local LLM. You will still need Ollama and `nomic-embed-text` for the embeddings.
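As a minimal sketch, you can set the key for the current session from within R using base functions (the key value shown is a placeholder; no teachrag-specific calls are assumed):

```r
# Set the Claude API key for the current R session only.
# For persistence across sessions, add ANTHROPIC_API_KEY=... to your ~/.Renviron instead.
Sys.setenv(ANTHROPIC_API_KEY = "sk-ant-placeholder")

# Check that the key is visible to the session
nzchar(Sys.getenv("ANTHROPIC_API_KEY"))
```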
```r
# From source
devtools::install("fellennert/teachrag")
```

The package ships with pre-built course data, so you can run Q&A immediately without any setup:
```r
library(teachrag)

# Single-turn Q&A (uses bundled data by default)
ask_rag("What is supervised machine learning?")

# Multi-turn chat
chat_state <- NULL
res1 <- ask_rag_chat(chat_state, "What is supervised machine learning?")
chat_state <- res1$chat_state
res2 <- ask_rag_chat(chat_state, "Can you give an example?")

# Shiny app (with progress bar)
run_app()

# CLI (prints status: Querying database → Producing answer → Fact-checking)
interactive_cli()
```

All of these interfaces report progress: Querying database… → Producing initial answer… → Fact-checking answer…
To parse and index your own course materials:
```r
library(teachrag)
run_setup_wizard()
```

The wizard guides you through choosing directories, parsing materials, building the store, testing a question, and launching the app.
If you have your own intermediate directory and store:
```r
library(teachrag)

intermediate_dir <- "path/to/your/intermediate"  # contains chunks.rds, syllabus.rds, store
store_path <- file.path(intermediate_dir, "teaching_db.ragnar.duckdb")

# Pass explicitly, or set options
options(teachrag.intermediate_dir = intermediate_dir, teachrag.store_path = store_path)

ask_rag("What is supervised machine learning?")
run_app()
```

Note that there is currently no rigorous check that chunks are not too large for nomic-embed-text (i.e., chunk length must stay under its 8,192-token limit). You may want to preprocess your data manually or amend the preprocessing functions (R/parse.R) to prevent issues.
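Until such a check exists in the package, here is a minimal sketch of a guard you could run over your chunks before building the store. It assumes chunks.rds holds a data frame with a `text` column, and uses a rough heuristic of ~4 characters per token rather than an exact tokenizer:

```r
# Flag chunks likely to exceed an embedding model's token limit.
# Heuristic: ~4 characters per token; this is NOT an exact tokenizer.
flag_long_chunks <- function(texts, max_tokens = 8192, chars_per_token = 4) {
  est_tokens <- ceiling(nchar(texts) / chars_per_token)
  which(est_tokens > max_tokens)  # indices of suspiciously long chunks
}

# Usage sketch (assumes chunks.rds stores a data frame with a `text` column):
# chunks <- readRDS(file.path(intermediate_dir, "chunks.rds"))
# too_long <- flag_long_chunks(chunks$text)
# if (length(too_long) > 0) warning(length(too_long), " chunk(s) may be too long")
```

Chunks flagged this way can then be split or shortened before calling `build_store()`.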
```r
# 1. Parse and chunk materials
parse_materials(corpus_dir = "path/to/course_material", output_dir = "path/to/intermediate")

# 2. Build the ragnar store (requires nomic-embed-text via Ollama)
build_store(output_dir = "path/to/intermediate")
```