GitHub - foundersandcoders/pedagogue: An LLM-powered assistant for democratic/peer-led learning cohorts

An AI-powered curriculum generation tool for peer-led learning cohorts. Pedagogue uses Claude AI with optional web research to generate comprehensive, up-to-date module specifications based on your project briefs, skills requirements, and research topics.

What Pedagogue Does

Pedagogue takes your existing curriculum materials (projects, skills, research topics) and generates a complete module specification that includes:

Learning objectives and module overview
Detailed project briefs with examples and success criteria
Research topics with guidance for learners
Additional skills categorized by importance
Project "twists" to add interesting challenges

The key value: Pedagogue can use web search to ensure your curriculum reflects current industry practices and technologies, not just what Claude knew at training time.

📝 Change Tracking & Provenance

Pedagogue now includes comprehensive change tracking to support the cascade pattern - where AI-generated modules are updated iteratively whilst maintaining human oversight:

Automatic Changelog: Every generated module includes a detailed changelog documenting what changed, why, and with what confidence level
Confidence Scoring: Changes are marked as high/medium/low confidence to help reviewers prioritize what needs scrutiny
Research Citations: When web research informs a decision, sources are automatically cited
Provenance Tracking: Metadata shows when the module was generated, by which AI model, and flags sections needing human review

This enables councils to:

Quickly identify what's been updated since the last version
Focus review time on low-confidence changes
Understand the rationale behind AI-proposed updates
Track which sections have accumulated multiple AI updates without human review

See /docs/changelog-schema-design.md for technical details.

Installation & Setup

Prerequisites

Node.js 16+ and npm
Anthropic API key (required)

1. Get an Anthropic API Key

Visit console.anthropic.com
Sign up or log in to your Anthropic account
Navigate to API Keys in the dashboard
Click Create Key and copy your API key

Cost Information:

Pay-as-you-go pricing based on tokens used
Claude Sonnet 4.5 costs approximately $3 per million input tokens, $15 per million output tokens
Typical module generation: 20,000-40,000 input tokens + 8,000-15,000 output tokens
Estimated cost per module: $0.20-$0.40 (with research enabled)
New accounts often include free credits to get started

See Anthropic's pricing page for current rates.

2. Clone and Install

# Clone the repository
git clone https://github.com/your-org/pedagogue.git
cd pedagogue

# Install dependencies
npm install

3. Configure Environment

# Copy the example environment file
cp .env.example .env

Open .env in your text editor and add your API key:

# Anthropic API Configuration
ANTHROPIC_API_KEY=sk-ant-api03-xxxxx  # Replace with your actual key

Important:

Never commit your .env file to version control (it's already in .gitignore)
Keep your API key secure - treat it like a password
Don't share your key in screenshots, issues, or public forums

4. Run the Application

# Start the development server
npm run dev

The app will be available at http://localhost:5173

Open this URL in your browser to start using Pedagogue.

5. Verify Setup

To confirm everything is working:

Open http://localhost:5173
Upload three minimal XML files (see Input File Format)
Fill in the structured form
Click "Generate Module"
You should see progress updates and successful generation

If you see "ANTHROPIC_API_KEY not configured" - check that:

Your .env file exists in the project root
The API key is correctly set in .env
You've restarted the dev server after adding the key

Deployment

For production deployment:

# Build the application
npm run build

# Preview the production build
npm run preview

See SvelteKit's deployment documentation for platform-specific instructions (Vercel, Netlify, etc.).

Production environment variables:

Set ANTHROPIC_API_KEY in your hosting platform's environment variables
Never hardcode API keys in your source code

Quick Start

1. Prepare Your Input Files

You'll need three XML files:

projects.xml - Project briefs your learners will work on
skills.xml - Additional skills to incorporate (optional but recommended)
research.xml - Research topics for learners to explore

See Input File Format below for detailed requirements.

2. Upload Files

Upload all three XML files in Step 1 of the interface.

3. Provide Context

Fill in the structured input form with:

Module duration (weeks)
Expected cohort size
Learners' experience levels
Required technologies
Any additional requirements

4. Generate Module

Review your inputs and click "Generate Module". With web research enabled (recommended), generation typically takes 2-4 minutes.

5. Download Result

Once generation completes, download your module specification as XML.

Input File Format

Required Structure

Each input file must have a specific root element. XML is case-sensitive and tag names matter.

projects.xml

<Projects>
  <!-- Your project briefs here -->
  <ProjectBriefs>
    <ProjectBrief>
      <Overview>
        <Name>Project Name</Name>
        <Task>What learners will build</Task>
        <Focus>Key technologies and techniques</Focus>
      </Overview>
      <Criteria>
        Success criteria as bullet points
      </Criteria>
      <Skills>
        <Skill>
          <Name>Skill Name</Name>
          <Details>What learners will learn</Details>
        </Skill>
      </Skills>
      <Examples>
        <Example>
          <Name>Example Name</Name>
          <Description>Brief description</Description>
        </Example>
      </Examples>
    </ProjectBrief>
  </ProjectBriefs>
</Projects>

Minimal valid file:

<Projects>
</Projects>

skills.xml

<AdditionalSkills>
  <!-- Your skills categories here -->
  <SkillsCategory>
    <Name>Category Name (e.g., "Python")</Name>
    <Skill>
      <SkillName>Specific Skill</SkillName>
      <Importance>Recommended / Stretch / Essential</Importance>
      <SkillDescription>Brief description</SkillDescription>
    </Skill>
  </SkillsCategory>
</AdditionalSkills>

Minimal valid file:

<Skills>
</Skills>

Note: You can use either <AdditionalSkills> (for structured skills) or <Skills> (for flexible content).

research.xml

<ResearchTopics>
  <!-- Your research topics here -->
  <PrimaryTopics>
    <PrimaryTopic>
      <TopicName>Topic Name</TopicName>
      <TopicDescription>
        What to research and how to approach it
      </TopicDescription>
    </PrimaryTopic>
  </PrimaryTopics>
</ResearchTopics>

Minimal valid file:

<Research>
</Research>

Note: You can use either <ResearchTopics> (structured) or <Research> (flexible).

Common XML Gotchas

1. Unescaped Special Characters

Problem: XML has special characters that must be escaped.

Common errors:

<!-- WRONG - will cause parse error -->
<Task>Compare values with < and > operators</Task>

<!-- CORRECT -->
<Task>Compare values with &lt; and &gt; operators</Task>

Characters to escape:

& → &
< → <
> → >
" → " (in attributes)
' → ' (in attributes)

2. Mismatched or Unclosed Tags

<!-- WRONG -->
<Name>Project Name</name>

<!-- CORRECT - tags are case-sensitive -->
<Name>Project Name</Name>

<!-- WRONG -->
<Overview>
  <Name>Project Name
</Overview>

<!-- CORRECT - all tags must close -->
<Overview>
  <Name>Project Name</Name>
</Overview>

3. Invalid XML Declaration

If you include an XML declaration, it must be the first line with no whitespace before it:

<!-- WRONG -->

<?xml version="1.0" encoding="UTF-8"?>

<!-- CORRECT -->
<?xml version="1.0" encoding="UTF-8"?>
<Projects>

Note: The XML declaration is optional. If omitted, UTF-8 encoding is assumed.

4. Comments Inside Tags

<!-- WRONG -->
<Name <!-- This is the project name --> >Project</Name>

<!-- CORRECT -->
<!-- This is the project name -->
<Name>Project</Name>

5. Empty Elements

Both formats are valid:

<ProjectBriefs></ProjectBriefs>
<!-- or -->
<ProjectBriefs />

Using the Structured Input Form

The form provides context that helps Claude generate a more targeted module specification.

Required Fields

Module Duration (weeks)

How long learners will spend on this module
Influences the depth and breadth of content
Range: 1-52 weeks

Expected Cohort Size

Number of learners
Helps Claude recommend appropriate group activities and scale
Range: 1-1000 learners

Experience in Related Fields

Prior experience in broader field (e.g., software development)
Options: <1 year, 1-3 years, 4+ years

Experience in Course Focus

Prior experience in this specific domain (e.g., AI engineering)
Options: None, Limited, Skilled Amateur, Current Professional

Optional Fields

Expected Delivery Date

When the module will run
If provided, helps research focus on technologies/practices relevant at that time

Required Technologies & Techniques

Select from common options or add custom technologies
Claude will incorporate these into project briefs and skills
Be specific: "FastAPI" not just "Python"

Additional Information

Any special requirements, constraints, or goals
Example: "Learners must deploy to cloud platforms" or "No paid services"
Max 1000 characters

Generation Options

Enable Deep Research (Recommended)

Allows Claude to use web search during generation
Searches trusted domains: vendor docs, established tech publications, academic sources
Ensures curriculum reflects current best practices
Adds 1-2 minutes to generation time
This is the core value of Pedagogue - without research, you're just reformatting existing content

Use Extended Thinking

Gives Claude more time to reason about complex requirements
Useful for modules with many dependencies or advanced topics
Adds minimal time but improves quality for complex modules

How Generation Works

What Gets Sent to Claude

Your three input files (projects, skills, research)
Structured form data (cohort size, duration, experience levels, etc.)
Schema requirements (the exact XML structure Claude must produce)
Generation instructions (how to synthesize inputs into a cohesive module)

What Claude Does

Analyzes your inputs to understand learning outcomes
If research enabled: Uses web search to verify content is current
- Checks if technologies/practices are outdated
- Updates recommendations based on current industry trends
- Limits: 5 searches per generation
- Allowed domains: anthropic.com, openai.com, langchain.com, github.com, stackoverflow.com, arxiv.org, and other trusted tech sources
Synthesizes everything into a comprehensive module specification
Validates output against the schema (see Output Schema)

Validation & Retry

Pedagogue automatically validates Claude's output against the required schema. If validation fails (e.g., too few objectives, missing required sections), it will:

Show you the validation error
Automatically retry with feedback (up to 3 attempts)
Include validation errors in the retry prompt so Claude can fix them

You'll see progress updates during this process.

Generation Time

Without research: 30-60 seconds
With research: 2-4 minutes (searches take time but are worth it)

Output Schema

The generated module follows this structure:

<Module>
  <Metadata>
    <!-- NEW: Automatic change tracking and provenance -->
    <GenerationInfo>
      <Timestamp>2025-10-11T14:30:00Z</Timestamp>
      <Source>AI-Generated</Source>
      <Model>claude-sonnet-4-5-20250929</Model>
    </GenerationInfo>
    <Changelog>
      <!-- Documents what changed, why, and with what confidence -->
      <Change>
        <Section>ModuleObjectives/Objective[1]</Section>
        <Type>content_update</Type>
        <Confidence>high</Confidence>
        <Summary>Updated to reflect latest best practices</Summary>
        <Rationale>Industry has moved to newer patterns...</Rationale>
        <ResearchSources>
          <Source url="...">Documentation reference</Source>
        </ResearchSources>
      </Change>
    </Changelog>
  </Metadata>

  <ModuleOverview>
    <ModuleDescription>...</ModuleDescription>
    <ModuleObjectives>
      <!-- Minimum 3 distinct objectives -->
      <ModuleObjective>
        <Name>...</Name>
        <Details>...</Details>
      </ModuleObjective>
    </ModuleObjectives>
  </ModuleOverview>

  <ResearchTopics>
    <PrimaryTopics>
      <!-- Minimum 5 distinct topics -->
      <PrimaryTopic>
        <TopicName>...</TopicName>
        <TopicDescription>...</TopicDescription>
      </PrimaryTopic>
    </PrimaryTopics>
    <StretchTopics>
      <!-- Optional extension topics -->
    </StretchTopics>
  </ResearchTopics>

  <Projects>
    <ProjectBriefs>
      <!-- Minimum 2 distinct briefs -->
      <ProjectBrief>
        <Overview>...</Overview>
        <Criteria>...</Criteria>
        <Skills>
          <!-- Minimum 3 distinct skills per brief -->
        </Skills>
        <Examples>
          <!-- Minimum 3 significantly different examples -->
        </Examples>
      </ProjectBrief>
    </ProjectBriefs>

    <ProjectTwists>
      <!-- Minimum 2 interesting twists -->
      <ProjectTwist>
        <Name>...</Name>
        <Task>...</Task>
        <ExampleUses>
          <!-- Minimum 2 examples -->
        </ExampleUses>
      </ProjectTwist>
    </ProjectTwists>
  </Projects>

  <AdditionalSkills>
    <!-- Supplementary skills organized by category -->
  </AdditionalSkills>

  <Notes>
    <!-- Any additional context that doesn't fit elsewhere -->
  </Notes>
</Module>

Validation Requirements

The schema enforces:

✅ At least 3 module objectives
✅ At least 5 primary research topics
✅ At least 2 project briefs
✅ At least 3 skills per project brief
✅ At least 3 examples per project brief
✅ At least 2 project twists
✅ At least 2 example uses per twist
✅ Proper XML structure with matching tags
✅ All required sections present

If any requirement isn't met, Pedagogue will automatically retry with specific feedback.

Information Sources

When Research is Disabled

Claude uses only:

Your input files (projects, skills, research)
Your structured form data
Claude's training data (knowledge cutoff: January 2025)

When Research is Enabled

Claude additionally searches these trusted domains:

AI Platforms & Vendors

anthropic.com, openai.com, huggingface.co

Documentation Sites

js.langchain.com, python.langchain.com, modelcontextprotocol.io, docs.python.org

Developer Resources

github.com, stackoverflow.com, dev.to, medium.com

Tech News & Analysis

techcrunch.com, venturebeat.com

Technical Blogs

simonwillison.net, newsletter.pragmaticengineer.com

Academic Sources

arxiv.org, acm.org, ieee.org

This ensures recommendations come from authoritative sources while staying current.

Troubleshooting

"ANTHROPIC_API_KEY not configured"

Cause: The application can't find your Anthropic API key.

Solutions:

Check that .env file exists in the project root directory
Verify your API key is correctly set: ANTHROPIC_API_KEY=sk-ant-api03-xxxxx
Restart the development server after adding/modifying .env
Ensure there are no extra spaces around the = sign in .env
For production deployments, confirm environment variables are set in your hosting platform

See Installation & Setup for detailed configuration instructions.

"Failed to parse XML file"

Cause: Invalid XML syntax in your input files.

Solutions:

Check for unescaped special characters (&, <, >)
Verify all tags are properly closed and match case
Ensure root element is correct (<Projects>, <Skills>/<AdditionalSkills>, <Research>/<ResearchTopics>)
Validate your XML with an online validator

"Validation failed after 3 attempts"

Cause: Claude couldn't produce valid output meeting schema requirements.

Solutions:

Check if your input files are very sparse (add more detail to help Claude)
Verify "Additional Information" field doesn't have contradictory requirements
Try disabling "Extended Thinking" (sometimes simpler is better)
Review the validation errors shown - they indicate what's missing

"Generation timed out"

Cause: Request took longer than 2 minutes (usually with research enabled).

Solutions:

Try again (web searches can occasionally be slow)
Simplify your inputs (fewer technologies, shorter additional info)
Disable research temporarily to get a baseline module

Output doesn't match your requirements

Improvements:

Add more detail to your input XML files (Claude follows your examples)
Use the "Additional Information" field to specify constraints
Be specific about technologies (not "databases" but "PostgreSQL")
Enable research if disabled (gets current best practices)

API Rate Limits or Billing Issues

Symptoms:

429 errors ("rate limit exceeded")
402 errors ("insufficient credits")
Slow response times

Solutions:

Rate limits: Wait a minute and try again. Anthropic has per-minute rate limits on API usage
Insufficient credits: Check your Anthropic console billing section
- Add credits to your account
- Verify your payment method is valid
High costs: Each module generation costs $0.20-$0.40 with research enabled
- Disable research for testing/development to reduce costs
- Use simpler input files during iteration

Tips for Best Results

1. Provide Rich Input Files

The more detail in your projects, skills, and research files, the better Claude understands what you want:

✅ Include multiple project briefs with detailed criteria
✅ Specify 3-5 skills per project with explanations
✅ Provide 3-4 examples per project to show diversity
❌ Don't use empty/minimal input files

2. Be Specific About Technologies

Claude responds better to specific technologies than generic categories:

✅ "React with TypeScript", "FastAPI", "PostgreSQL"
❌ "Frontend framework", "Backend", "Database"

3. Use Additional Information Wisely

Great uses:

"Learners must deploy projects to cloud platforms"
"No paid services - use only free tiers"
"Projects should emphasize testing and CI/CD"

Not helpful:

Repeating what's already in structured fields
Vague requests like "make it good"

4. Enable Research for Current Topics

Essential for:

✅ Fast-moving fields (AI/ML, web frameworks)
✅ Modules being delivered 3+ months from now
✅ Emerging technologies or practices

Less important for:

Stable, foundational topics (SQL basics, algorithms)
Historical or theoretical modules

5. Iterate

Pedagogue excels at refinement:

Generate a baseline module
Review what Claude produced
Adjust your inputs based on what you learned
Generate again

Model Information

Pedagogue uses Claude Sonnet 4.5 (claude-sonnet-4-5-20250929):

200K token context window (can handle large input files)
Up to 64K output tokens (comprehensive modules)
Web search capability (when enabled)
Extended thinking capability (when enabled)

License & Support

Pedagogue is built for peer-led learning communities.

For issues or questions: Create an issue in the repository or check the /docs folder for technical documentation.

Note: You'll need an Anthropic API key to use Pedagogue. See the Installation & Setup section above for detailed instructions.

Example Workflow

Let's walk through generating a module about "AI Agents and Tool Use":

Create projects.xml with two project briefs:
- "Web Research Agent" - uses search APIs
- "Personal Assistant Agent" - integrates calendar/email
Create skills.xml listing Python, API integration, prompt engineering
Create research.xml with topics like:
- "ReAct framework and reasoning patterns"
- "LangChain agent abstractions"
- "Function calling with OpenAI/Anthropic"
Fill the form:
- Duration: 3 weeks
- Cohort: 12 learners
- Experience: Limited experience in AI, 1-3 years in software dev
- Technologies: Python, LangChain, FastAPI
- Enable research: ✅
Generate - takes ~3 minutes with research
Review & Download - Claude produces:
- 3-5 learning objectives
- 5-8 research topics with guidance
- 2 detailed project briefs with examples
- Additional skills (testing agents, managing state, error handling)
- Project twists (multi-agent systems, human-in-the-loop)
Refine (if needed):
- Add more specific examples to projects.xml
- Request different deployment targets in "Additional Information"
- Re-generate

That's it! You now have a comprehensive, current module specification ready for your learners.

Name		Name	Last commit message	Last commit date
Latest commit History 82 Commits
docs		docs
src		src
static		static
.env.example		.env.example
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
README.md		README.md
package-lock.json		package-lock.json
package.json		package.json
svelte.config.js		svelte.config.js
vite.config.js		vite.config.js

foundersandcoders/pedagogue

Folders and files

Latest commit

History

Repository files navigation

What Pedagogue Does

📝 Change Tracking & Provenance

Installation & Setup

Prerequisites

1. Get an Anthropic API Key

2. Clone and Install

3. Configure Environment

4. Run the Application

5. Verify Setup

Deployment

Quick Start

1. Prepare Your Input Files

2. Upload Files

3. Provide Context

4. Generate Module

5. Download Result

Input File Format

Required Structure

projects.xml

skills.xml

research.xml

Common XML Gotchas

1. Unescaped Special Characters

2. Mismatched or Unclosed Tags

3. Invalid XML Declaration

4. Comments Inside Tags

5. Empty Elements

Using the Structured Input Form

Required Fields

Optional Fields

Generation Options

How Generation Works

What Gets Sent to Claude

What Claude Does

Validation & Retry

Generation Time

Output Schema

Validation Requirements

Information Sources

When Research is Disabled

When Research is Enabled

Troubleshooting

"ANTHROPIC_API_KEY not configured"

"Failed to parse XML file"

"Validation failed after 3 attempts"

"Generation timed out"

Output doesn't match your requirements

API Rate Limits or Billing Issues

Tips for Best Results

1. Provide Rich Input Files

2. Be Specific About Technologies

3. Use Additional Information Wisely

4. Enable Research for Current Topics

5. Iterate

Model Information

License & Support

Example Workflow

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages