**Smart Cultural Storytelling:
AI-Generated Cultural Narratives with Interactive Choices**

**Selected Track:** Smart Cultural Storyteller  
**Course:** Minor in Artificial Intelligence  
**Institution:** IIT Ropar  
**Student Name:** Muskan Yadav  
**Date:** 17 January 2026

## 1. Project Overview

This project explores how pre-trained AI tools can be combined to create engaging cultural stories in audio-visual formats. The system is designed to make cultural education more appealing to younger audiences who prefer short videos and interactive content.

The project has two modes:
- **Mode 1:** Cultural Story Generator (AI-generated videos)
- **Mode 2:** “What If?” Interactive Storytelling (choice-based narratives)

The focus of this project is not on training AI models, but on designing a complete AI-driven system using existing generative tools responsibly.

## 2. Problem Statement

Many cultural stories, myths, and festival origins are still shared through books or static content, which often fails to engage Gen-Z and Gen-Alpha audiences. At the same time, these generations actively consume reels, short videos, and interactive media.

This project addresses this gap by using AI to transform cultural stories into engaging audio-visual narratives while keeping the content respectful and educational.

## 3. Motivation & Target Audience

**Target Audience**
- Gen-Z and Gen-Alpha learners
- Educators and cultural content creators

**Motivation**
- Declining engagement with traditional cultural education
- Popularity of reels and short-form videos
- Opportunity to use AI for cultural preservation in a modern format

## 4. Data Understanding & Preparation

This project does not use a traditional dataset such as CSV or tabular data. Instead, it relies on cultural knowledge embedded in pre-trained AI models and curated prompts.

The outputs of these models (scripts, images, audio, and videos) act as the primary data used in the system.

### Data Sources Used

| Data Type                | Tool Used              |
|--------------------------|------------------------|
| Story Scripts            | Claude AI              |
| Images                   | Banana Pro and ChatGPT |
| Audio Narration (Mode 2) | gTTS                   |
| Video & Audio (Mode 1)   | Canva AI               |

### Data Cleaning & Curation

- Generated scripts were manually reviewed for cultural accuracy
- Story length was adjusted to suit reel-style videos
- Audio narration speed and clarity were standardized
- Unsupported combinations were handled through system-level fallbacks
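
The length adjustment above can be approximated with a rough duration estimate that flags scripts too long for a reel. This is only a minimal sketch: the helper names, the 150 words-per-minute pace, and the 60-second limit are assumptions for illustration, not values taken from the project.

```python
# Hypothetical helpers: estimate narration time from a script's word count.
WORDS_PER_MINUTE = 150   # assumed speaking pace, not measured from the project's audio
MAX_REEL_SECONDS = 60    # assumed upper limit for a reel-style video


def estimate_seconds(script: str, wpm: int = WORDS_PER_MINUTE) -> float:
    """Return the approximate narration time for a script, in seconds."""
    return len(script.split()) / wpm * 60


def fits_reel(script: str, limit: float = MAX_REEL_SECONDS) -> bool:
    """Return True if the estimated narration time is within the limit."""
    return estimate_seconds(script) <= limit


sample = "In a quiet village near a forest lived a poor woodcutter."
print(f"{estimate_seconds(sample):.1f} s, fits: {fits_reel(sample)}")
```

A word-count estimate is crude, but it is enough to catch scripts that clearly overshoot before any audio or video is produced.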

## 5. System Design & Architecture

The system follows a modular pipeline in which different AI tools handle different stages of storytelling. Each component works independently, and the outputs are then combined in a single user-facing website.

The design focuses on simplicity, transparency, and responsible AI usage.

### AI Techniques Used

- Large Language Models (LLMs) for story generation
- Generative image models for visual storytelling
- Text-to-Speech (TTS) for narration
- Rule-based logic for interactive branching

No custom model training was performed in this project.

## Website Information 
The website was generated using Google Antigravity and was developed as a local prototype.
A public deployment of the prototype is hosted on Netlify: https://smartculturalstorytelling.netlify.app
All features are demonstrated via screenshots and a recorded demo video.

## 6. Mode 1: Cultural Story Generator

Mode 1 allows users to generate cultural stories by selecting:
- Country (India, Japan)
- Story Theme (Festival, Myth, Legend)
- Visual Style (Anime, Storybook Illustration)

Based on the selection, a pre-generated AI video is displayed.  
Both video and audio for Mode 1 were created using Canva AI.
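
The selection-to-video step described above could be sketched as a dictionary lookup keyed by the three choices. The catalog entries, file paths, and the `find_video` name are hypothetical placeholders, not the project's actual assets or code.

```python
from typing import Optional

# Hypothetical catalog: (country, theme, style) -> pre-generated video file.
# The option names come from this report; the paths are illustrative only.
VIDEO_CATALOG = {
    ("India", "Festival", "Storybook Illustration"): "videos/india_festival_storybook.mp4",
    ("Japan", "Myth", "Anime"): "videos/japan_myth_anime.mp4",
}


def find_video(country: str, theme: str, style: str) -> Optional[str]:
    """Return the path of the pre-generated video, or None if none exists."""
    return VIDEO_CATALOG.get((country, theme, style))


print(find_video("Japan", "Myth", "Anime"))
print(find_video("Japan", "Myth", "Cinematic"))
```

Returning `None` for a missing entry, rather than raising an error, leaves the caller free to show a helpful message for unsupported selections.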

<img src="images/pg2.png" width="500" height="250">

**Figure 1:** Mode 1 interface showing country, theme, and style selection.

<img src="images/pg2-1.png" width="500" height="250">

**Figure 2:** Mode 1 interface showing the video displayed after the selections above.


## 7. Mode 2: “What If?” Interactive Storytelling

Mode 2 introduces interactive storytelling where the user makes choices that influence the story outcome.

Unlike Mode 1, creativity in this mode is controlled to ensure cultural accuracy. The implemented story is **The Honest Woodcutter**, an Indian folk tale.

<img src="images/pg3.png" width="500" height="250">

**Figure 3:** Mode 2 interface showing the two decision choices.

In [None]:
# Text-to-Speech generation for Mode 2 using gTTS

!pip install gTTS

from gtts import gTTS
import os

# Create output folder
os.makedirs("audio", exist_ok=True)

# Dictionary of scene narration texts
scenes = {
    "scene1": "In a quiet village near a forest lived a poor woodcutter. His only possession was a simple iron axe, which he used every day to earn his living.",

    "scene2": "One day, while cutting wood near the river, the axe slipped from his hands and sank into the deep water. Without it, he had no way to survive.",

    "scene3": "The river suddenly glowed, and a divine spirit emerged holding a shining golden axe. She asked gently, Is this the axe you lost?",

    "scene4": "The golden axe glittered before him. At this moment, the woodcutter had to make a choice.",

    "scene5A": "Pleased with his honesty, the spirit smiled and returned his iron axe, along with the golden one as a reward.",

    "scene6A": "The woodcutter returned home grateful, knowing honesty had changed his fate.",

    "scene5B": "The spirit’s expression darkened. She vanished into the river, taking all the axes with her.",

    "scene6B": "Left alone by the river, the woodcutter realized that greed had cost him everything."
}

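# Generate one MP3 file per scene.
# Note: gTTS sends the text to Google's text-to-speech service, so this step needs internet access.
for scene, text in scenes.items():
    tts = gTTS(text=text, lang="en", slow=False)  # slow=False keeps the normal speaking pace
    file_path = os.path.join("audio", f"{scene}.mp3")
    tts.save(file_path)

print("Audio narration generated successfully.")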


The above code demonstrates how audio narration was generated for Mode 2 using gTTS. Similar scripts were used to generate narration for all interactive scenes.

Mode 1 narration was generated directly using Canva AI and does not require additional code.

In [None]:
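# Interactive story branching (Python sketch of the website's rule-based logic)

def load_scene(scene_id):
    # Placeholder: the website would show the scene and play audio/<scene_id>.mp3
    print(f"Loading {scene_id}")

def play_ending(user_choice):
    # Map each choice to the scenes that follow it
    endings = {"honesty": ["scene5A", "scene6A"], "greed": ["scene5B", "scene6B"]}
    if user_choice not in endings:
        raise ValueError(f"Unknown choice: {user_choice!r}")
    for scene_id in endings[user_choice]:
        load_scene(scene_id)

play_ending("honesty")  # or "greed"

This sketch represents the decision-based narrative logic used on the website; `load_scene` is a placeholder for the code that displays a scene and plays its narration. Based on the user’s choice, the system loads different story scenes, resulting in different outcomes.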
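
Because each branch depends on narration files generated earlier, a small consistency check can catch missing audio before the story is played. The sketch below assumes the `audio/<scene>.mp3` naming from the gTTS cell and assumes that scenes 1 to 4 are shared by both branches; `missing_audio` is a hypothetical helper, not part of the project's code.

```python
from pathlib import Path

# Scene order per branch, using the scene IDs from the gTTS cell.
# Assumption: scenes 1-4 are common to both branches.
BRANCHES = {
    "honesty": ["scene1", "scene2", "scene3", "scene4", "scene5A", "scene6A"],
    "greed": ["scene1", "scene2", "scene3", "scene4", "scene5B", "scene6B"],
}


def missing_audio(audio_dir: str = "audio") -> list[str]:
    """Return the scene IDs used by any branch that have no .mp3 file."""
    needed = {scene for scenes in BRANCHES.values() for scene in scenes}
    return sorted(s for s in needed if not (Path(audio_dir) / f"{s}.mp3").is_file())


print(missing_audio())
```

An empty list means every scene in both branches has narration available.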

<img src="images/honest.png" width="500" height="250">

**Figure 4:** Mode 2 interface showing the ending after the 'Honest' choice.

<img src="images/greed.png" width="500" height="250">

**Figure 5:** Mode 2 interface showing the ending after the 'Greedy' choice.

## 8. Edge Case Handling

An edge case occurs when a user selects a combination of country, theme, and style for which no pre-generated video exists (for example, the Cinematic style).

Instead of producing an error or misleading output, the system informs the user that the option is unavailable and suggests alternative styles. This ensures transparency and responsible AI behavior.
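
The fallback behaviour described above might look like the following sketch. The set of available combinations, the message wording, and the function names are assumptions made for illustration; the website's actual implementation may differ.

```python
# Hypothetical set of combinations that have a pre-generated video.
AVAILABLE = {
    ("India", "Festival", "Storybook Illustration"),
    ("India", "Myth", "Anime"),
    ("Japan", "Myth", "Anime"),
}


def alternative_styles(country: str, theme: str) -> list[str]:
    """List the styles that do have a video for this country and theme."""
    return sorted(s for (c, t, s) in AVAILABLE if (c, t) == (country, theme))


def respond(country: str, theme: str, style: str) -> str:
    """Return a user-facing message instead of raising an error."""
    if (country, theme, style) in AVAILABLE:
        return f"Playing the {style} video for {country} ({theme})."
    options = alternative_styles(country, theme)
    if options:
        return f"'{style}' is not available yet. Try: {', '.join(options)}."
    return f"No video is available for {country} ({theme}) yet."


print(respond("India", "Myth", "Cinematic"))
```

Suggesting only styles that exist for the same country and theme keeps the user close to their original selection.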

<img src="images/edge.png" width="500" height="250">

**Figure 6:** Mode 1 interface showing the edge-case message for an unavailable option.

## 9. Results & Observations

- Anime style was most engaging for myths and legends
- Storybook illustration worked well for festival stories
- Interactive storytelling improved engagement and moral understanding

## 10. Limitations & Ethical Considerations

**Limitations**
- Limited number of cultures and stories
- Content is pre-generated offline; the website does not generate stories live

**Ethical Considerations**
- Avoided cultural distortion
- No deepfake or impersonation content
- Transparent handling of unsupported options

## 11. Future Scope

- Add more cultures and languages
- Live AI generation using APIs
- More complex interactive stories
- Accessibility features such as subtitles

## 12. Conclusion

This project demonstrates how multiple AI tools can be orchestrated into a meaningful cultural storytelling system. By combining automated content generation with controlled interactivity and responsible design, the system bridges traditional cultural narratives and modern media consumption.