Week 1 – Data Collection Phase

During the first week, we focused on preparing the data collection infrastructure for the Bannerlord Database project. A Python virtual environment was created, and the project structure was organized into scrape/, data/, and sql/ directories. We implemented a basic web scraper using the Fandom (MediaWiki) API to retrieve HTML content for individual Lord pages. The scraper parses infobox details (name, age, gender, level) and extracts traits and skill data sections.

Sample data was collected for three Lords (Caladog, Lucon, Rhagaea) and stored in CSV format (lords.csv, lord_traits.csv, lord_skills.csv). SQL DDL files defining the lords, lord_traits, and lord_skills tables were also drafted.

This week’s goal was to establish the foundation for automated data gathering, which will later be imported into the relational database.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
data/processed		data/processed
scrape		scrape
sql		sql
.gitignore		.gitignore
README.md		README.md
lords_list.txt		lords_list.txt
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Week 1 – Data Collection Phase

About

Uh oh!

Releases

Packages

Languages

rixemre/database_project

Folders and files

Latest commit

History

Repository files navigation

Week 1 – Data Collection Phase

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages