Chopin Competition Voting Analysis

Analysis of voting results from the Chopin Piano Competition using multiple ranking and elimination methods. This project implements various voting systems to analyze judge scores, determine winners through different methodologies, and evaluate stage transition outcomes.

Overview

This tool analyzes competition voting data where multiple judges score contestants across multiple stages. It handles missing data (NA values) and implements several voting methods:

Copeland/Condorcet Method: Pairwise comparison ranking
IRV (Instant Runoff Voting): Top-down elimination with fractional vote sharing for ties
Bottom Elimination: Reverse runoff method
First Past The Post: Analysis of top picks with fractional vote sharing for ties

Additionally, the tool performs stage transition analysis to evaluate how well each voting method predicts which contestants advance to the next round.

Data Format

Input CSV files should follow this structure:

Contestant,Judge1,Judge2,Judge3,...
Contestant A,25,24,23,...
Contestant B,NA,23,24,...
Contestant C,22,25,NA,...

First column: Contestant names
Subsequent columns: Judge scores (numeric values or NA for missing)
Higher scores are better

Installation

Clone this repository
Install dependencies:

pip install -r requirements.txt

Usage

Basic Usage

The scripts default to analyzing CSV files in ./data/input/. If your data files are in a different location, edit the __main__ section in each script to specify the correct paths.

Then run:

python analyze_voting_results.py
python analyze_normalized_means.py

Programmatic Usage

from analyze_voting_results import load_scores, copeland_scores, irv_fractional_top

# Load data
contestants, judges, scores, df = load_scores("path/to/scores.csv")

# Get Copeland ranking
wins, ties, comps = compute_pairwise(scores, contestants, judges)
copeland_ranking, condorcet = copeland_scores(wins, comps, contestants)

# Get IRV winner
irv_winner, irv_elims = irv_fractional_top(scores, df, contestants, judges)

Voting Methods Explained

1. Copeland/Condorcet Method

Pairwise comparison where each contestant is compared head-to-head with every other contestant:

For each pair (A, B), count how many judges prefer A over B
Copeland score: +1 for each opponent beaten, -1 for each loss
A Condorcet winner beats all other contestants in pairwise comparisons

Use case: Identifies the candidate who would win most head-to-head matchups.

2. IRV (Instant Runoff Voting)

Top-down elimination process:

Each round, judges give 1 vote total to their top-scoring remaining contestant(s)
If multiple contestants tie for top score from one judge, they split that vote fractionally
Contestant with fewest votes is eliminated
Process repeats until one candidate has majority (>50% of votes)
Tie-breaker: lowest total raw score, then alphabetical

Use case: Ensures winner has majority support through successive elimination of least-preferred candidates.

3. Bottom Elimination (Reverse Runoff)

Bottom-up elimination process:

Each round, judges give a "bottom vote" to their lowest-scoring remaining contestant(s)
Contestants tied for lowest from one judge split that bottom vote fractionally
Contestant with most bottom votes is eliminated
Process repeats until one candidate remains
Tie-breaker: lowest total raw score, then alphabetical

Use case: Identifies the candidate who is least often anyone's worst choice.

4. First Past The Post

Top pick analysis:

Each judge gives a vote to their top-scoring contestant(s)
If multiple contestants tie for top score from one judge, they split that vote fractionally
Contestants are ranked by total votes received

Use case: Shows which contestants are most often among judges' top choices.

5. Stage Transition Analysis

For competitions with multiple stages, the tool analyzes how well each voting method predicts advancement:

Compares each method's top-N ranking to the contestants who actually advanced
Calculates disagreement scores: the number of contestants that should have advanced (according to the voting method) but didn't, or vice versa
Adds an "Advanced" column (Y/N) to each voting method table showing which contestants moved to the next stage
Generates summary metrics comparing voting method predictions across all stage transitions

Use case: Evaluates which voting methods best align with the actual advancement decisions.

Project Structure

chopin_voting/
├── analyze_voting_results.py    # Main analysis script
├── data/
│   ├── input/                   # Input CSV files
│   │   ├── Chopin_Stage1_scores_NA.csv
│   │   ├── Chopin_Stage2_scores_NA.csv
│   │   ├── Chopin_Stage3_scores_NA.csv
│   │   └── Chopin_FinalStage_scores_NA.csv
│   └── output/                  # Generated Excel reports (timestamped)
├── requirements.txt             # Python dependencies
├── .gitignore                   # Git ignore rules
└── README.md                    # This file

Output

Running the script generates timestamped Excel workbooks in ./data/output/:

Main Analysis Reports

chopin_voting_analysis_{timestamp}.xlsx - Standard analysis:

Summary Sheet: Overview of all stages with winners from each voting method and stage transition metrics
Individual Stage Sheets (Stage1, Stage2, Stage3, FinalStage): Each sheet contains 5 tables side-by-side:
1. Copeland Ranking - Pairwise scores with Advanced column
2. IRV Results - Full ranking from winner to last eliminated, with votes and Advanced column
3. Bottom Elimination - Full ranking with bottom scores and Advanced column
4. First Past The Post - FPTP scores with Advanced column
5. Judge Summary - Judge scoring statistics (min, max, mean, median, std dev)

chopin_cumulative_voting_analysis_{timestamp}.xlsx - Cumulative analysis including judges from previous stages

Available Functions

Data Loading

load_scores(csv_path): Load and parse CSV data

Pairwise/Condorcet

compute_pairwise(scores, contestants, judges): Compute pairwise comparison matrices
copeland_scores(wins, comps, contestants): Calculate Copeland scores and identify Condorcet winner

Elimination Methods

irv_fractional_top(scores, df, contestants, judges): IRV with fractional vote sharing
bottom_elimination_fractional(scores, df, contestants, judges): Reverse elimination method

First Past The Post Analysis

first_past_the_post_fractional(scores, df, contestants, judges): Fractional top pick counting with vote sharing
first_past_the_post_equal(scores, df, contestants, judges): Equal top pick counting (each tie gets 1 point)

Stage Transition Analysis

analyze_stage_transitions(all_results): Analyze advancement patterns across stages and calculate disagreement metrics for each voting method

Workflow and Reporting

analyze_voting_file(csv_path, candidates_to_inspect): Complete analysis workflow for a single stage
generate_excel_report(results, output_path, stage_transitions): Generate comprehensive Excel report with all analysis

Diagnostics

judge_top_summary(scores, judges): Summary of each judge's scoring patterns (min, max, mean, median, std dev, top score frequency)
fptp_contributions_for_candidate(scores, df, judges, candidate): Detailed per-judge First Past The Post analysis for specific candidate
total_raw_score(scores, df, contestant): Sum of all scores for a contestant

Console Output

The script produces console output for each stage:

Copeland ranking with scores
Condorcet winner identification (if exists)
IRV winner with complete elimination order
Bottom-elimination winner with complete elimination order
First Past The Post scores (fractional method)
Judge top-score patterns with statistical summaries
Summary table across all stages
Report generation confirmation with file paths

Dependencies

pandas >= 2.0.0
numpy >= 1.24.0
openpyxl >= 3.0.0

License

This project is licensed under the MIT License - see the LICENSE file for details.

The MIT License is a permissive open-source license that allows you to:

Use this software freely for any purpose (commercial or non-commercial)
Modify the code to suit your needs
Distribute copies of the original or modified software
Sublicense and incorporate it into your own projects

The only requirement is that you include the original copyright notice and license in any copies or substantial portions of the software.

Notes

All methods handle missing data (NA values) by excluding those judge-contestant pairs from calculations
Tie-breaking is consistent across methods: lower total raw score, then alphabetical order
Fractional vote sharing prevents judges who give ties from having outsized influence
Stage files are automatically sorted chronologically (Stage1 → Stage2 → Stage3 → FinalStage)
Excel reports use timestamped filenames to prevent overwriting previous analyses
The "Advanced" column appears only in stages that have a subsequent stage (not in FinalStage)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Chopin Competition Voting Analysis

Overview

Data Format

Installation

Usage

Basic Usage

Programmatic Usage

Voting Methods Explained

1. Copeland/Condorcet Method

2. IRV (Instant Runoff Voting)

3. Bottom Elimination (Reverse Runoff)

4. First Past The Post

5. Stage Transition Analysis

Project Structure

Output

Main Analysis Reports

Available Functions

Data Loading

Pairwise/Condorcet

Elimination Methods

First Past The Post Analysis

Stage Transition Analysis

Workflow and Reporting

Diagnostics

Console Output

Dependencies

License

Notes

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
data		data
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
analyze_normalized_means.py		analyze_normalized_means.py
analyze_voting_results.py		analyze_voting_results.py
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

Chopin Competition Voting Analysis

Overview

Data Format

Installation

Usage

Basic Usage

Programmatic Usage

Voting Methods Explained

1. Copeland/Condorcet Method

2. IRV (Instant Runoff Voting)

3. Bottom Elimination (Reverse Runoff)

4. First Past The Post

5. Stage Transition Analysis

Project Structure

Output

Main Analysis Reports

Available Functions

Data Loading

Pairwise/Condorcet

Elimination Methods

First Past The Post Analysis

Stage Transition Analysis

Workflow and Reporting

Diagnostics

Console Output

Dependencies

License

Notes

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages