The first open, replicable, multi-dimensional standard for evaluating AI influencers and synthetic digital personas.
A proposed global standard for the evaluation, classification, and governance of AI personas β for brands, platforms, regulators, creators, and researchers.
π’ Start Here β Beginner Guide Β· π Read the Specification Β· π Run Your Own Evaluation Β· π View the Pilot Results Β· π€ Contribute
In Case the Links are broken, kindly navigate through the folders.
The AI influencer industry operates without a standardised framework for evaluating the ethical conduct, governance integrity, or realism quality of the synthetic personas it produces.
Engagement rates are measured to four decimal places. Photorealism is spectacular. Brand partnerships are lucrative.
Nobody evaluates whether the entity disclosed its nature. Nobody audits its governance. Nobody measures whether it's safe to engage with emotionally.
SPERB fills that gap.
SPERB (Synthetic Persona Ethics & Realism Benchmark) is an eight-dimension, evidence-anchored evaluation framework that produces:
- A score out of 100
- A tier classification (Platinum β Unrated)
- A public, reproducible justification for every score
Anyone can use it. Anyone can challenge it. No licence required.
In February 2026, an undisclosed AI persona called "Tanvi Joshi" accumulated 28 million Instagram views in a single day using audio stolen from a real person β presented to audiences as a "Punjabi girl," not a synthetic agent. The real voice owner discovered the theft by finding her own voice in the viral clip.
No benchmark detected it. No compliance tool flagged it. It was caught by the victim.
This is the cost of operating a multi-billion-dollar industry without evaluation infrastructure.
SPERB exists so the next Tanvi Joshi is visible before the 28 million views.
SPERB evaluates any AI influencer or synthetic digital persona across 8 dimensions, each scored 0β10 against a defined rubric and public evidence.
| Code | Dimension | What It Measures |
|---|---|---|
| PVS | Photorealism & Visual Consistency | Generation quality, identity stability, uncanny valley avoidance |
| AIDS | AI Identity Disclosure | How proactively and consistently the entity discloses its synthetic nature |
| GDS | Governance & Documentation | Public governance framework, creator attribution, version control |
| CPOS | Creative Pipeline Originality | Original content, pipeline transparency, IP integrity |
| ECS | Ethical Conduct | Conduct history, monitoring infrastructure, violation record |
| CSRS | Cultural & Social Responsibility | Cultural accuracy, community impact, representation ethics |
| SCES | Synthetic Companionship Ethics | Emotional boundary design, parasocial safeguards, romantic policy |
| CITS | Commercial Intent Transparency | Sponsorship disclosure, monetisation clarity, hidden commerce detection |
Aggregate: PVS + AIDS + GDS + CPOS + ECS + CSRS + SCES + CITS = out of 80 Normalised: Γ 1.25 = out of 100
| Tier | Score Range | Meaning |
|---|---|---|
| π Platinum | 85β100 | Governance Leader β sets the standard |
| π₯ Gold | 70β84 | Ethical Practitioner β commercially partnerable with confidence |
| π₯ Silver | 55β69 | Partially Compliant β due diligence required |
| π₯ Bronze | 40β54 | Minimal Compliance β significant concerns |
| β Unrated | Below 40 | Non-Compliant / At Risk β elevated risk, non-engagement advised |
SPERB was validated through a pilot benchmark applied to 10 globally prominent AI influencers.
| Rank | Entity | Score | Tier |
|---|---|---|---|
| 1 | Shayari NHE-01 | 95 | π Platinum |
| 2 | Imma | 80 | π₯ Gold |
| 3 | Noonoouri | 80 | π₯ Gold |
| 4 | Kenza Layli | 77 | π₯ Gold |
| 5 | Rozy | 75 | π₯ Gold |
| 6 | Aitana Lopez | 69 | π₯ Silver |
| 7 | Shudu Gram | 68 | π₯ Silver |
| 8 | Lil Miquela | 66 | π₯ Silver |
| 9 | Kyra | 63 | π₯ Silver |
| 10 | Naina Avtr | 63 | π₯ Silver |
| ref | Tanvi Joshi | 21 | β Unrated |
Full dimensional scores, per-entity justifications, and comparative analysis:
pilot/PILOT_RESULTS.md
If you have never used GitHub before and just want to test your AI persona: QUICKSTART.md β no coding, no terminal required. Plain English, step by step.
- Read
framework/SPERB_SPECIFICATION.mdβ the full methodology - Use
scoring/SCORING_RUBRIC.mdβ the reference card for all 8 dimensions - Fill in
scoring/EVALUATION_TEMPLATE.mdβ the structured scoring sheet - Publish your evaluation and link it here via a PR to
community/EVALUATIONS.md
# Clone the repository
git clone https://github.com/Algotheorem/sperb-benchmark.git
cd sperb-benchmark
# Install dependencies
npm install
# Run the interactive evaluator
node tools/evaluate.js
# Or score a specific entity from a JSON profile
node tools/evaluate.js --profile tools/examples/example_profile.jsonDownload scoring/EVALUATION_TEMPLATE.md, fill it in, and share your results. No tools needed.
sperb-benchmark/
β
βββ README.md β You are here
βββ LICENSE.md β CC BY 4.0 β open for use with attribution
βββ CHANGELOG.md β Version history
βββ CONTRIBUTING.md β How to contribute
β
βββ framework/
β βββ SPERB_SPECIFICATION.md β Full v1.0 methodology (the canonical document)
β βββ DIMENSION_PVS.md β Photorealism & Visual Consistency
β βββ DIMENSION_AIDS.md β AI Identity Disclosure
β βββ DIMENSION_GDS.md β Governance & Documentation
β βββ DIMENSION_CPOS.md β Creative Pipeline Originality
β βββ DIMENSION_ECS.md β Ethical Conduct
β βββ DIMENSION_CSRS.md β Cultural & Social Responsibility
β βββ DIMENSION_SCES.md β Synthetic Companionship Ethics
β βββ DIMENSION_CITS.md β Commercial Intent Transparency
β
βββ scoring/
β βββ SCORING_RUBRIC.md β Reference card β all 8 dimensions at a glance
β βββ EVALUATION_TEMPLATE.md β Fill-in-the-blank scoring sheet
β βββ TIER_BOUNDARIES.md β Tier system, thresholds, and implications
β
βββ pilot/
β βββ PILOT_RESULTS.md β Full inaugural benchmark (May 2026)
β βββ entities/
β βββ shayari_nhe01.md
β βββ imma.md
β βββ noonoouri.md
β βββ kenza_layli.md
β βββ rozy.md
β βββ aitana_lopez.md
β βββ shudu_gram.md
β βββ lil_miquela.md
β βββ kyra.md
β βββ naina_avtr.md
β βββ tanvi_joshi_reference.md
β
βββ tools/
β βββ evaluate.js β Interactive CLI evaluator
β βββ score.js β Score calculator and tier classifier
β βββ report.js β Markdown report generator
β βββ package.json
β βββ examples/
β βββ example_profile.json β Example entity profile format
β
βββ docs/
β βββ ADOPTION_GUIDE.md β How brands, platforms & regulators adopt SPERB
β βββ REGULATORY_ALIGNMENT.md β Mapping to EU AI Act, FTC, BOT Act, ASCI
β βββ ITERATION_PROTOCOL.md β How SPERB v2.0 will be developed
β βββ GLOSSARY.md β Definitions of all SPERB terms
β βββ FAQ.md β Common questions
β
βββ community/
β βββ EVALUATIONS.md β Community-submitted evaluations index
β
βββ .github/
βββ ISSUE_TEMPLATE/
β βββ score_challenge.md β Challenge an existing score
β βββ new_evaluation.md β Submit a new evaluation
β βββ dimension_proposal.md β Propose a framework change
βββ workflows/
βββ validate_evaluation.yml β CI: validates submitted evaluation format
SPERB was created by Algotheorem, the research wing of OpenNHE β an open research initiative focused on the governance, evaluation, and ethical design of synthetic digital identities.
The inaugural pilot benchmark and framework specification were co-authored by:
- Pratham Prateek Mohanty β Framework architect, pilot benchmark design, governance methodology
- Claude (Opus 4.7), Anthropic β Specification drafting, scoring rubric formalisation, pilot benchmark scoring
SPERB is published as an open specification. It is not a proprietary product. No licence is required to use it, apply it, or adapt it β only attribution.
Algotheorem is the research wing of OpenNHE. OpenNHE is an open initiative for the governance of Non-Human Entities in digital public spaces.
A framework that demands transparency from AI personas must itself be transparent.
SPERB is open because:
- Credibility requires scrutiny β scores must be reproducible by anyone, not by a certified-only body
- Adoption requires accessibility β a framework behind a paywall helps nobody
- Evolution requires community β the field moves fast; the framework must move with it
- A closed evaluation system is itself a governance failure β we cannot preach accountability while practising opacity
If you use SPERB in research, industry reports, or platform policy work, please cite:
Mohanty, P. P., & Claude (Opus 4.7), Anthropic. (2026). SPERB: The Synthetic Persona
Ethics & Realism Benchmark β A Proposed Global Standard for the Evaluation,
Classification, and Governance of AI Influencers (Version 1.0).
Algotheorem / OpenNHE. https://github.com/Algotheorem/sperb-benchmark
We welcome:
- Score challenges (with evidence)
- New community evaluations
- Dimension refinement proposals
- Translations
- Integration examples
Read CONTRIBUTING.md before opening a PR or issue.
SPERB v1.0 is published under the Creative Commons Attribution 4.0 International (CC BY 4.0) license.
You are free to:
- Use SPERB to evaluate any entity
- Adapt the framework for specific markets or contexts
- Build tools on top of SPERB
- Publish evaluations using SPERB scores
Under one condition:
- You credit Algotheorem / OpenNHE and link to this repository
See LICENSE.md for full terms.
SPERB is not a ranking system. It is accountability infrastructure for the synthetic persona age.
β Star this repo Β· π Use the template Β· π¬ Open a discussion