Skip to content
This repository

article quality visualization for english wikipedia

branch: master

new note

latest commit 6cce3f9021
Stephen LaPorte authored January 12, 2013
Octocat-spinner-32 article_html fix jquery, add more CLI arguments. May 07, 2012
Octocat-spinner-32 assets dashboard input in progress cell fills in yellow for retries September 17, 2012
Octocat-spinner-32 inputs added avg depth in dom; get param for input_server; various fixes for… December 09, 2012
Octocat-spinner-32 js Merge branch 'python-port' of github.com:slaporte/qualityvis August 23, 2012
Octocat-spinner-32 orange_reports first pass on metamars. 80+% accuracy on round-tripping article class… October 17, 2012
Octocat-spinner-32 orange_schemas new version of MetaMars December 09, 2012
Octocat-spinner-32 orange_scripts fix clean_missing_data January 06, 2013
Octocat-spinner-32 results added outputting, updated example report September 09, 2012
Octocat-spinner-32 views Merge branch 'master' of github.com:slaporte/qualityvis September 18, 2012
Octocat-spinner-32 .gitignore new orange schemas December 08, 2012
Octocat-spinner-32 .json_exporter.py.swp resolved merge conflicts; classifier experiments October 21, 2012
Octocat-spinner-32 INTERESTING.md new note January 12, 2013
Octocat-spinner-32 LICENSE merge in stephen's new domStats and bing web stats April 08, 2012
Octocat-spinner-32 NOTES.md add a notes file with some CLI conversion notes January 05, 2013
Octocat-spinner-32 TODO.txt mean average error of around 6% :D November 12, 2012
Octocat-spinner-32 all_revisions.py improved toolserver status api and added a few relative dom stats October 01, 2012
Octocat-spinner-32 bottle_compressor.py improve console progress output August 25, 2012
Octocat-spinner-32 dashboard.py update exporter to remove tablib dependency, tweak dashboard/loupe December 08, 2012
Octocat-spinner-32 export_settings.py added toolserver metadata to the dashboard and a simple export list September 16, 2012
Octocat-spinner-32 exporter.py new version of MetaMars December 09, 2012
Octocat-spinner-32 input_server.py added avg depth in dom; get param for input_server; various fixes for… December 09, 2012
Octocat-spinner-32 loupe.py update exporter to remove tablib dependency, tweak dashboard/loupe December 08, 2012
Octocat-spinner-32 loupe_list.py cli to create list of article titles January 12, 2013
Octocat-spinner-32 one_row.tmp.txt resolved merge conflicts; classifier experiments October 21, 2012
Octocat-spinner-32 progress.py python is happening August 19, 2012
Octocat-spinner-32 readme.markdown merge in stephen's new domStats and bing web stats April 08, 2012
Octocat-spinner-32 requirements.txt wrapping up requirements fixes December 08, 2012
Octocat-spinner-32 screen.png screenshot in markdown January 24, 2012
Octocat-spinner-32 stats.py add a couple more dom things, fix prefixing, update json_exporter, wh… September 15, 2012
Octocat-spinner-32 wapiti.py consolidated article_history and assessment inputs; average assessmen… September 18, 2012
readme.markdown

Article Quality Visualization

This is a javascript gadget for visualizing article quality in the English Wikipdia. We built this gadget for the San Francisco Mediawiki Hackathon in 2012.

Demo screenshot

Instillation and use

To use on the English Wikipedia, copy the contents of gadget.js into your common.js, and copy the contents of style.css into your common.css. For more detailed instructions on installing userscripts, see here.

Methodology

We used a number of metrics aggregated into four areas of quality: richness, structure, integratedness, community, citations, and significance. Each metric can be individually weighted, although they have not all been incorporated into the ranking formula.

Metrics

  • Assessment grade (featured article, good article, etc.)
  • Last 50 edits, grouped by editor
  • Category count
  • External links in the "External links" section
  • External links anywhere in the article
  • External links section count
  • Feedback score for completeness
  • Feedback score for objectivity
  • Feedback score for trustworthiness
  • Number of Google News results
  • Number of Google Web results
  • Image count
  • Incoming link count
  • Internal link count
  • Intro paragraph count
  • Page visits by date in the last month (according to http://stats.grok.se/)
  • Number of inline tags for POV statements
  • Number of inline tags for statements needing citation
  • Reference count
  • Reference section count
  • Likelihood the last revision was vandalism (according to http://www.wikitrust.net/)

About

This tool was built by:

  • Ben Plowman
  • Mahmoud Hashemi
  • Sarah Nahm
  • Stephen LaPorte

...with help and food from the awesome hosts of the hackathon!

Copyright 2012, licensed under the GPL 3.0. See LICENSE.

Something went wrong with that request. Please try again.