diff --git a/markdown_generator/publications.ipynb b/markdown_generator/publications.ipynb index 7360fc86bd291..8657e1009b7b6 100644 --- a/markdown_generator/publications.ipynb +++ b/markdown_generator/publications.ipynb @@ -9,7 +9,7 @@ "source": [ "# Publications markdown generator for academicpages\n", "\n", - "Takes a TSV of publications with metadata and converts them for use with [academicpages.github.io](academicpages.github.io). This is an interactive Jupyter notebook ([see more info here](http://jupyter-notebook-beginner-guide.readthedocs.io/en/latest/what_is_jupyter.html)). The core python code is also in `publications.py`. Run either from the `markdown_generator` folder after replacing `publications.tsv` with one that fits your format.\n", + "Takes a TSV of publications with metadata and converts them for use with [academicpages.github.io](academicpages.github.io). This is an interactive Jupyter notebook ([see more info here](http://jupyter-notebook-beginner-guide.readthedocs.io/en/latest/what_is_jupyter.html)). The core python code is also in `publications.py`. Run either from the `markdown_generator` folder after replacing `publications.tsv` with one containing your data.\n", "\n", "TODO: Make this work with BibTex and other databases of citations, rather than Stuart's non-standard TSV format and citation style.\n" ] @@ -24,7 +24,9 @@ "\n", "- `excerpt` and `paper_url` can be blank, but the others must have values. \n", "- `pub_date` must be formatted as YYYY-MM-DD.\n", - "- `url_slug` will be the descriptive part of the .md file and the permalink URL for the page about the paper. The .md file will be `YYYY-MM-DD-[url_slug].md` and the permalink will be `https://[yourdomain]/publications/YYYY-MM-DD-[url_slug]`" + "- `url_slug` will be the descriptive part of the .md file and the permalink URL for the page about the paper. The .md file will be `YYYY-MM-DD-[url_slug].md` and the permalink will be `https://[yourdomain]/publications/YYYY-MM-DD-[url_slug]`\n", + "\n", + "This is how the raw file looks (it doesn't look pretty, use a spreadsheet or other program to edit and create)." ] }, { @@ -211,7 +213,7 @@ "source": [ "## Creating the markdown files\n", "\n", - "This is where the heavy lifting is done. This loops through all the rows in the TSV dataframe, then starts to concatentate a big string (```md```) that contains the markdown for each type. It does the YAML metadata first, then does the description for the individual page. If you don't want something to appear (like the \"Recommended citation\")" + "This is where the heavy lifting is done. This loops through all the rows in the TSV dataframe, then starts to concatentate a big string (```md```) that contains the markdown for each type. It does the YAML metadata first, then does the description for the individual page." ] }, { @@ -254,12 +256,12 @@ " md += \"\\n---\"\n", " \n", " ## Markdown description for individual page\n", - " \n", - " if len(str(item.paper_url)) > 5:\n", - " md += \"\\n\\nDownload paper here\\n\" \n", " \n", " if len(str(item.excerpt)) > 5:\n", " md += \"\\n\" + html_escape(item.excerpt) + \"\\n\"\n", + " \n", + " if len(str(item.paper_url)) > 5:\n", + " md += \"\\n[Download paper here](\" + item.paper_url + \")\\n\" \n", " \n", " md += \"\\nRecommended citation: \" + item.citation\n", " \n", @@ -269,6 +271,13 @@ " f.write(md)" ] }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "These files are in the publications directory, one directory below where we're working from." + ] + }, { "cell_type": "code", "execution_count": 6, @@ -314,11 +323,10 @@ "paperurl: 'http://academicpages.github.io/files/paper1.pdf'\r\n", "citation: 'Your Name, You. (2009). "Paper Title Number 1." Journal 1. 1(1).'\r\n", "---\r\n", - "\r\n", - "Download paper here\r\n", - "\r\n", "This paper is about the number 1. The number 2 is left for future work.\r\n", "\r\n", + "[Download paper here](http://academicpages.github.io/files/paper1.pdf)\r\n", + "\r\n", "Recommended citation: Your Name, You. (2009). \"Paper Title Number 1.\" Journal 1. 1(1)." ] } diff --git a/markdown_generator/publications.tsv b/markdown_generator/publications.tsv index 6c8691459b244..49e79632b0b24 100644 --- a/markdown_generator/publications.tsv +++ b/markdown_generator/publications.tsv @@ -1,20 +1,4 @@ -pub_date venue pub_details authors title citation url slug summary description -2016-03-21 Information, Communication, and Society 19(6) R. Stuart Geiger Bot-based collective blocklists in Twitter: the counterpublic moderation of harassment in a networked public space Geiger, R. Stuart. (2016). “Bot-based collective blocklists in Twitter: the counterpublic moderation of harassment in a networked public space.” Information, Communication, and Society 19(6). http://stuartgeiger.com/blockbots-ics.pdf http://stuartgeiger.com/blockbots-ics.pdf blockbots-ics This article introduces and discusses bot-based collective blocklists (or blockbots) in Twitter, which have been developed by volunteers to combat harassment in the social networking site. This article introduces and discusses bot-based collective blocklists (or blockbots) in Twitter, which have been developed by volunteers to combat harassment in the social networking site. Blockbots support the curation of a shared blocklist of accounts, where subscribers to a blockbot will not receive any notifications or messages from accounts on the blocklist. Blockbots support counterpublic communities, helping people moderate their own experiences of a site. This article provides an introduction and overview of blockbots and the issues that they raise about networked publics and platform governance, extending an intersecting literature on online harassment, platform governance, and the politics of algorithms. -2014-09-10 Journal of Broadcasting and Electronic Media 58(3) R. Stuart Geiger, Airi Lampinen Old Against New, or a Coming of Age? Broadcasting in an Era of Electronic Media. Geiger, R. Stuart and Lampinen, Airi. (2014). “Old Against New, or a Coming of Age? Broadcasting in an Era of Electronic Media.” Journal of Broadcasting and Electronic Media 58(3). http://www.stuartgeiger.com/jobem.pdf http://www.stuartgeiger.com/jobem.pdf jobem-old-against-new On the history and continued relevance of the term "broadcasting" in an era of social media. “Broadcasting” is often cast as an outdated term—we are constantly told that we are in the midst of a digital/social media revolution that will make the unidirectional, mass communication model obsolete. In response, we argue that to consider the continued relevance of terms like “broadcasting” in an era of electronic media is to neither hastily disregard the legacy of these terms, nor cling to them too rigidly. In this special issue of the Journal of Broadcasting and Electronic Media written and edited by graduate students, we begin a new thread in the longstanding conversation about what it means for media to be “old” and “new.” -2014-11-02 Proceedings of HCOMP, Citizen-X Workshop Nathan Matias, R. Stuart Geiger Defining, Designing, and Evaluating Civic Values in Human Computation and Collective Action Systems Matias, N. and Geiger, R.S. “Defining, Designing, and Evaluating Civic Values in Human Computation and Collective Action Systems.” In Proceedings of HCOMP 2014, Citizen-X Workshop. http://stuartgeiger.com/defining-civic-values-hcomp-matias-geiger.pdf. http://stuartgeiger.com/defining-civic-values-hcomp-matias-geiger.pdf hcomp-values-in-crowdsourcing We review various crowdsourcing and collective action systems, identifying particular sets of civic values and assumptions. Collective action is often described in terms of the relationships, learning, principled processes, and community capacities it fosters. Despite this, human computation and collective action systems are often designed and evaluated with system outputs in mind: the quality of answers, the number of votes, the accuracy of content created. In this proposal, we review literature on the design values of “citizen-x” systems, put forward a series of models for describing the civic values in “citizen-x”, and classify systems by those models. -2014-05-01 Proceedings of CHI Aaron Halfaker, R. Stuart Geiger, Loren Terveen Snuggle: Designing for efficient socialization and ideological critique Halfaker, Aaron., Geiger, R. Stuart., and Treveen, Loren. (2014). “Snuggle: Designing for Efficient Socialization and Ideological Critique.” In Proceedings of the 2014 ACM Conference on Human Factors in Computing (CHI 2014). http://www-users.cs.umn.edu/~halfak/publications/Snuggle/halfaker14snuggle-personal.pdf http://www-users.cs.umn.edu/~halfak/publications/Snuggle/halfaker14snuggle-personal.pdf chi-snuggle-wikipedia This paper discusses the Snuggle project, built to support newcomer socialization and reflexive critique of Wikipedia's existing socialization processes. We worked with a coalition of Wikipedians to design, develop, and deploy Snuggle, a new user interface that served two critical functions: making the work of newcomer socialization more effective, and bringing visibility to instances in which Wikipedians’ current practice of gatekeeping socialization breaks down. Snuggle supports positive socialization by helping mentors quickly find newcomers whose good-faith mistakes were reverted as damage. Snuggle also supports ideological critique and reflection by bringing visibility to the consequences of viewing newcomers through a lens of suspiciousness. -2014-01-03 Information, Communication, and Society 17 R. Stuart Geiger Bots, bespoke code, and the materiality of software platforms Geiger, R. Stuart. (2014). “Bots, Bespoke Code, and the Materiality of Software Platforms.” Information, Communication, and Society 17. http://stuartgeiger.com/bespoke-code-ics.pdf http://stuartgeiger.com/bespoke-code-ics.pdf ics-bots-bespoke-code This article introduces and discusses the role of bespoke code in Wikipedia, which is code that runs alongside a platform or system, rather than being integrated into server-side codebases. This article introduces and discusses the role of bespoke code in Wikipedia, which is code that runs alongside a platform or system, rather than being integrated into server-side codebases by individuals with privileged access to the server. Instead of taking for granted the pre-existing stability of Wikipedia as a platform, bots and other bespoke code require that we examine not only the software code itself, but also the concrete, historically contingent material conditions under which this code is run. -2013-09-01 Ecology and Society 18(3) Michele Romolini, Sydne Record, Rebecca Garvoille, Y. Marusenko, R. Stuart Geiger The Next Generation of Scientists: Examining the Experiences of Graduate Students in Network-Level Social-Ecological Science Romolini, Michele., Sydne Record, Rebecca. Garvoille, Y. Marusenko, and R. Stuart Geiger. (2013) “The Next Generation of Scientists: Examining the Experiences of Graduate Students in Network-Level Science.” In Ecology and Society 18(3). http://stuartgeiger.com/lter-network-level-science-es.pdf http://stuartgeiger.com/lter-network-level-science-es.pdf ecosoc-lter-students We examined how graduate students experienced and social-ecological research initiative within the large-scale, geographically distributed Long Term Ecological Research (LTER) Network. In the pursuit to confront pressing environmental issues such as climate change, many scientists, practitioners, policy makers, and institutions are promoting large-scale ‘network-level’ scientific research that integrates the social and ecological sciences. To understand how this scientific trend is unfolding among rising scientists, we examined how graduate students experienced one such emergent social-ecological research initiative within the large-scale, geographically distributed Long Term Ecological Research (LTER) Network. -2013-08-03 Proceedings of WikiSym R. Stuart Geiger, Aaron Halfaker When the Levee Breaks: Without Bots, What Happens to Wikipedia’s Quality Control Processes? Geiger, R. Stuart and Halfaker, Aaron. (2013). “When the Levee Breaks: Without Bots, What Happens to Wikipedia’s Quality Control Processes?” In Proceedings of the 9th International Symposium on Wikis and Open Collaboration (WikiSym 2013). http://stuartgeiger.com/wikisym13-cluebot.pdf http://stuartgeiger.com/wikisym13-cluebot.pdf wikisym-levee-breaks-bots This paper examines what happened when one of Wikipedia's counter-vandalism bots unexpectedly went offline. In the first half of 2011, ClueBot NG – one of the most prolific counter-vandalism bots in the English-language Wikipedia – went down for four distinct periods, each period of downtime lasting from days to weeks. In this paper, we use these periods of breakdown as naturalistic experiments to study Wikipedia’s heterogeneous quality control network. Our analysis showed that the overall time-to-revert edits was almost doubled when this software agent was down. Yet while a significantly fewer proportion of edits made during the bot’s downtime were reverted, we found that those edits were later eventually reverted. This suggests that other agents in Wikipedia took over this quality control work, but performed it at a far slower rate. -2013-02-23 Proceedings of CSCW R. Stuart Geiger, Aaron Halfaker Using Edit Sessions to Measure Participation in Wikipedia Geiger, R. Stuart and Halfaker, Aaron. (2013). “Using Edit Sessions to Measure Participation in Wikipedia.” In Proceedings of the 2013 ACM Conference on Computer Supported Cooperative Work (CSCW 2013). http://www.stuartgeiger.com/cscw-sessions.pdf http://www.stuartgeiger.com/cscw-sessions.pdf cscw-edit-sessions This paper establishes a quantitative metric for measuring editor activity through temporal edit sessions. Many quantitative, log-based studies of participation and contribution in CSCW and CMC systems measure the activity of users in terms of output, based on metrics like posts to forums, edits to Wikipedia articles, or commits to code repositories. In this paper, we estimate the amount of time users have spent contributing. Through an analysis of Wikipedia log data, we identify a pattern of punctuated bursts in editors’ activity that we refer to as edit sessions. Based on these edit sessions, we build a metric that approximates the labor hours of editors in the encyclopedia. Using this metric, we first compare labor-based analyses with output-based analyses, finding that the activity of many editors can appear quite differently based on the kind of metric used. -2012-11-05 Information and Organization 23:1-14 David Ribes, Steve Jackson, R. Stuart Geiger, Matt C. Burton, Tom Finholt Artifacts that Organize: Delegation in the Distributed Organization Ribes, David, Steve Jackson, R. Stuart Geiger, Matt C. Burton, and Tom Finholt (2012). “Artifacts that organize: Delegation in the distributed organization.” Information and Organization 23:1–14. http://www.stuartgeiger.com/artifacts-that-organize.pdf http://www.stuartgeiger.com/artifacts-that-organize.pdf infoorg-artifacts-that-organize This paper studies the role of computational infrastructure and organizational structure in the Open Science Grid. Increasingly, organizations are deploying automated modes of technology-supported coordination that seek to replace rather than enhance human communication. To study this phenomenon, we extend Bruno Latour’s concept of delegation and apply it to thorny questions around the work of sustaining organization over space and time. As we show with two cases from the Open Science Grid, delegation is complex, fragile, and central to the nature of contemporary organizing. Specifically, we argue that delegation: 1) reconfigures the organization of work; 2) transforms how outcomes are accomplished; 3) redistributes responsibility for organizational decision-making; and 4) shifts the visibility and invisibility of both actors and their work. -2013-05-01 American Behavioral Scientist 57(5) Aaron Halfaker, R. Stuart Geiger, Jonathan Morgan, John Riedl The Rise and Decline of an Open Collaboration Community: How Wikipedia’s reaction to sudden popularity is causing its decline Halfaker, Aaron., R. Stuart Geiger, Jonathan Morgan, and John Riedl. (2013). “The Rise and Decline of an Open Collaboration System: How Wikipedia’s reaction to sudden popularity is killing it.” American Behavioral Scientist 57(5). http://dx.doi.org/10.1177/0002764212469365 http://www-users.cs.umn.edu/~halfak/publications/The_Rise_and_Decline/halfaker13rise-preprint.pdf abs-rise-and-decline-wikipedia A mixed-method, multi-study analysis of editor retention, socialization, gatekeeping, and governance in Wikipedia. This paper presents evidence that several changes that the Wikipedia community made to manage quality and consistency in the face of a massive growth in participation have lead to a more restrictive environment for newcomers. Specifically, the restrictiveness of the encyclopedia’s primary quality control mechanism and the algorithmic tools used to reject contributions is implicated as a cause of decreased newcomer retention. Also, the community’s formal mechanisms for norm articulation is shown to have calcified against changes — especially for newcomers. -2012-06-05 Proceedings of ICWSM R. Stuart Geiger, Aaron Halfaker, Maryana Pinchuk, Steven Walling Defense Mechanism or Socialization Tactic? Improving Wikipedia’s Notifications to Rejected Contributors Geiger, R. Stuart, Aaron Halfaker, Maryana Pinchuk, and Steven Walling (2012). “Defense Mechanism or Socialization Tactic? Improving Wikipedia’s Notifications to Rejected Contributors.” In Proceedings of the 2012 International Conference on Weblogs and Social Media (ICWSM 2012). http://stuartgeiger.com/defense-mechanism-icwsm.pdf http://stuartgeiger.com/defense-mechanism-icwsm.pdf icwsm-socialization-wikipedia A descriptive study of Wikipedia's highly-automated socialization processes and an A/B test to improve templated messages to newcomers. In this paper, we first illustrate and describe the various defense mechanisms at work in Wikipedia, which we hypothesize are inhibiting newcomer retention. Next, we present results from an experiment aimed at increasing both the quantity and quality of editors by altering various elements of these defense mechanisms, specifically pre-scripted warnings and notifications that are sent to new editors upon reverting or rejecting contributions. Using regression models of new user activity, we show which tactics work best for different populations of users based on their motivations when joining Wikipedia. -2012-08-29 Proceedings of WikiSym Heather Ford, R. Stuart Geiger “Writing up rather than writing down”: Becoming Wikipedia Literate Ford, Heather and R. Stuart Geiger. (2012). “”Writing up rather than writing down”: Becoming Wikipedia Literate.” In Proceedings of the 8th International Symposium on Wikis and Open Collaboration (WikiSym 2012). New York: ACM Digital Library. http://www.stuartgeiger.com/becoming-wikipedia-literate.pdf http://www.stuartgeiger.com/writing-up-wikisym.pdf wikisym-wikipedia-literate We introduce and advocate a multi-faceted theory of literacy to investigate the knowledges and organizational forms are required to improve participation in Wikipedia’s communities. We introduce and advocate a multi-faceted theory of literacy to investigate the knowledges and organizational forms are required to improve participation in Wikipedia’s communities. We outline what Richard Darville refers to as the “background knowledges” required to be an empowered, literate member and apply this to the Wikipedia community. Using a series of examples drawn from interviews with new editors and qualitative studies of controversies in Wikipedia, we identify and outline several different literacy asymmetries. -2012-05-02 Proceedings of CHI (alt.CHI) R. Stuart Geiger, Yoon J. Jeong, Emily Manders Black-boxing the user: internet protocol over xylophone players (IPoXP) Geiger, R. Stuart, Yoon J. Jeong, and Emily Manders (2012). “Black-Boxing the User: Internet Protocol over Xylophone Players.” In Proceedings of the 2012 ACM Conference on Human-Computer Interaction (alt.CHI 2012). New York: ACM Digital Library. http://stuartgeiger.com/ipoxp.pdf http://stuartgeiger.com/ipoxp.pdf altchi-ipoxp We introduce IP over Xylophone Players (IPoXP), a novel Internet protocol between two computers using xylophone-based Arduino interfaces We introduce IP over Xylophone Players (IPoXP), a novel Internet protocol between two computers using xylophone-based Arduino interfaces. In our implementation, human operators are situated within the lowest layer of the network, transmitting data between computers by striking designated keys. We discuss how IPoXP inverts the traditional mode of human-computer interaction, with a computer using the human as an interface to communicate with another computer. -2011-10-05 Proceedings of WikiSym R. Stuart Geiger and Heather Ford Participation in Wikipedia’s Article Deletion Processes Geiger, R. Stuart and Heather Ford. (2011) “Participation in Wikipedia’s Deletion Processes.” In Proceedings of the 7th International Symposium on Wikis and Open Collaboration (WikiSym 2011). New York: ACM Digital Library. http://www.stuartgeiger.com/papers/article-deletion-wikisym-geiger-ford.pdf http://www.stuartgeiger.com/papers/article-deletion-wikisym-geiger-ford.pdf wikisym-article-deletion This paper investigates Wikipedia's article deletion processes, finding that it is heavily populated by specialists. We find that Wikipedia’s deletion process is heavily frequented by a relatively small number of longstanding users. The vast majority of such deleted articles are not spam, vandalism, or “patent nonsense,” but rather articles which could be considered encyclopedic, but do not fit the project‟s standards. -2011-11-01 Wikipedia: A Critical Point of View ed. G. Loving and N. Tkacz R. Stuart Geiger The Lives of Bots Geiger, R. Stuart. (2011). “The Lives of Bots.” In G. Lovink and N. Tkacz (eds.) In Wikipedia: A Critical Point of View. Amsterdam: Institute of Network Cultures. http://www.stuartgeiger.com/lives-of-bots-wikipedia-cpov.pdf http://www.stuartgeiger.com/papers/lives-of-bots-wikipedia-cpov.pdf cpov-lives-of-bots I describe the complex social and technical environment in which bots exist in Wikipedia, emphasizing not only how bots produce order and enforce rules, but also how humans produce bots and negotiate rules around their operation. I describe the complex social and technical environment in which bots exist in Wikipedia, emphasizing not only how bots produce order and enforce rules, but also how humans produce bots and negotiate rules around their operation. -2011-01-03 Proceedings of HICSS R. Stuart Geiger, David Ribes Trace Ethnography: Following Coordination through Documentary Practices Geiger, R. Stuart and David Ribes (2011). “Trace Ethnography: Following Coordination through Documentary Practices.” In Proceedings of the 44th Annual Hawaii International Conference on System Sciences (HICSS). http://www.stuartgeiger.com/trace-ethnography-hicss-geiger-ribes.pdf http://www.stuartgeiger.com/trace-ethnography-hicss-geiger-ribes.pdf hicss-trace-ethnography We detail the methodology of ‘trace ethnography’, which combines the richness of participant-observation with the wealth of data in logs so as to reconstruct patterns and practices of users in distributed sociotechnical systems We detail the methodology of ‘trace ethnography’, which combines the richness of participant-observation with the wealth of data in logs so as to reconstruct patterns and practices of users in distributed sociotechnical systems -2010-02-25 Proceedings of CSCW R. Stuart Geiger, David Ribes The Work of Sustaining Order in Wikipedia: The Banning of a Vandal Geiger, R. Stuart and David Ribes (2010). “The Work of Sustaining Order in Wikipedia: The Banning of a Vandal.” In Proceedings of the 2010 ACM Conference on Computer-Supported Cooperative Work (CSCW 2012). New York: ACM Digital Library. http://www.stuartgeiger.com/papers/cscw-sustaining-order-wikipedia.pdf http://www.stuartgeiger.com/papers/cscw-sustaining-order-wikipedia.pdf cscw-banning-vandal This paper traces out a heterogeneous network of humans and non-humans involved in the identification and banning of a single vandal in Wikipedia. This paper traces out a heterogeneous network of humans and non-humans involved in the identification and banning of a single vandal in Wikipedia. -2009-10-01 Gnovis R. Stuart Geiger Does Habermas Understand the Internet? The Algorithmic Construction of the Blogo/Public Sphere Geiger, R. Stuart (2009). “Does Habermas Understand the Internet? The Algorithmic Construction of the Blogo/Public Sphere.” Gnovis: A Journal of Communication, Culture, and Technology. 10(1). http://www.stuartgeiger.com/papers/gnovis-habermas-blogopublic-sphere.pdf http://www.stuartgeiger.com/papers/gnovis-habermas-blogopublic-sphere.pdf gnovis-habermas-understand-internet Habermasians have been debating about the role of the Internet in the public sphere, but they have all taken for granted the highly-automated software infrastructures that mediate our knowledge of the blogosphere. Habermasians have been debating about the role of the Internet in the public sphere, but they have all taken for granted the highly-automated software infrastructures that mediate our knowledge of the blogosphere. -2009-10-27 Proceedings of Wikisym R. Stuart Geiger The Social Roles of Bots and Assisted Editing Tools Geiger, R. Stuart (2009). “The Social Roles of Bots and Assisted Editing Tools.” In Proceedings of the 5th International Symposium on Wikis and Open Collaboration. New York: ACM Digital Library. http://www.stuartgeiger.com/papers/geiger-wikisym-bots.pdf http://www.stuartgeiger.com/papers/geiger-wikisym-bots.pdf wikisym-social-roles-bots A short paper showing the recent explosive growth of automated editors (or bots) in Wikipedia, which have taken on many new tasks in administrative spaces. A short paper showing the recent explosive growth of automated editors (or bots) in Wikipedia, which have taken on many new tasks in administrative spaces. +pub_date title venue excerpt citation url_slug paper_url +2009-10-01 Paper Title Number 1 Journal 1 This paper is about the number 1. The number 2 is left for future work. Your Name, You. (2009). "Paper Title Number 1." Journal 1. 1(1). paper-title-number-1 http://academicpages.github.io/files/paper1.pdf +2010-10-01 Paper Title Number 2 Journal 1 This paper is about the number 2. The number 3 is left for future work. Your Name, You. (2010). "Paper Title Number 2." Journal 1. 1(2). paper-title-number-2 http://academicpages.github.io/files/paper2.pdf +2015-10-01 Paper Title Number 3 Journal 1 This paper is about the number 3. The number 4 is left for future work. Your Name, You. (2015). "Paper Title Number 3." Journal 1. 1(3). paper-title-number-3 http://academicpages.github.io/files/paper3.pdf \ No newline at end of file diff --git a/markdown_generator/talks.ipynb b/markdown_generator/talks.ipynb index 5fe2d2ea33916..f7e2347d34f2f 100644 --- a/markdown_generator/talks.ipynb +++ b/markdown_generator/talks.ipynb @@ -2,17 +2,52 @@ "cells": [ { "cell_type": "markdown", - "metadata": {}, + "metadata": { + "deletable": true, + "editable": true + }, "source": [ - "# Talks markdown generator\n", - "Takes a TSV of talks with metadata and converts them for use with academicpages.github.io.\n", + "# Talks markdown generator for academicpages\n", + "\n", + "Takes a TSV of talks with metadata and converts them for use with [academicpages.github.io](academicpages.github.io). This is an interactive Jupyter notebook ([see more info here](http://jupyter-notebook-beginner-guide.readthedocs.io/en/latest/what_is_jupyter.html)). The core python code is also in `talks.py`. Run either from the `markdown_generator` folder after replacing `talks.tsv` with one containing your data.\n", "\n", - "TODO: Make this work with BibTex or other databases of talks, rather than Stuart's TSV format.\n" + "TODO: Make this work with BibTex and other databases, rather than Stuart's non-standard TSV format and citation style." ] }, { "cell_type": "code", "execution_count": 1, + "metadata": { + "collapsed": false, + "deletable": true, + "editable": true + }, + "outputs": [], + "source": [ + "import pandas as pd\n", + "import os" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## Data format\n", + "\n", + "The TSV needs to have the following columns: title, type, url_slug, venue, date, location, talk_url, description, with a header at the top. Many of these fields can be blank, but the columns must be in the TSV.\n", + "\n", + "- Fields that cannot be blank: `title`, `url_slug`, `date`. All else can be blank. `type` defaults to \"Talk\" \n", + "- `date` must be formatted as YYYY-MM-DD.\n", + "- `url_slug` will be the descriptive part of the .md file and the permalink URL for the page about the paper. \n", + " - The .md file will be `YYYY-MM-DD-[url_slug].md` and the permalink will be `https://[yourdomain]/talks/YYYY-MM-DD-[url_slug]`\n", + " - The combination of `url_slug` and `date` must be unique, as it will be the basis for your filenames\n", + "\n", + "This is how the raw file looks (it doesn't look pretty, use a spreadsheet or other program to edit and create)." + ] + }, + { + "cell_type": "code", + "execution_count": 2, "metadata": { "collapsed": false }, @@ -21,29 +56,36 @@ "name": "stdout", "output_type": "stream", "text": [ - "Requirement already up-to-date: getorg in /srv/paws/lib/python3.4/site-packages\n", - "Requirement already up-to-date: geopy in /srv/paws/lib/python3.4/site-packages (from getorg)\n", - "Requirement already up-to-date: retrying in /srv/paws/lib/python3.4/site-packages (from getorg)\n", - "Requirement already up-to-date: pygithub in /srv/paws/lib/python3.4/site-packages (from getorg)\n", - "Requirement already up-to-date: six>=1.7.0 in /srv/paws/lib/python3.4/site-packages (from retrying->getorg)\n", - "Requirement already up-to-date: python-jose in /srv/paws/lib/python3.4/site-packages (from pygithub->getorg)\n", - "Requirement already up-to-date: pycrypto<2.7.0,>=2.6.0 in /srv/paws/lib/python3.4/site-packages (from python-jose->pygithub->getorg)\n", - "Requirement already up-to-date: ecdsa<1.0 in /srv/paws/lib/python3.4/site-packages (from python-jose->pygithub->getorg)\n", - "Requirement already up-to-date: future<1.0 in /srv/paws/lib/python3.4/site-packages (from python-jose->pygithub->getorg)\n" + "title\ttype\turl_slug\tvenue\tdate\tlocation\ttalk_url\tdescription\r\n", + "Talk 1 on Relevant Topic in Your Field\tTalk\ttalk-1\tUC San Francisco, Department of Testing\t2012-03-01\tSan Francisco, California\t\tThis is a description of your talk, which is a markdown files that can be all markdown-ified like any other post. Yay markdown!\r\n", + "Tutorial 1 on Relevant Topic in Your Field\tTutorial\ttutorial-1\tUC-Berkeley Institute for Testing Science\t2013-03-01\tBerkeley CA, USA\thttp://exampleurl.com\tThis is a description of your tutorial, note the different field in type. This is a markdown files that can be all markdown-ified like any other post. Yay markdown!\r\n", + "Talk 2 on Relevant Topic in Your Field\tTalk\ttalk-2\tLondon School of Testing\t2014-02-01\tLondon, UK\thttp://example2.com\tThis is a description of your talk, which is a markdown files that can be all markdown-ified like any other post. Yay markdown!\r\n", + "Conference Proceeding talk 3 on Relevant Topic in Your Field\tConference proceedings talk\ttalk-3\tTesting Institute of America 2014 Annual Conference\t2014-03-01\tLos Angeles, CA\t\tThis is a description of your conference proceedings talk, note the different field in type. You can put anything in this field." ] } ], "source": [ - "!pip install getorg --upgrade\n", - "import pandas as pd\n", - "import os" + "!cat talks.tsv" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## Import TSV\n", + "\n", + "Pandas makes this easy with the read_csv function. We are using a TSV, so we specify the separator as a tab, or `\\t`.\n", + "\n", + "I found it important to put this data in a tab-separated values format, because there are a lot of commas in this kind of data and comma-separated values can get messed up. However, you can modify the import statement, as pandas also has read_excel(), read_json(), and others." ] }, { "cell_type": "code", - "execution_count": 2, + "execution_count": 3, "metadata": { - "collapsed": false + "collapsed": false, + "deletable": true, + "editable": true }, "outputs": [ { @@ -54,1389 +96,98 @@ " \n", " \n", " \n", - " date\n", - " talk_type\n", " title\n", + " type\n", + " url_slug\n", " venue\n", - " institution\n", - " geoloc\n", - " url\n", - " slug\n", - " summary\n", + " date\n", + " location\n", + " talk_url\n", " description\n", " \n", " \n", " \n", " \n", " 0\n", - " 2008-03-01\n", + " Talk 1 on Relevant Topic in Your Field\n", " Talk\n", - " A Communicative Ethnography of Argumentative S...\n", - " Exploring New Media Worlds\n", - " Texas A&M University\n", - " College Station, TX\n", - " NaN\n", - " tamu-communicative-ethnography\n", - " NaN\n", + " talk-1\n", + " UC San Francisco, Department of Testing\n", + " 2012-03-01\n", + " San Francisco, California\n", " NaN\n", + " This is a description of your talk, which is a...\n", " \n", " \n", " 1\n", - " 2008-07-19\n", - " Talk\n", - " Conceptions and Misconceptions Academics Hold ...\n", - " Annual Wikimedia Conference (Wikimania)\n", - " Wikimedia Foundation\n", - " Alexandria, Egypt\n", - " NaN\n", - " wikimania-academics-wikipedia\n", - " NaN\n", - " NaN\n", + " Tutorial 1 on Relevant Topic in Your Field\n", + " Tutorial\n", + " tutorial-1\n", + " UC-Berkeley Institute for Testing Science\n", + " 2013-03-01\n", + " Berkeley CA, USA\n", + " http://exampleurl.com\n", + " This is a description of your tutorial, note t...\n", " \n", " \n", " 2\n", - " 2009-03-28\n", + " Talk 2 on Relevant Topic in Your Field\n", " Talk\n", - " Working With/in Wikipedia: Infrastructures of ...\n", - " Annual Conference on Science and Technology in...\n", - " AAAS\n", - " Wasthington, Dc\n", - " NaN\n", - " aaas-wikipedia-infrastructures\n", - " NaN\n", - " NaN\n", + " talk-2\n", + " London School of Testing\n", + " 2014-02-01\n", + " London, UK\n", + " http://example2.com\n", + " This is a description of your talk, which is a...\n", " \n", " \n", " 3\n", - " 2009-04-25\n", - " Talk\n", - " Evolving Governance and Media Use in Wikipedia...\n", - " Media in Transition 6\n", - " MIT\n", - " Cambridge, MA\n", - " NaN\n", - " mit6-media-use-wikipedia\n", - " NaN\n", - " NaN\n", - " \n", - " \n", - " 4\n", - " 2009-07-26\n", - " Talk\n", - " Algorithmic Governance: The Social Roles of Bo...\n", - " First Annual Wikiconference NYC\n", - " Wikimedia Foundation\n", - " New York, NY\n", - " NaN\n", - " wikiconf-nyc-algorithmc-governance\n", - " NaN\n", - " NaN\n", - " \n", - " \n", - " 5\n", - " 2009-09-25\n", - " Talk\n", - " Trace Ethnography: An ANT Method for the Study...\n", - " the Second Annual Media Sociology Forum\n", - " New York University\n", - " New York, NY\n", - " NaN\n", - " nyu-trace-ethnography\n", - " NaN\n", - " NaN\n", - " \n", - " \n", - " 6\n", - " 2009-10-27\n", + " Conference Proceeding talk 3 on Relevant Topic...\n", " Conference proceedings talk\n", - " The Social Roles of Bots and Assisted Editing ...\n", - " International Symposium on Wikis and Open Coll...\n", - " ACM\n", - " Orlando, Florida\n", - " http://www.stuartgeiger.com/papers/geiger-wiki...\n", - " wikisym-social-roles-bots\n", - " A short paper showing the recent explosive gro...\n", - " A short paper showing the recent explosive gro...\n", - " \n", - " \n", - " 7\n", - " 2009-10-28\n", - " Talk\n", - " Where Are the Missing Wikipedians? The Sociolo...\n", - " Annual Meeting of the Society for the Social S...\n", - " NaN\n", - " Arlington, Virginia\n", - " NaN\n", - " 4s-missing-wikipedians\n", - " NaN\n", + " talk-3\n", + " Testing Institute of America 2014 Annual Confe...\n", + " 2014-03-01\n", + " Los Angeles, CA\n", " NaN\n", - " \n", - " \n", - " 8\n", - " 2010-01-10\n", - " Talk\n", - " The Wisdom of Bots: A Critique of ‘Self-Organi...\n", - " Critical Point of View: Wikipedia and the Poli...\n", - " Centre for Internet and Society\n", - " Bangalore, India\n", - " NaN\n", - " cpov-wisdom-of-bots\n", - " NaN\n", - " NaN\n", - " \n", - " \n", - " 9\n", - " 2010-02-25\n", - " Conference proceedings talk\n", - " The Work of Sustaining Order in Wikipedia: The...\n", - " Conference on Computer Supported Cooperative Work\n", - " ACM\n", - " Savannah, Georgia\n", - " http://www.stuartgeiger.com/papers/cscw-sustai...\n", - " cscw-banning-vandal\n", - " This paper traces out a heterogeneous network ...\n", - " This paper traces out a heterogeneous network ...\n", - " \n", - " \n", - " 10\n", - " 2010-03-26\n", - " Talk\n", - " Bot Politics: How is Automation Changing the W...\n", - " Critical Point of View: Wikipedia and the Poli...\n", - " Institute for Network Cultures\n", - " Amsterdam, the Netherlands\n", - " NaN\n", - " cpov-bot-politics\n", - " NaN\n", - " NaN\n", - " \n", - " \n", - " 11\n", - " 2010-07-10\n", - " Panelist\n", - " Academic Researchers in Wikimedia Communities:...\n", - " Wikimania 2010\n", - " Wikimedia Foundation\n", - " Gdansk, Poland\n", - " https://wikimania2010.wikimedia.org/wiki/Submi...\n", - " wikimania-academic-researchers\n", - " A panel intended to foster a dialog between ac...\n", - " In the past few years, there has been an explo...\n", - " \n", - " \n", - " 12\n", - " 2011-01-03\n", - " Conference proceedings talk\n", - " Trace Ethnography: Following Coordination thro...\n", - " Hawaii International Conference on System Scie...\n", - " IEEE\n", - " Lihue, Hawaii\n", - " http://www.stuartgeiger.com/trace-ethnography-...\n", - " hicss-trace-ethnography\n", - " We detail the methodology of ‘trace ethnograph...\n", - " We detail the methodology of ‘trace ethnograph...\n", - " \n", - " \n", - " 13\n", - " 2011-03-04\n", - " Talk\n", - " Machine-Generated Content: Bots and the Govern...\n", - " Digital Media and Learning (DML)\n", - " NaN\n", - " Long Beach, CA\n", - " NaN\n", - " dml-bots-governance\n", - " NaN\n", - " NaN\n", - " \n", - " \n", - " 14\n", - " 2011-10-05\n", - " Conference proceedings talk\n", - " Participation in Wikipedia’s Article Deletion ...\n", - " International Symposium on Wikis and Open Coll...\n", - " ACM\n", - " Mountain View, CA\n", - " http://www.stuartgeiger.com/papers/article-del...\n", - " wikisym-article-deletion\n", - " This paper investigates Wikipedia's article de...\n", - " We find that Wikipedia’s deletion process is h...\n", - " \n", - " \n", - " 15\n", - " 2011-11-02\n", - " Talk\n", - " ’The Internet is Here’: The Virtuality of ‘On-...\n", - " Annual Meeting of the Society for the Social S...\n", - " NaN\n", - " Cleveland, OH\n", - " NaN\n", - " 4s-internet-is-here\n", - " NaN\n", - " NaN\n", - " \n", - " \n", - " 16\n", - " 2011-11-03\n", - " Talk\n", - " User-Generated Platforms in Wikipedian Governance\n", - " Annual Meeting of the Society for the Social S...\n", - " NaN\n", - " Cleveland, OH\n", - " NaN\n", - " 4s-wikipedian-governance\n", - " NaN\n", - " NaN\n", - " \n", - " \n", - " 17\n", - " 2012-03-31\n", - " Talk\n", - " Improving Wikipedia’s Notifications to Rejecte...\n", - " GCOE International Symposium on Informatics Ed...\n", - " Kyoto University\n", - " Kyoto, Japan\n", - " NaN\n", - " gcoe-wikipedia-notifications\n", - " NaN\n", - " NaN\n", - " \n", - " \n", - " 18\n", - " 2012-05-02\n", - " Conference proceedings talk\n", - " Black-boxing the user: internet protocol over ...\n", - " Conference on Human Factors in Computing (CHI)\n", - " ACM\n", - " Austin, Texas\n", - " http://stuartgeiger.com/ipoxp.pdf\n", - " altchi-ipoxp\n", - " We introduce IP over Xylophone Players (IPoXP)...\n", - " We introduce IP over Xylophone Players (IPoXP)...\n", - " \n", - " \n", - " 19\n", - " 2012-05-07\n", - " Panelist\n", - " Hunting for Fail Whales: Lessons from Deviance...\n", - " Conference on Human Factors in Computing (CHI)\n", - " ACM\n", - " Austin, Texas\n", - " NaN\n", - " chi-fail-whales\n", - " NaN\n", - " NaN\n", - " \n", - " \n", - " 20\n", - " 2012-06-05\n", - " Conference proceedings talk\n", - " Defense Mechanism or Socialization Tactic? Imp...\n", - " International Conference on Weblogs and Social...\n", - " AAAI\n", - " Dublin, Ireland\n", - " http://stuartgeiger.com/defense-mechanism-icws...\n", - " icwsm-socialization-wikipedia\n", - " A descriptive study of Wikipedia's highly-auto...\n", - " In this paper, we first illustrate and describ...\n", - " \n", - " \n", - " 21\n", - " 2012-10-12\n", - " Talk\n", - " Trace literacy: a framework for holistically c...\n", - " Infosocial\n", - " Northwestern University\n", - " Evanston, IL\n", - " NaN\n", - " infosocial-trace-literacy\n", - " NaN\n", - " NaN\n", - " \n", - " \n", - " 22\n", - " 2012-10-17\n", - " Talk\n", - " Time to Degree: Examining the Experiences of G...\n", - " Annual Meeting of the Society for the Social S...\n", - " NaN\n", - " Copenhagen, Denmark\n", - " NaN\n", - " 4s-time-to-degree\n", - " NaN\n", - " NaN\n", - " \n", - " \n", - " 23\n", - " 2012-10-29\n", - " Panelist\n", - " What Aren’t We Measuring? Methods for Quantif...\n", - " International Symposium on Wikis and Open Coll...\n", - " ACM\n", - " Linz, Austria\n", - " NaN\n", - " wikisym-methods\n", - " NaN\n", - " NaN\n", - " \n", - " \n", - " 24\n", - " 2013-02-07\n", - " Guest lecture\n", - " Actor-Network Theory\n", - " Social Aspects of Information Systems course\n", - " UC-Berkeley School of Information\n", - " Berkeley, CA\n", - " NaN\n", - " 203-actor-network-theory\n", - " An introduction to Actor Network Theory for st...\n", - " NaN\n", - " \n", - " \n", - " 25\n", - " 2013-02-23\n", - " Conference proceedings talk\n", - " Using Edit Sessions to Measure Participation i...\n", - " Conference on Computer Supported Cooperative Work\n", - " ACM\n", - " San Antonio, TX\n", - " http://www.stuartgeiger.com/cscw-sessions.pdf\n", - " cscw-edit-sessions\n", - " This paper establishes a quantitative metric f...\n", - " Many quantitative, log-based studies of partic...\n", - " \n", - " \n", - " 26\n", - " 2013-02-26\n", - " Panelist\n", - " Community, Impact, and Credit: Where Do I Subm...\n", - " ACM Conference on Computer-Supported Cooperati...\n", - " NaN\n", - " San Antonio, TX\n", - " NaN\n", - " cscw-community-impact\n", - " NaN\n", - " NaN\n", - " \n", - " \n", - " 27\n", - " 2013-03-01\n", - " Talk\n", - " Values Where? Interrogating Client-Side Script...\n", - " Theorizing the Web\n", - " NaN\n", - " New York, NY\n", - " NaN\n", - " ttw-values-where\n", - " NaN\n", - " NaN\n", - " \n", - " \n", - " 28\n", - " 2013-08-03\n", - " Conference proceedings talk\n", - " When the Levee Breaks: Without Bots, What Happ...\n", - " International Symposium on Wikis and Open Coll...\n", - " ACM\n", - " Hong Kong\n", - " http://stuartgeiger.com/wikisym13-cluebot.pdf\n", - " wikisym-levee-breaks-bots\n", - " This paper examines what happened when one of ...\n", - " In the first half of 2011, ClueBot NG – one of...\n", - " \n", - " \n", - " 29\n", - " 2013-10-09\n", - " Talk\n", - " Hadoop as Grounded Theory: Is an STS Approach ...\n", - " Annual Meeting of the Society for the Social S...\n", - " NaN\n", - " San Diego, CA\n", - " NaN\n", - " ica-hadoop-grounded-theory\n", - " NaN\n", - " NaN\n", - " \n", - " \n", - " ...\n", - " ...\n", - " ...\n", - " ...\n", - " ...\n", - " ...\n", - " ...\n", - " ...\n", - " ...\n", - " ...\n", - " ...\n", - " \n", - " \n", - " 32\n", - " 2014-04-04\n", - " Panelist\n", - " Robotic Ethics and Opportunities\n", - " Robots and New Media\n", - " Berkeley Center for New Media\n", - " Berkeley, CA\n", - " http://robotsandnewmedia.com/\n", - " robots-new-media\n", - " A panel discussing the ethical and political i...\n", - " NaN\n", - " \n", - " \n", - " 33\n", - " 2014-04-10\n", - " Guest lecture\n", - " Governing the Commons\n", - " History of Information\n", - " UC-Berkeley School of Information\n", - " Berkeley, CA\n", - " NaN\n", - " hofi-governing-commons\n", - " A lecture on the history of Wikipedia, in the ...\n", - " NaN\n", - " \n", - " \n", - " 34\n", - " 2014-04-25\n", - " Talk\n", - " Successor Systems: Enacting Ideological Critiq...\n", - " Theorizing the Web\n", - " NaN\n", - " Brooklyn, New York\n", - " http://theorizingtheweb.tumblr.com/2014/program\n", - " ttw-successor-systems\n", - " NaN\n", - " NaN\n", - " \n", - " \n", - " 35\n", - " 2014-05-16\n", - " Talk\n", - " Successor Systems: The Role of Reflexive Algor...\n", - " The Contours of Algorithmic Life\n", - " Mellon Research Initiative in Digital Cultures\n", - " Davis, CA\n", - " NaN\n", - " algolife-successor-systems\n", - " NaN\n", - " NaN\n", - " \n", - " \n", - " 36\n", - " 2014-05-23\n", - " Panelist\n", - " Data­-Driven Data Research Using Data and Data...\n", - " Annual Meeting of the International Communicat...\n", - " NaN\n", - " Seattle, WA\n", - " NaN\n", - " ica-data-driven-data\n", - " This panel focuses on the challenges faced by ...\n", - " In the past two years, the buzzword \"big data\"...\n", - " \n", - " \n", - " 37\n", - " 2014-05-24\n", - " Panelist\n", - " Big Data is Bullshit': Scoping the Next 5 Year...\n", - " Annual Meeting of the International Communicat...\n", - " NaN\n", - " Seattle, WA\n", - " NaN\n", - " ica-big-data-bullshit\n", - " NaN\n", - " NaN\n", - " \n", - " \n", - " 38\n", - " 2014-08-23\n", - " Talk\n", - " Successor Systems: The Role of Reflexive Algor...\n", - " Annual Meeting of the Society for the Social S...\n", - " NaN\n", - " Buenos Aires, Argentina\n", - " NaN\n", - " 4s-successor-systems\n", - " NaN\n", - " NaN\n", - " \n", - " \n", - " 39\n", - " 2014-10-21\n", - " Talk\n", - " Successor Systems: The Role of Reflexive Algor...\n", - " Annual Meeting of the Association of Internet ...\n", - " NaN\n", - " Daegu, South Korea\n", - " NaN\n", - " aoir-successor-systems\n", - " NaN\n", - " NaN\n", - " \n", - " \n", - " 40\n", - " 2014-11-02\n", - " Talk\n", - " Defining, Designing, and Evaluating Civic Valu...\n", - " Human Computation Conference (HCOMP), Citizen-...\n", - " NaN\n", - " Pittsburgh, PA\n", - " http://stuartgeiger.com/defining-civic-values-...\n", - " hcomp-values-in-crowdsourcing\n", - " We review various crowdsourcing and collective...\n", - " Collective action is often described in terms ...\n", - " \n", - " \n", - " 41\n", - " 2014-12-09\n", - " Talk\n", - " Supporting Change from Outside Systems with De...\n", - " Berkman Center for Internet and Society\n", - " NaN\n", - " Cambridge, MA\n", - " NaN\n", - " berkman-successor-systems\n", - " NaN\n", - " NaN\n", - " \n", - " \n", - " 42\n", - " 2015-03-15\n", - " Workshop presentation\n", - " Situated knowledges and successor systems: dev...\n", - " CSCW Workshop on Feminism and Feminist Approac...\n", - " ACM\n", - " Vancouver, BC\n", - " NaN\n", - " cscw-feminism-workshop\n", - " NaN\n", - " NaN\n", - " \n", - " \n", - " 43\n", - " 2015-03-15\n", - " Workshop presentation\n", - " Does Facebook Have Civil Servants? On Governme...\n", - " CSCW Workshop on Ethics for Studying Sociotech...\n", - " NaN\n", - " Vancouver, BC\n", - " NaN\n", - " cscw-ethics-workshop\n", - " NaN\n", - " NaN\n", - " \n", - " \n", - " 44\n", - " 2015-03-24\n", - " Workshop presentation\n", - " Trace Ethnography Workshop\n", - " ISchools Conference\n", - " NaN\n", - " Newport Beach, CA\n", - " http://trace-ethnography.github.io\n", - " iconf-trace-ethno\n", - " NaN\n", - " NaN\n", - " \n", - " \n", - " 45\n", - " 2015-04-07\n", - " Guest lecture\n", - " Moderating Online Conversation Spaces\n", - " Social Aspects of Information Systems course\n", - " UC-Berkeley School of Information\n", - " Berkeley, CA\n", - " NaN\n", - " 203-moderating-online-spaces\n", - " An overview of how various online platforms mo...\n", - " NaN\n", - " \n", - " \n", - " 46\n", - " 2015-04-09\n", - " Guest lecture\n", - " Peer Production and Wikipedia\n", - " Social Aspects of Information Systems course\n", - " UC-Berkeley School of Information\n", - " Berkeley, CA\n", - " NaN\n", - " 203-wikipedia\n", - " An overview of Wikipedia and other peer produc...\n", - " NaN\n", - " \n", - " \n", - " 47\n", - " 2015-05-25\n", - " Talk\n", - " But it Wouldn’t Be an Encyclopedia; It Would B...\n", - " Annual Meeting of the International Communicat...\n", - " NaN\n", - " San Juan, Puerto Rico\n", - " NaN\n", - " ica-wiki-history\n", - " In this talk, I examine the early history of “...\n", - " In this talk, I examine the early history of “...\n", - " \n", - " \n", - " 48\n", - " 2015-10-23\n", - " Talk\n", - " Bot-Based Collective Blocklists in Twitter: Th...\n", - " Annual Meeting of the Association of Internet ...\n", - " NaN\n", - " Phoenix, AZ\n", - " NaN\n", - " aoir-blockbots\n", - " This presentation introduces bot-based collect...\n", - " NaN\n", - " \n", - " \n", - " 49\n", - " 2015-11-06\n", - " Panelist\n", - " Crowdsourcing: Theoretical Considerations\n", - " Crowdsourcing and the Academy Symposium\n", - " UC-Berkeley\n", - " Berkeley, CA\n", - " http://hssa.berkeley.edu/crowdsourcing-symposium\n", - " crowdsourcing-academy\n", - " A panel discussing how academics use crowdsour...\n", - " NaN\n", - " \n", - " \n", - " 50\n", - " 2015-11-12\n", - " Talk\n", - " The Bot Multiple: Unpacking the Materialities ...\n", - " Annual Meeting of the Society for the Social S...\n", - " NaN\n", - " Denver, CO\n", - " NaN\n", - " 4s-bot-multiple\n", - " I examine the roles that automated software ag...\n", - " This paper examines the roles that automated s...\n", - " \n", - " \n", - " 51\n", - " 2016-01-16\n", - " Talk\n", - " Why bots are my favorite contribution to Wikip...\n", - " Wikipedia 15th Anniversary Birthday Bash\n", - " Wikimedia Foundation\n", - " San Francisco, CA\n", - " NaN\n", - " wiki15-bots\n", - " A short talk to open up an event celebrating t...\n", - " A short talk to open up an event celebrating t...\n", - " \n", - " \n", - " 52\n", - " 2016-02-17\n", - " Talk\n", - " Scraping Wikipedia Data\n", - " The Hacker Within, BIDS\n", - " Berkeley Institute for Data Science\n", - " Berkeley, CA\n", - " http://www.thehackerwithin.org/berkeley/posts/...\n", - " thw-scraping-wikipedia\n", - " A tutorial (with Jupyter notebooks) about how ...\n", - " A tutorial (with Jupyter notebooks) about how ...\n", - " \n", - " \n", - " 53\n", - " 2016-04-15\n", - " Talk\n", - " “What the hack?” Hacking culture and discourse...\n", - " Theorizing the Web\n", - " NaN\n", - " Astoria, New York\n", - " http://theorizingtheweb.tumblr.com/2016/program\n", - " ttw-what-the-hack\n", - " NaN\n", - " NaN\n", - " \n", - " \n", - " 54\n", - " 2016-04-16\n", - " Talk\n", - " Moderating harassment in Twitter with blockbot...\n", - " Theorizing the Web\n", - " NaN\n", - " Astoria, New York\n", - " http://theorizingtheweb.tumblr.com/2016/program\n", - " ttw-blockbots\n", - " NaN\n", - " NaN\n", - " \n", - " \n", - " 55\n", - " 2016-06-08\n", - " Talk\n", - " Algorithms as agents of gatekeeping, governanc...\n", - " Algorithms, Automation, and Politics workshop\n", - " International Communication Association\n", - " Fukuoka, Japan\n", - " NaN\n", - " ica-algorithms-gatekeeping\n", - " I discuss how algorithmic systems are deployed...\n", - " This talk is based on a multi-year ethnographi...\n", - " \n", - " \n", - " 56\n", - " 2016-06-09\n", - " Talk\n", - " Successor Systems: Lessons for Big Data From F...\n", - " Big Data: Critiques and Alternatives workshop\n", - " International Communication Association\n", - " Fukuoka, Japan\n", - " NaN\n", - " ica-successor-systems\n", - " I discuss four data-intensive activist project...\n", - " The concept of successor systems extends Hardi...\n", - " \n", - " \n", - " 57\n", - " 2016-06-11\n", - " Panelist\n", - " Drowning in Data: Industry and Academic Approa...\n", - " Annual Meeting of the International Communicat...\n", - " NaN\n", - " Fukuoka, Japan\n", - " NaN\n", - " ica-drowning-in-data\n", - " This panel extends discusses the potentials an...\n", - " In the past five years, as “big data” research...\n", - " \n", - " \n", - " 58\n", - " 2016-06-14\n", - " Talk\n", - " Administrative Support Bots in Wikipedia: How ...\n", - " Communicating with Machines workshop\n", - " International Communication Association\n", - " Fukuoka, Japan\n", - " NaN\n", - " ica-communicating-with-machines\n", - " I discuss cases from a multi-year ethnographic...\n", - " I discuss cases from a multi-year ethnographic...\n", - " \n", - " \n", - " 59\n", - " 2016-07-16\n", - " Talk\n", - " Governing Open Source Projects at Scale: Lesso...\n", - " SciPy\n", - " NaN\n", - " Austin, Texas\n", - " NaN\n", - " scipy-governing-scale\n", - " Many open source, volunteer-driven projects be...\n", - " Many open source, volunteer-driven projects be...\n", - " \n", - " \n", - " 60\n", - " 2016-08-13\n", - " Talk\n", - " Community Sustainability in Wikipedia: A Revie...\n", - " PyData SF\n", - " NumFocus\n", - " San Francisco, CA\n", - " NaN\n", - " pydata-community-sustainability\n", - " Wikipedia relies on one of the world’s largest...\n", - " Wikipedia relies on one of the world’s largest...\n", - " \n", - " \n", - " 61\n", - " 2016-09-02\n", - " Talk\n", - " “The Wisdom of Bots:” An ethnographic study of...\n", - " Annual Meeting of the Society for the Social S...\n", - " NaN\n", - " Barcelona, Spain\n", - " http://www.sts2016bcn.org/sessions/t001-2-mate...\n", - " 4s-wisdom-of-bots\n", - " Wikipedians rely on software agents to govern ...\n", - " I present findings from a multi-year ethnograp...\n", + " This is a description of your conference proce...\n", " \n", " \n", "\n", - "

62 rows × 10 columns

\n", "" ], "text/plain": [ - " date talk_type \\\n", - "0 2008-03-01 Talk \n", - "1 2008-07-19 Talk \n", - "2 2009-03-28 Talk \n", - "3 2009-04-25 Talk \n", - "4 2009-07-26 Talk \n", - "5 2009-09-25 Talk \n", - "6 2009-10-27 Conference proceedings talk \n", - "7 2009-10-28 Talk \n", - "8 2010-01-10 Talk \n", - "9 2010-02-25 Conference proceedings talk \n", - "10 2010-03-26 Talk \n", - "11 2010-07-10 Panelist \n", - "12 2011-01-03 Conference proceedings talk \n", - "13 2011-03-04 Talk \n", - "14 2011-10-05 Conference proceedings talk \n", - "15 2011-11-02 Talk \n", - "16 2011-11-03 Talk \n", - "17 2012-03-31 Talk \n", - "18 2012-05-02 Conference proceedings talk \n", - "19 2012-05-07 Panelist \n", - "20 2012-06-05 Conference proceedings talk \n", - "21 2012-10-12 Talk \n", - "22 2012-10-17 Talk \n", - "23 2012-10-29 Panelist \n", - "24 2013-02-07 Guest lecture \n", - "25 2013-02-23 Conference proceedings talk \n", - "26 2013-02-26 Panelist \n", - "27 2013-03-01 Talk \n", - "28 2013-08-03 Conference proceedings talk \n", - "29 2013-10-09 Talk \n", - ".. ... ... \n", - "32 2014-04-04 Panelist \n", - "33 2014-04-10 Guest lecture \n", - "34 2014-04-25 Talk \n", - "35 2014-05-16 Talk \n", - "36 2014-05-23 Panelist \n", - "37 2014-05-24 Panelist \n", - "38 2014-08-23 Talk \n", - "39 2014-10-21 Talk \n", - "40 2014-11-02 Talk \n", - "41 2014-12-09 Talk \n", - "42 2015-03-15 Workshop presentation \n", - "43 2015-03-15 Workshop presentation \n", - "44 2015-03-24 Workshop presentation \n", - "45 2015-04-07 Guest lecture \n", - "46 2015-04-09 Guest lecture \n", - "47 2015-05-25 Talk \n", - "48 2015-10-23 Talk \n", - "49 2015-11-06 Panelist \n", - "50 2015-11-12 Talk \n", - "51 2016-01-16 Talk \n", - "52 2016-02-17 Talk \n", - "53 2016-04-15 Talk \n", - "54 2016-04-16 Talk \n", - "55 2016-06-08 Talk \n", - "56 2016-06-09 Talk \n", - "57 2016-06-11 Panelist \n", - "58 2016-06-14 Talk \n", - "59 2016-07-16 Talk \n", - "60 2016-08-13 Talk \n", - "61 2016-09-02 Talk \n", - "\n", - " title \\\n", - "0 A Communicative Ethnography of Argumentative S... \n", - "1 Conceptions and Misconceptions Academics Hold ... \n", - "2 Working With/in Wikipedia: Infrastructures of ... \n", - "3 Evolving Governance and Media Use in Wikipedia... \n", - "4 Algorithmic Governance: The Social Roles of Bo... \n", - "5 Trace Ethnography: An ANT Method for the Study... \n", - "6 The Social Roles of Bots and Assisted Editing ... \n", - "7 Where Are the Missing Wikipedians? The Sociolo... \n", - "8 The Wisdom of Bots: A Critique of ‘Self-Organi... \n", - "9 The Work of Sustaining Order in Wikipedia: The... \n", - "10 Bot Politics: How is Automation Changing the W... \n", - "11 Academic Researchers in Wikimedia Communities:... \n", - "12 Trace Ethnography: Following Coordination thro... \n", - "13 Machine-Generated Content: Bots and the Govern... \n", - "14 Participation in Wikipedia’s Article Deletion ... \n", - "15 ’The Internet is Here’: The Virtuality of ‘On-... \n", - "16 User-Generated Platforms in Wikipedian Governance \n", - "17 Improving Wikipedia’s Notifications to Rejecte... \n", - "18 Black-boxing the user: internet protocol over ... \n", - "19 Hunting for Fail Whales: Lessons from Deviance... \n", - "20 Defense Mechanism or Socialization Tactic? Imp... \n", - "21 Trace literacy: a framework for holistically c... \n", - "22 Time to Degree: Examining the Experiences of G... \n", - "23 What Aren’t We Measuring? Methods for Quantif... \n", - "24 Actor-Network Theory \n", - "25 Using Edit Sessions to Measure Participation i... \n", - "26 Community, Impact, and Credit: Where Do I Subm... \n", - "27 Values Where? Interrogating Client-Side Script... \n", - "28 When the Levee Breaks: Without Bots, What Happ... \n", - "29 Hadoop as Grounded Theory: Is an STS Approach ... \n", - ".. ... \n", - "32 Robotic Ethics and Opportunities \n", - "33 Governing the Commons \n", - "34 Successor Systems: Enacting Ideological Critiq... \n", - "35 Successor Systems: The Role of Reflexive Algor... \n", - "36 Data­-Driven Data Research Using Data and Data... \n", - "37 Big Data is Bullshit': Scoping the Next 5 Year... \n", - "38 Successor Systems: The Role of Reflexive Algor... \n", - "39 Successor Systems: The Role of Reflexive Algor... \n", - "40 Defining, Designing, and Evaluating Civic Valu... \n", - "41 Supporting Change from Outside Systems with De... \n", - "42 Situated knowledges and successor systems: dev... \n", - "43 Does Facebook Have Civil Servants? On Governme... \n", - "44 Trace Ethnography Workshop \n", - "45 Moderating Online Conversation Spaces \n", - "46 Peer Production and Wikipedia \n", - "47 But it Wouldn’t Be an Encyclopedia; It Would B... \n", - "48 Bot-Based Collective Blocklists in Twitter: Th... \n", - "49 Crowdsourcing: Theoretical Considerations \n", - "50 The Bot Multiple: Unpacking the Materialities ... \n", - "51 Why bots are my favorite contribution to Wikip... \n", - "52 Scraping Wikipedia Data \n", - "53 “What the hack?” Hacking culture and discourse... \n", - "54 Moderating harassment in Twitter with blockbot... \n", - "55 Algorithms as agents of gatekeeping, governanc... \n", - "56 Successor Systems: Lessons for Big Data From F... \n", - "57 Drowning in Data: Industry and Academic Approa... \n", - "58 Administrative Support Bots in Wikipedia: How ... \n", - "59 Governing Open Source Projects at Scale: Lesso... \n", - "60 Community Sustainability in Wikipedia: A Revie... \n", - "61 “The Wisdom of Bots:” An ethnographic study of... \n", - "\n", - " venue \\\n", - "0 Exploring New Media Worlds \n", - "1 Annual Wikimedia Conference (Wikimania) \n", - "2 Annual Conference on Science and Technology in... \n", - "3 Media in Transition 6 \n", - "4 First Annual Wikiconference NYC \n", - "5 the Second Annual Media Sociology Forum \n", - "6 International Symposium on Wikis and Open Coll... \n", - "7 Annual Meeting of the Society for the Social S... \n", - "8 Critical Point of View: Wikipedia and the Poli... \n", - "9 Conference on Computer Supported Cooperative Work \n", - "10 Critical Point of View: Wikipedia and the Poli... \n", - "11 Wikimania 2010 \n", - "12 Hawaii International Conference on System Scie... \n", - "13 Digital Media and Learning (DML) \n", - "14 International Symposium on Wikis and Open Coll... \n", - "15 Annual Meeting of the Society for the Social S... \n", - "16 Annual Meeting of the Society for the Social S... \n", - "17 GCOE International Symposium on Informatics Ed... \n", - "18 Conference on Human Factors in Computing (CHI) \n", - "19 Conference on Human Factors in Computing (CHI) \n", - "20 International Conference on Weblogs and Social... \n", - "21 Infosocial \n", - "22 Annual Meeting of the Society for the Social S... \n", - "23 International Symposium on Wikis and Open Coll... \n", - "24 Social Aspects of Information Systems course \n", - "25 Conference on Computer Supported Cooperative Work \n", - "26 ACM Conference on Computer-Supported Cooperati... \n", - "27 Theorizing the Web \n", - "28 International Symposium on Wikis and Open Coll... \n", - "29 Annual Meeting of the Society for the Social S... \n", - ".. ... \n", - "32 Robots and New Media \n", - "33 History of Information \n", - "34 Theorizing the Web \n", - "35 The Contours of Algorithmic Life \n", - "36 Annual Meeting of the International Communicat... \n", - "37 Annual Meeting of the International Communicat... \n", - "38 Annual Meeting of the Society for the Social S... \n", - "39 Annual Meeting of the Association of Internet ... \n", - "40 Human Computation Conference (HCOMP), Citizen-... \n", - "41 Berkman Center for Internet and Society \n", - "42 CSCW Workshop on Feminism and Feminist Approac... \n", - "43 CSCW Workshop on Ethics for Studying Sociotech... \n", - "44 ISchools Conference \n", - "45 Social Aspects of Information Systems course \n", - "46 Social Aspects of Information Systems course \n", - "47 Annual Meeting of the International Communicat... \n", - "48 Annual Meeting of the Association of Internet ... \n", - "49 Crowdsourcing and the Academy Symposium \n", - "50 Annual Meeting of the Society for the Social S... \n", - "51 Wikipedia 15th Anniversary Birthday Bash \n", - "52 The Hacker Within, BIDS \n", - "53 Theorizing the Web \n", - "54 Theorizing the Web \n", - "55 Algorithms, Automation, and Politics workshop \n", - "56 Big Data: Critiques and Alternatives workshop \n", - "57 Annual Meeting of the International Communicat... \n", - "58 Communicating with Machines workshop \n", - "59 SciPy \n", - "60 PyData SF \n", - "61 Annual Meeting of the Society for the Social S... \n", - "\n", - " institution \\\n", - "0 Texas A&M University \n", - "1 Wikimedia Foundation \n", - "2 AAAS \n", - "3 MIT \n", - "4 Wikimedia Foundation \n", - "5 New York University \n", - "6 ACM \n", - "7 NaN \n", - "8 Centre for Internet and Society \n", - "9 ACM \n", - "10 Institute for Network Cultures \n", - "11 Wikimedia Foundation \n", - "12 IEEE \n", - "13 NaN \n", - "14 ACM \n", - "15 NaN \n", - "16 NaN \n", - "17 Kyoto University \n", - "18 ACM \n", - "19 ACM \n", - "20 AAAI \n", - "21 Northwestern University \n", - "22 NaN \n", - "23 ACM \n", - "24 UC-Berkeley School of Information \n", - "25 ACM \n", - "26 NaN \n", - "27 NaN \n", - "28 ACM \n", - "29 NaN \n", - ".. ... \n", - "32 Berkeley Center for New Media \n", - "33 UC-Berkeley School of Information \n", - "34 NaN \n", - "35 Mellon Research Initiative in Digital Cultures \n", - "36 NaN \n", - "37 NaN \n", - "38 NaN \n", - "39 NaN \n", - "40 NaN \n", - "41 NaN \n", - "42 ACM \n", - "43 NaN \n", - "44 NaN \n", - "45 UC-Berkeley School of Information \n", - "46 UC-Berkeley School of Information \n", - "47 NaN \n", - "48 NaN \n", - "49 UC-Berkeley \n", - "50 NaN \n", - "51 Wikimedia Foundation \n", - "52 Berkeley Institute for Data Science \n", - "53 NaN \n", - "54 NaN \n", - "55 International Communication Association \n", - "56 International Communication Association \n", - "57 NaN \n", - "58 International Communication Association \n", - "59 NaN \n", - "60 NumFocus \n", - "61 NaN \n", - "\n", - " geoloc \\\n", - "0 College Station, TX \n", - "1 Alexandria, Egypt \n", - "2 Wasthington, Dc \n", - "3 Cambridge, MA \n", - "4 New York, NY \n", - "5 New York, NY \n", - "6 Orlando, Florida \n", - "7 Arlington, Virginia \n", - "8 Bangalore, India \n", - "9 Savannah, Georgia \n", - "10 Amsterdam, the Netherlands \n", - "11 Gdansk, Poland \n", - "12 Lihue, Hawaii \n", - "13 Long Beach, CA \n", - "14 Mountain View, CA \n", - "15 Cleveland, OH \n", - "16 Cleveland, OH \n", - "17 Kyoto, Japan \n", - "18 Austin, Texas \n", - "19 Austin, Texas \n", - "20 Dublin, Ireland \n", - "21 Evanston, IL \n", - "22 Copenhagen, Denmark \n", - "23 Linz, Austria \n", - "24 Berkeley, CA \n", - "25 San Antonio, TX \n", - "26 San Antonio, TX \n", - "27 New York, NY \n", - "28 Hong Kong \n", - "29 San Diego, CA \n", - ".. ... \n", - "32 Berkeley, CA \n", - "33 Berkeley, CA \n", - "34 Brooklyn, New York \n", - "35 Davis, CA \n", - "36 Seattle, WA \n", - "37 Seattle, WA \n", - "38 Buenos Aires, Argentina \n", - "39 Daegu, South Korea \n", - "40 Pittsburgh, PA \n", - "41 Cambridge, MA \n", - "42 Vancouver, BC \n", - "43 Vancouver, BC \n", - "44 Newport Beach, CA \n", - "45 Berkeley, CA \n", - "46 Berkeley, CA \n", - "47 San Juan, Puerto Rico \n", - "48 Phoenix, AZ \n", - "49 Berkeley, CA \n", - "50 Denver, CO \n", - "51 San Francisco, CA \n", - "52 Berkeley, CA \n", - "53 Astoria, New York \n", - "54 Astoria, New York \n", - "55 Fukuoka, Japan \n", - "56 Fukuoka, Japan \n", - "57 Fukuoka, Japan \n", - "58 Fukuoka, Japan \n", - "59 Austin, Texas \n", - "60 San Francisco, CA \n", - "61 Barcelona, Spain \n", - "\n", - " url \\\n", - "0 NaN \n", - "1 NaN \n", - "2 NaN \n", - "3 NaN \n", - "4 NaN \n", - "5 NaN \n", - "6 http://www.stuartgeiger.com/papers/geiger-wiki... \n", - "7 NaN \n", - "8 NaN \n", - "9 http://www.stuartgeiger.com/papers/cscw-sustai... \n", - "10 NaN \n", - "11 https://wikimania2010.wikimedia.org/wiki/Submi... \n", - "12 http://www.stuartgeiger.com/trace-ethnography-... \n", - "13 NaN \n", - "14 http://www.stuartgeiger.com/papers/article-del... \n", - "15 NaN \n", - "16 NaN \n", - "17 NaN \n", - "18 http://stuartgeiger.com/ipoxp.pdf \n", - "19 NaN \n", - "20 http://stuartgeiger.com/defense-mechanism-icws... \n", - "21 NaN \n", - "22 NaN \n", - "23 NaN \n", - "24 NaN \n", - "25 http://www.stuartgeiger.com/cscw-sessions.pdf \n", - "26 NaN \n", - "27 NaN \n", - "28 http://stuartgeiger.com/wikisym13-cluebot.pdf \n", - "29 NaN \n", - ".. ... \n", - "32 http://robotsandnewmedia.com/ \n", - "33 NaN \n", - "34 http://theorizingtheweb.tumblr.com/2014/program \n", - "35 NaN \n", - "36 NaN \n", - "37 NaN \n", - "38 NaN \n", - "39 NaN \n", - "40 http://stuartgeiger.com/defining-civic-values-... \n", - "41 NaN \n", - "42 NaN \n", - "43 NaN \n", - "44 http://trace-ethnography.github.io \n", - "45 NaN \n", - "46 NaN \n", - "47 NaN \n", - "48 NaN \n", - "49 http://hssa.berkeley.edu/crowdsourcing-symposium \n", - "50 NaN \n", - "51 NaN \n", - "52 http://www.thehackerwithin.org/berkeley/posts/... \n", - "53 http://theorizingtheweb.tumblr.com/2016/program \n", - "54 http://theorizingtheweb.tumblr.com/2016/program \n", - "55 NaN \n", - "56 NaN \n", - "57 NaN \n", - "58 NaN \n", - "59 NaN \n", - "60 NaN \n", - "61 http://www.sts2016bcn.org/sessions/t001-2-mate... \n", + " title \\\n", + "0 Talk 1 on Relevant Topic in Your Field \n", + "1 Tutorial 1 on Relevant Topic in Your Field \n", + "2 Talk 2 on Relevant Topic in Your Field \n", + "3 Conference Proceeding talk 3 on Relevant Topic... \n", "\n", - " slug \\\n", - "0 tamu-communicative-ethnography \n", - "1 wikimania-academics-wikipedia \n", - "2 aaas-wikipedia-infrastructures \n", - "3 mit6-media-use-wikipedia \n", - "4 wikiconf-nyc-algorithmc-governance \n", - "5 nyu-trace-ethnography \n", - "6 wikisym-social-roles-bots \n", - "7 4s-missing-wikipedians \n", - "8 cpov-wisdom-of-bots \n", - "9 cscw-banning-vandal \n", - "10 cpov-bot-politics \n", - "11 wikimania-academic-researchers \n", - "12 hicss-trace-ethnography \n", - "13 dml-bots-governance \n", - "14 wikisym-article-deletion \n", - "15 4s-internet-is-here \n", - "16 4s-wikipedian-governance \n", - "17 gcoe-wikipedia-notifications \n", - "18 altchi-ipoxp \n", - "19 chi-fail-whales \n", - "20 icwsm-socialization-wikipedia \n", - "21 infosocial-trace-literacy \n", - "22 4s-time-to-degree \n", - "23 wikisym-methods \n", - "24 203-actor-network-theory \n", - "25 cscw-edit-sessions \n", - "26 cscw-community-impact \n", - "27 ttw-values-where \n", - "28 wikisym-levee-breaks-bots \n", - "29 ica-hadoop-grounded-theory \n", - ".. ... \n", - "32 robots-new-media \n", - "33 hofi-governing-commons \n", - "34 ttw-successor-systems \n", - "35 algolife-successor-systems \n", - "36 ica-data-driven-data \n", - "37 ica-big-data-bullshit \n", - "38 4s-successor-systems \n", - "39 aoir-successor-systems \n", - "40 hcomp-values-in-crowdsourcing \n", - "41 berkman-successor-systems \n", - "42 cscw-feminism-workshop \n", - "43 cscw-ethics-workshop \n", - "44 iconf-trace-ethno \n", - "45 203-moderating-online-spaces \n", - "46 203-wikipedia \n", - "47 ica-wiki-history \n", - "48 aoir-blockbots \n", - "49 crowdsourcing-academy \n", - "50 4s-bot-multiple \n", - "51 wiki15-bots \n", - "52 thw-scraping-wikipedia \n", - "53 ttw-what-the-hack \n", - "54 ttw-blockbots \n", - "55 ica-algorithms-gatekeeping \n", - "56 ica-successor-systems \n", - "57 ica-drowning-in-data \n", - "58 ica-communicating-with-machines \n", - "59 scipy-governing-scale \n", - "60 pydata-community-sustainability \n", - "61 4s-wisdom-of-bots \n", + " type url_slug \\\n", + "0 Talk talk-1 \n", + "1 Tutorial tutorial-1 \n", + "2 Talk talk-2 \n", + "3 Conference proceedings talk talk-3 \n", "\n", - " summary \\\n", - "0 NaN \n", - "1 NaN \n", - "2 NaN \n", - "3 NaN \n", - "4 NaN \n", - "5 NaN \n", - "6 A short paper showing the recent explosive gro... \n", - "7 NaN \n", - "8 NaN \n", - "9 This paper traces out a heterogeneous network ... \n", - "10 NaN \n", - "11 A panel intended to foster a dialog between ac... \n", - "12 We detail the methodology of ‘trace ethnograph... \n", - "13 NaN \n", - "14 This paper investigates Wikipedia's article de... \n", - "15 NaN \n", - "16 NaN \n", - "17 NaN \n", - "18 We introduce IP over Xylophone Players (IPoXP)... \n", - "19 NaN \n", - "20 A descriptive study of Wikipedia's highly-auto... \n", - "21 NaN \n", - "22 NaN \n", - "23 NaN \n", - "24 An introduction to Actor Network Theory for st... \n", - "25 This paper establishes a quantitative metric f... \n", - "26 NaN \n", - "27 NaN \n", - "28 This paper examines what happened when one of ... \n", - "29 NaN \n", - ".. ... \n", - "32 A panel discussing the ethical and political i... \n", - "33 A lecture on the history of Wikipedia, in the ... \n", - "34 NaN \n", - "35 NaN \n", - "36 This panel focuses on the challenges faced by ... \n", - "37 NaN \n", - "38 NaN \n", - "39 NaN \n", - "40 We review various crowdsourcing and collective... \n", - "41 NaN \n", - "42 NaN \n", - "43 NaN \n", - "44 NaN \n", - "45 An overview of how various online platforms mo... \n", - "46 An overview of Wikipedia and other peer produc... \n", - "47 In this talk, I examine the early history of “... \n", - "48 This presentation introduces bot-based collect... \n", - "49 A panel discussing how academics use crowdsour... \n", - "50 I examine the roles that automated software ag... \n", - "51 A short talk to open up an event celebrating t... \n", - "52 A tutorial (with Jupyter notebooks) about how ... \n", - "53 NaN \n", - "54 NaN \n", - "55 I discuss how algorithmic systems are deployed... \n", - "56 I discuss four data-intensive activist project... \n", - "57 This panel extends discusses the potentials an... \n", - "58 I discuss cases from a multi-year ethnographic... \n", - "59 Many open source, volunteer-driven projects be... \n", - "60 Wikipedia relies on one of the world’s largest... \n", - "61 Wikipedians rely on software agents to govern ... \n", + " venue date \\\n", + "0 UC San Francisco, Department of Testing 2012-03-01 \n", + "1 UC-Berkeley Institute for Testing Science 2013-03-01 \n", + "2 London School of Testing 2014-02-01 \n", + "3 Testing Institute of America 2014 Annual Confe... 2014-03-01 \n", "\n", - " description \n", - "0 NaN \n", - "1 NaN \n", - "2 NaN \n", - "3 NaN \n", - "4 NaN \n", - "5 NaN \n", - "6 A short paper showing the recent explosive gro... \n", - "7 NaN \n", - "8 NaN \n", - "9 This paper traces out a heterogeneous network ... \n", - "10 NaN \n", - "11 In the past few years, there has been an explo... \n", - "12 We detail the methodology of ‘trace ethnograph... \n", - "13 NaN \n", - "14 We find that Wikipedia’s deletion process is h... \n", - "15 NaN \n", - "16 NaN \n", - "17 NaN \n", - "18 We introduce IP over Xylophone Players (IPoXP)... \n", - "19 NaN \n", - "20 In this paper, we first illustrate and describ... \n", - "21 NaN \n", - "22 NaN \n", - "23 NaN \n", - "24 NaN \n", - "25 Many quantitative, log-based studies of partic... \n", - "26 NaN \n", - "27 NaN \n", - "28 In the first half of 2011, ClueBot NG – one of... \n", - "29 NaN \n", - ".. ... \n", - "32 NaN \n", - "33 NaN \n", - "34 NaN \n", - "35 NaN \n", - "36 In the past two years, the buzzword \"big data\"... \n", - "37 NaN \n", - "38 NaN \n", - "39 NaN \n", - "40 Collective action is often described in terms ... \n", - "41 NaN \n", - "42 NaN \n", - "43 NaN \n", - "44 NaN \n", - "45 NaN \n", - "46 NaN \n", - "47 In this talk, I examine the early history of “... \n", - "48 NaN \n", - "49 NaN \n", - "50 This paper examines the roles that automated s... \n", - "51 A short talk to open up an event celebrating t... \n", - "52 A tutorial (with Jupyter notebooks) about how ... \n", - "53 NaN \n", - "54 NaN \n", - "55 This talk is based on a multi-year ethnographi... \n", - "56 The concept of successor systems extends Hardi... \n", - "57 In the past five years, as “big data” research... \n", - "58 I discuss cases from a multi-year ethnographic... \n", - "59 Many open source, volunteer-driven projects be... \n", - "60 Wikipedia relies on one of the world’s largest... \n", - "61 I present findings from a multi-year ethnograp... \n", + " location talk_url \\\n", + "0 San Francisco, California NaN \n", + "1 Berkeley CA, USA http://exampleurl.com \n", + "2 London, UK http://example2.com \n", + "3 Los Angeles, CA NaN \n", "\n", - "[62 rows x 10 columns]" + " description \n", + "0 This is a description of your talk, which is a... \n", + "1 This is a description of your tutorial, note t... \n", + "2 This is a description of your talk, which is a... \n", + "3 This is a description of your conference proce... " ] }, - "execution_count": 2, + "execution_count": 3, "metadata": {}, "output_type": "execute_result" } @@ -1446,20 +197,29 @@ "talks" ] }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## Escape special characters\n", + "\n", + "YAML is very picky about how it takes a valid string, so we are replacing single and double quotes (and ampersands) with their HTML encoded equivilents. This makes them look not so readable in raw format, but they are parsed and rendered nicely." + ] + }, { "cell_type": "code", - "execution_count": 3, + "execution_count": 4, "metadata": { - "collapsed": false + "collapsed": false, + "deletable": true, + "editable": true }, "outputs": [], "source": [ "html_escape_table = {\n", " \"&\": \"&\",\n", " '\"': \""\",\n", - " \"'\": \"'\",\n", - " \">\": \">\",\n", - " \"<\": \"<\",\n", + " \"'\": \"'\"\n", " }\n", "\n", "def html_escape(text):\n", @@ -1470,904 +230,105 @@ ] }, { - "cell_type": "code", - "execution_count": 4, - "metadata": { - "collapsed": false - }, - "outputs": [], + "cell_type": "markdown", + "metadata": {}, "source": [ - "!mkdir _talks" + "## Creating the markdown files\n", + "\n", + "This is where the heavy lifting is done. This loops through all the rows in the TSV dataframe, then starts to concatentate a big string (```md```) that contains the markdown for each type. It does the YAML metadata first, then does the description for the individual page." ] }, { "cell_type": "code", "execution_count": 5, "metadata": { - "collapsed": false + "collapsed": false, + "deletable": true, + "editable": true }, - "outputs": [ - { - "name": "stdout", - "output_type": "stream", - "text": [ - "---\n", - "title: \"A Communicative Ethnography of Argumentative Strategies in a Wikipedian Content Dispute\"\n", - "collection: talks\n", - "type: \"Talk\"\n", - "permalink: /talks/2008-03-01-tamu-communicative-ethnography\n", - "venue: \"Exploring New Media Worlds\"\n", - "date: 2008-03-01\n", - "location: \"College Station, TX\"\n", - "---\n", - "\n", - "---\n", - "title: \"Conceptions and Misconceptions Academics Hold About Wikipedia\"\n", - "collection: talks\n", - "type: \"Talk\"\n", - "permalink: /talks/2008-07-19-wikimania-academics-wikipedia\n", - "venue: \"Annual Wikimedia Conference (Wikimania)\"\n", - "date: 2008-07-19\n", - "location: \"Alexandria, Egypt\"\n", - "---\n", - "\n", - "---\n", - "title: \"Working With/in Wikipedia: Infrastructures of Knowing and Knowledge Production\"\n", - "collection: talks\n", - "type: \"Talk\"\n", - "permalink: /talks/2009-03-28-aaas-wikipedia-infrastructures\n", - "venue: \"Annual Conference on Science and Technology in Society\"\n", - "date: 2009-03-28\n", - "location: \"Wasthington, Dc\"\n", - "---\n", - "\n", - "---\n", - "title: \"Evolving Governance and Media Use in Wikipedia: A Historical Account\"\n", - "collection: talks\n", - "type: \"Talk\"\n", - "permalink: /talks/2009-04-25-mit6-media-use-wikipedia\n", - "venue: \"Media in Transition 6\"\n", - "date: 2009-04-25\n", - "location: \"Cambridge, MA\"\n", - "---\n", - "\n", - "---\n", - "title: \"Algorithmic Governance: The Social Roles of Bots and Assisted Editing Tools\"\n", - "collection: talks\n", - "type: \"Talk\"\n", - "permalink: /talks/2009-07-26-wikiconf-nyc-algorithmc-governance\n", - "venue: \"First Annual Wikiconference NYC\"\n", - "date: 2009-07-26\n", - "location: \"New York, NY\"\n", - "---\n", - "\n", - "---\n", - "title: \"Trace Ethnography: An ANT Method for the Study of Sociotechnical Networks\"\n", - "collection: talks\n", - "type: \"Talk\"\n", - "permalink: /talks/2009-09-25-nyu-trace-ethnography\n", - "venue: \" the Second Annual Media Sociology Forum\"\n", - "date: 2009-09-25\n", - "location: \"New York, NY\"\n", - "---\n", - "\n", - "---\n", - "title: \"The Social Roles of Bots and Assisted Editing Tools\"\n", - "collection: talks\n", - "type: \"Conference proceedings talk\"\n", - "permalink: /talks/2009-10-27-wikisym-social-roles-bots\n", - "venue: \"International Symposium on Wikis and Open Collaboration\"\n", - "date: 2009-10-27\n", - "location: \"Orlando, Florida\"\n", - "excerpt: \"A short paper showing the recent explosive growth of automated editors (or bots) in Wikipedia, which have taken on many new tasks in administrative spaces.\"\n", - "---\n", - "\n", - "Link to more information\n", - "\n", - "A short paper showing the recent explosive growth of automated editors (or bots) in Wikipedia, which have taken on many new tasks in administrative spaces.\n", - "\n", - "---\n", - "title: \"Where Are the Missing Wikipedians? The Sociology of a Bot\"\n", - "collection: talks\n", - "type: \"Talk\"\n", - "permalink: /talks/2009-10-28-4s-missing-wikipedians\n", - "venue: \"Annual Meeting of the Society for the Social Study of Science (4S)\"\n", - "date: 2009-10-28\n", - "location: \"Arlington, Virginia\"\n", - "---\n", - "\n", - "---\n", - "title: \"The Wisdom of Bots: A Critique of ‘Self-Organization’ in Wikipedia\"\n", - "collection: talks\n", - "type: \"Talk\"\n", - "permalink: /talks/2010-01-10-cpov-wisdom-of-bots\n", - "venue: \"Critical Point of View: Wikipedia and the Politics of Open Knowledge\"\n", - "date: 2010-01-10\n", - "location: \"Bangalore, India\"\n", - "---\n", - "\n", - "---\n", - "title: \"The Work of Sustaining Order in Wikipedia: The Banning of a Vandal\"\n", - "collection: talks\n", - "type: \"Conference proceedings talk\"\n", - "permalink: /talks/2010-02-25-cscw-banning-vandal\n", - "venue: \"Conference on Computer Supported Cooperative Work\"\n", - "date: 2010-02-25\n", - "location: \"Savannah, Georgia\"\n", - "excerpt: \"This paper traces out a heterogeneous network of humans and non-humans involved in the identification and banning of a single vandal in Wikipedia.\"\n", - "---\n", - "\n", - "Link to more information\n", - "\n", - "This paper traces out a heterogeneous network of humans and non-humans involved in the identification and banning of a single vandal in Wikipedia.\n", - "\n", - "---\n", - "title: \"Bot Politics: How is Automation Changing the Wikipedian Society? Critical Point of View II\"\n", - "collection: talks\n", - "type: \"Talk\"\n", - "permalink: /talks/2010-03-26-cpov-bot-politics\n", - "venue: \"Critical Point of View: Wikipedia and the Politics of Open Knowledge\"\n", - "date: 2010-03-26\n", - "location: \"Amsterdam, the Netherlands\"\n", - "---\n", - "\n", - "---\n", - "title: \"Academic Researchers in Wikimedia Communities: Ethics, Methods, and Policies\"\n", - "collection: talks\n", - "type: \"Panelist\"\n", - "permalink: /talks/2010-07-10-wikimania-academic-researchers\n", - "venue: \"Wikimania 2010\"\n", - "date: 2010-07-10\n", - "location: \"Gdansk, Poland\"\n", - "excerpt: \"A panel intended to foster a dialog between academic researchers who study Wikimedia projects and the Wikimedia community.\"\n", - "---\n", - "\n", - "Link to more information\n", - "\n", - "In the past few years, there has been an explosion of academic researchers who are turning their attention towards Wikimedia projects. Hundreds of scholars representing virtually every discipline in the humanities, the social sciences, and the computational sciences have entered various Wikimedia communities to answer a broad range of questions using an equally diverse set of methods. As both researchers and editors have learned, actively studying wiki communities raises a number of problems and concerns, from the practical and logistical to the ethical and legal. Despite years of informal dialog and negotiation between individual researchers and editors, multiple WikiProjects dedicated to wiki research, and an ongoing attempt to craft a formal research policy, many issues are still unresolved.\\n\\nThe purpose of this panel is to foster a dialog between wiki researchers and the Wikimedia community on many of these issues, which include: Are talk pages, listservs, and IRC channels public spaces, and should researchers have to identify themselves when they enter? What kinds of privacy issues exist with reporting an editor’s actions in an academic article? Under what conditions should social scientists be allowed to send unsolicited requests to random samples of users, a process which has been sometimes been considered spam and resulted in the banning of researchers’ accounts? Are controversial 'breaching experiments' (such as vandalism response tests) of value, and is there a way they can be performed in a satisfactory manner? Should there be a group or committee that would review and approve certain kinds of academic research activities, similar to the Bot Approval Group? These are some of the many questions that will be asked, and audience members will be encouraged to participate in this discussion.\n", - "\n", - "---\n", - "title: \"Trace Ethnography: Following Coordination through Documentary Practices\"\n", - "collection: talks\n", - "type: \"Conference proceedings talk\"\n", - "permalink: /talks/2011-01-03-hicss-trace-ethnography\n", - "venue: \"Hawaii International Conference on System Sciences\"\n", - "date: 2011-01-03\n", - "location: \"Lihue, Hawaii\"\n", - "excerpt: \"We detail the methodology of ‘trace ethnography’, which combines the richness of participant-observation with the wealth of data in logs so as to reconstruct patterns and practices of users in distributed sociotechnical systems\"\n", - "---\n", - "\n", - "Link to more information\n", - "\n", - "We detail the methodology of ‘trace ethnography’, which combines the richness of participant-observation with the wealth of data in logs so as to reconstruct patterns and practices of users in distributed sociotechnical systems\n", - "\n", - "---\n", - "title: \"Machine-Generated Content: Bots and the Governance of Wikipedia\"\n", - "collection: talks\n", - "type: \"Talk\"\n", - "permalink: /talks/2011-03-04-dml-bots-governance\n", - "venue: \"Digital Media and Learning (DML)\"\n", - "date: 2011-03-04\n", - "location: \"Long Beach, CA\"\n", - "---\n", - "\n", - "---\n", - "title: \"Participation in Wikipedia’s Article Deletion Processes (with Heather Ford)\"\n", - "collection: talks\n", - "type: \"Conference proceedings talk\"\n", - "permalink: /talks/2011-10-05-wikisym-article-deletion\n", - "venue: \"International Symposium on Wikis and Open Collaboration\"\n", - "date: 2011-10-05\n", - "location: \"Mountain View, CA\"\n", - "excerpt: \"This paper investigates Wikipedia's article deletion processes, finding that it is heavily populated by specialists.\"\n", - "---\n", - "\n", - "Link to more information\n", - "\n", - "We find that Wikipedia’s deletion process is heavily frequented by a relatively small number of longstanding users. The vast majority of such deleted articles are not spam, vandalism, or “patent nonsense,” but rather articles which could be considered encyclopedic, but do not fit the project‟s standards.\n", - "\n", - "---\n", - "title: \"’The Internet is Here’: The Virtuality of ‘On-line Communities in Physical Spaces\"\n", - "collection: talks\n", - "type: \"Talk\"\n", - "permalink: /talks/2011-11-02-4s-internet-is-here\n", - "venue: \"Annual Meeting of the Society for the Social Study of Science (4S)\"\n", - "date: 2011-11-02\n", - "location: \"Cleveland, OH\"\n", - "---\n", - "\n", - "---\n", - "title: \"User-Generated Platforms in Wikipedian Governance\"\n", - "collection: talks\n", - "type: \"Talk\"\n", - "permalink: /talks/2011-11-03-4s-wikipedian-governance\n", - "venue: \"Annual Meeting of the Society for the Social Study of Science (4S)\"\n", - "date: 2011-11-03\n", - "location: \"Cleveland, OH\"\n", - "---\n", - "\n", - "---\n", - "title: \"Improving Wikipedia’s Notifications to Rejected Contributors\"\n", - "collection: talks\n", - "type: \"Talk\"\n", - "permalink: /talks/2012-03-31-gcoe-wikipedia-notifications\n", - "venue: \"GCOE International Symposium on Informatics Education\"\n", - "date: 2012-03-31\n", - "location: \"Kyoto, Japan\"\n", - "---\n", - "\n", - "---\n", - "title: \"Black-boxing the user: internet protocol over xylophone players (IPoXP)\"\n", - "collection: talks\n", - "type: \"Conference proceedings talk\"\n", - "permalink: /talks/2012-05-02-altchi-ipoxp\n", - "venue: \"Conference on Human Factors in Computing (CHI)\"\n", - "date: 2012-05-02\n", - "location: \"Austin, Texas\"\n", - "excerpt: \"We introduce IP over Xylophone Players (IPoXP), a novel Internet protocol between two computers using xylophone-based Arduino interfaces\"\n", - "---\n", - "\n", - "Link to more information\n", - "\n", - "We introduce IP over Xylophone Players (IPoXP), a novel Internet protocol between two computers using xylophone-based Arduino interfaces. In our implementation, human operators are situated within the lowest layer of the network, transmitting data between computers by striking designated keys. We discuss how IPoXP inverts the traditional mode of human-computer interaction, with a computer using the human as an interface to communicate with another computer.\n", - "\n", - "---\n", - "title: \"Hunting for Fail Whales: Lessons from Deviance and Failure in Social Computing\"\n", - "collection: talks\n", - "type: \"Panelist\"\n", - "permalink: /talks/2012-05-07-chi-fail-whales\n", - "venue: \"Conference on Human Factors in Computing (CHI)\"\n", - "date: 2012-05-07\n", - "location: \"Austin, Texas\"\n", - "---\n", - "\n", - "---\n", - "title: \"Defense Mechanism or Socialization Tactic? Improving Wikipedia’s Notifications to Rejected Contributors\"\n", - "collection: talks\n", - "type: \"Conference proceedings talk\"\n", - "permalink: /talks/2012-06-05-icwsm-socialization-wikipedia\n", - "venue: \"International Conference on Weblogs and Social Media (ICWSM)\"\n", - "date: 2012-06-05\n", - "location: \"Dublin, Ireland\"\n", - "excerpt: \"A descriptive study of Wikipedia's highly-automated socialization processes and an A/B test to improve templated messages to newcomers.\"\n", - "---\n", - "\n", - "Link to more information\n", - "\n", - "In this paper, we first illustrate and describe the various defense mechanisms at work in Wikipedia, which we hypothesize are inhibiting newcomer retention. Next, we present results from an experiment aimed at increasing both the quantity and quality of editors by altering various elements of these defense mechanisms, specifically pre-scripted warnings and notifications that are sent to new editors upon reverting or rejecting contributions. Using regression models of new user activity, we show which tactics work best for different populations of users based on their motivations when joining Wikipedia.\n", - "\n", - "---\n", - "title: \"Trace literacy: a framework for holistically conceptualizing newcomer socialization in socio-technical systems\"\n", - "collection: talks\n", - "type: \"Talk\"\n", - "permalink: /talks/2012-10-12-infosocial-trace-literacy\n", - "venue: \"Infosocial\"\n", - "date: 2012-10-12\n", - "location: \"Evanston, IL\"\n", - "---\n", - "\n", - "---\n", - "title: \"Time to Degree: Examining the Experiences of Graduate Students in the Long-Term Ecological Research Network\"\n", - "collection: talks\n", - "type: \"Talk\"\n", - "permalink: /talks/2012-10-17-4s-time-to-degree\n", - "venue: \"Annual Meeting of the Society for the Social Study of Science (4S)\"\n", - "date: 2012-10-17\n", - "location: \"Copenhagen, Denmark\"\n", - "---\n", - "\n", - "---\n", - "title: \"What Aren’t We Measuring? Methods for Quantifying Wiki-Work.\"\n", - "collection: talks\n", - "type: \"Panelist\"\n", - "permalink: /talks/2012-10-29-wikisym-methods\n", - "venue: \"International Symposium on Wikis and Open Collaboration (WikiSym 2012)\"\n", - "date: 2012-10-29\n", - "location: \"Linz, Austria\"\n", - "---\n", - "\n", - "---\n", - "title: \"Actor-Network Theory\"\n", - "collection: talks\n", - "type: \"Guest lecture\"\n", - "permalink: /talks/2013-02-07-203-actor-network-theory\n", - "venue: \"Social Aspects of Information Systems course\"\n", - "date: 2013-02-07\n", - "location: \"Berkeley, CA\"\n", - "excerpt: \"An introduction to Actor Network Theory for students in the Masters of Information Management and Systems (MIMS) course\"\n", - "---\n", - "\n", - "---\n", - "title: \"Using Edit Sessions to Measure Participation in Wikipedia (with Aaron Halfaker)\"\n", - "collection: talks\n", - "type: \"Conference proceedings talk\"\n", - "permalink: /talks/2013-02-23-cscw-edit-sessions\n", - "venue: \"Conference on Computer Supported Cooperative Work\"\n", - "date: 2013-02-23\n", - "location: \"San Antonio, TX\"\n", - "excerpt: \"This paper establishes a quantitative metric for measuring editor activity through temporal edit sessions.\"\n", - "---\n", - "\n", - "Link to more information\n", - "\n", - "Many quantitative, log-based studies of participation and contribution in CSCW and CMC systems measure the activity of users in terms of output, based on metrics like posts to forums, edits to Wikipedia articles, or commits to code repositories. In this paper, we estimate the amount of time users have spent contributing. Through an analysis of Wikipedia log data, we identify a pattern of punctuated bursts in editors’ activity that we refer to as edit sessions. Based on these edit sessions, we build a metric that approximates the labor hours of editors in the encyclopedia. Using this metric, we first compare labor-based analyses with output-based analyses, finding that the activity of many editors can appear quite differently based on the kind of metric used.\n", - "\n", - "---\n", - "title: \"Community, Impact, and Credit: Where Do I Submit My Papers?\"\n", - "collection: talks\n", - "type: \"Panelist\"\n", - "permalink: /talks/2013-02-26-cscw-community-impact\n", - "venue: \"ACM Conference on Computer-Supported Cooperative Work (CSCW)\"\n", - "date: 2013-02-26\n", - "location: \"San Antonio, TX\"\n", - "---\n", - "\n", - "---\n", - "title: \"Values Where? Interrogating Client-Side Scripting as a Design Process\"\n", - "collection: talks\n", - "type: \"Talk\"\n", - "permalink: /talks/2013-03-01-ttw-values-where\n", - "venue: \"Theorizing the Web\"\n", - "date: 2013-03-01\n", - "location: \"New York, NY\"\n", - "---\n", - "\n", - "---\n", - "title: \"When the Levee Breaks: Without Bots, What Happens to Wikipedia’s Quality Control Processes? (with Aaron Halfaker)\"\n", - "collection: talks\n", - "type: \"Conference proceedings talk\"\n", - "permalink: /talks/2013-08-03-wikisym-levee-breaks-bots\n", - "venue: \"International Symposium on Wikis and Open Collaboration (WikiSym 2012)\"\n", - "date: 2013-08-03\n", - "location: \"Hong Kong\"\n", - "excerpt: \"This paper examines what happened when one of Wikipedia's counter-vandalism bots unexpectedly went offline.\"\n", - "---\n", - "\n", - "Link to more information\n", - "\n", - "In the first half of 2011, ClueBot NG – one of the most prolific counter-vandalism bots in the English-language Wikipedia – went down for four distinct periods, each period of downtime lasting from days to weeks. In this paper, we use these periods of breakdown as naturalistic experiments to study Wikipedia’s heterogeneous quality control network. Our analysis showed that the overall time-to-revert edits was almost doubled when this software agent was down. Yet while a significantly fewer proportion of edits made during the bot’s downtime were reverted, we found that those edits were later eventually reverted. This suggests that other agents in Wikipedia took over this quality control work, but performed it at a far slower rate.\n", - "\n", - "---\n", - "title: \"Hadoop as Grounded Theory: Is an STS Approach to Big Data Possible? the 2013 Annual Meeting of the Society for the Social Study of Science 4S\"\n", - "collection: talks\n", - "type: \"Talk\"\n", - "permalink: /talks/2013-10-09-ica-hadoop-grounded-theory\n", - "venue: \"Annual Meeting of the Society for the Social Study of Science (4S)\"\n", - "date: 2013-10-09\n", - "location: \"San Diego, CA\"\n", - "---\n", - "\n", - "---\n", - "title: \"Design by Bot: Power and Resistance in the Development of Automated Software Agents\"\n", - "collection: talks\n", - "type: \"Talk\"\n", - "permalink: /talks/2013-10-23-aoir-design-by-bot\n", - "venue: \"Annual Meeting of the Association of Internet Researchers (AoIR)\"\n", - "date: 2013-10-23\n", - "location: \"Denver, CO\"\n", - "---\n", - "\n", - "---\n", - "title: \"Size Matters: How Big Data Changes Everything\"\n", - "collection: talks\n", - "type: \"Talk\"\n", - "permalink: /talks/2013-11-25-bkk-data\n", - "venue: \"Bangkok Scientifique\"\n", - "date: 2013-11-25\n", - "location: \"Bangkok, Thailand\"\n", - "excerpt: \"A talk introducing various concepts around large-scale data analysis to a general audience, including spam detection and governmental survellance.\"\n", - "---\n", - "\n", - "Link to more information\n", - "\n", - "In the wake of all the rigmarole going on with the Snowden leaks and details of the NSA intelligence gathering apparatus, it's suddenly very clear to the average person just how much data is out there, and how difficult it must be to recognize, organize and filter it for a usable purpose. Even if we try to minimize our digital footprint, each of us nonetheless generates an incredible amount of data that represents US in the digital realm. To talk us through what systems are used to parse such vast quantities of data into a usable format, we are happy to welcome Stuart Geiger to this month's BkkSci. Stuart's current research is on the intersection of data science and artificial intelligence (AI) that is often branded as "Big Data." These systems collect massive, diverse, and complex data sets, and then use this data to teach computers how to identify patterns and make decisions. Stuart will talk about his work both in building these automated agents to support the production of knowledge, and in studying how these systems are changing how scientists, governments, businesses, and ordinary people like you and me come know the world. \n", - "\n", - "---\n", - "title: \"Robotic Ethics and Opportunities\"\n", - "collection: talks\n", - "type: \"Panelist\"\n", - "permalink: /talks/2014-04-04-robots-new-media\n", - "venue: \"Robots and New Media\"\n", - "date: 2014-04-04\n", - "location: \"Berkeley, CA\"\n", - "excerpt: \"A panel discussing the ethical and political issues that are raised with autonomous robots and software bots.\"\n", - "---\n", - "\n", - "Link to more information\n", - "\n", - "---\n", - "title: \"Governing the Commons\"\n", - "collection: talks\n", - "type: \"Guest lecture\"\n", - "permalink: /talks/2014-04-10-hofi-governing-commons\n", - "venue: \"History of Information\"\n", - "date: 2014-04-10\n", - "location: \"Berkeley, CA\"\n", - "excerpt: \"A lecture on the history of Wikipedia, in the broader context of the history of reference works.\"\n", - "---\n", - "\n", - "---\n", - "title: \"Successor Systems: Enacting Ideological Critique Through the Development of Software\"\n", - "collection: talks\n", - "type: \"Talk\"\n", - "permalink: /talks/2014-04-25-ttw-successor-systems\n", - "venue: \"Theorizing the Web\"\n", - "date: 2014-04-25\n", - "location: \"Brooklyn, New York\"\n", - "---\n", - "\n", - "Link to more information\n", - "\n", - "---\n", - "title: \"Successor Systems: The Role of Reflexive Algorithms in Enacting Ideological Critique\"\n", - "collection: talks\n", - "type: \"Talk\"\n", - "permalink: /talks/2014-05-16-algolife-successor-systems\n", - "venue: \"The Contours of Algorithmic Life\"\n", - "date: 2014-05-16\n", - "location: \"Davis, CA\"\n", - "---\n", - "\n", - "---\n", - "title: \"Data­-Driven Data Research Using Data and Databases: A Practical Critique of Methods and Approaches in “Big Data” Studies\"\n", - "collection: talks\n", - "type: \"Panelist\"\n", - "permalink: /talks/2014-05-23-ica-data-driven-data\n", - "venue: \"Annual Meeting of the International Communication Association (ICA)\"\n", - "date: 2014-05-23\n", - "location: \"Seattle, WA\"\n", - "excerpt: \"This panel focuses on the challenges faced by researchers conducting mixed-method research into online platforms, particularly where large amounts of data are widely available.\"\n", - "---\n", - "\n", - "In the past two years, the buzzword "big data" has provoked critiques by a number of social scientists (eg., boyd & Crawford 2011; Bruns & Burgess 2012; Burrell 2012; Baym 2013) on the theories, methodologies, and analysis of large data sources. This panel follows up from last year’s ICA 2013 panel, “Downsizing Data: Analyzing Social Digital Traces,” and draws from the experiential grounded approach of Hargittai’s (2009) “Research Confidential” to bring to light practical critiques of the application of digital data research methods in the study of social media platforms. Namely, the panelists will explore how social scientists can shift away from the rhetoric surrounding “big data” and robustly analyze the use of large-scale, computationally-driven, mixed-methods approaches in digital data research. Again, this panel will not discredit large-scale data approaches; instead, we aim to provide context to researchers who wish to employ them in combination with established methods in the field. The panel brings together five scholars to speak about their successes and failures working on projects that employ large-scale digital data methods and tools, regardless of the size of the data, in addition to their iterative approaches dealing with the practicalities of data collection, sampling, theory, analysis, and especially results. Notably, these projects are not purely quantitative, analytical studies employing large datasets: all participants use largescale data and computational approaches within the context of empirical mixed-methods or even (traditionally) qualitative, interpretive studies. The panelists will also discuss the critical approaches to “big data” that inhabit each project. These projects are all exemplars of an emerging mode of scholarship, and collectively they aim to generate a productive and concrete discussion about methodology and epistemology. After a framed introduction by the moderator, participants will spend 10 minutes each to speak in detail about the methodologies of their projects, after which the latter half of the panel will open to discussion with the audience. This panel also will be paired with a Blue Sky Workshop, provocatively entitled “‘Big Data is Bullshit’: Scoping the Next 5 Years of Digital Data Research.” We aim to use the panel as an expert-driven, experiential methods presentation as well as a launchpad for topics and debates that can be further explored in the workshop session (which will occur at some point following the panel).\n", - "\n", - "---\n", - "title: \"Big Data is Bullshit': Scoping the Next 5 Years of Digital Data Research\"\n", - "collection: talks\n", - "type: \"Panelist\"\n", - "permalink: /talks/2014-05-24-ica-big-data-bullshit\n", - "venue: \"Annual Meeting of the International Communication Association (ICA)\"\n", - "date: 2014-05-24\n", - "location: \"Seattle, WA\"\n", - "---\n", - "\n", - "---\n", - "title: \"Successor Systems: The Role of Reflexive Algorithms in Enacting Ideological Critique\"\n", - "collection: talks\n", - "type: \"Talk\"\n", - "permalink: /talks/2014-08-23-4s-successor-systems\n", - "venue: \"Annual Meeting of the Society for the Social Study of Science (4S)\"\n", - "date: 2014-08-23\n", - "location: \"Buenos Aires, Argentina\"\n", - "---\n", - "\n", - "---\n", - "title: \"Successor Systems: The Role of Reflexive Algorithms in Enacting Ideological Critique\"\n", - "collection: talks\n", - "type: \"Talk\"\n", - "permalink: /talks/2014-10-21-aoir-successor-systems\n", - "venue: \"Annual Meeting of the Association of Internet Researchers (AoIR)\"\n", - "date: 2014-10-21\n", - "location: \"Daegu, South Korea\"\n", - "---\n", - "\n", - "---\n", - "title: \"Defining, Designing, and Evaluating Civic Values in Human Computation and Collective Action Systems (with Nathan Matias)\"\n", - "collection: talks\n", - "type: \"Talk\"\n", - "permalink: /talks/2014-11-02-hcomp-values-in-crowdsourcing\n", - "venue: \"Human Computation Conference (HCOMP), Citizen-X Workshop\"\n", - "date: 2014-11-02\n", - "location: \"Pittsburgh, PA\"\n", - "excerpt: \"We review various crowdsourcing and collective action systems, identifying particular sets of civic values and assumptions.\"\n", - "---\n", - "\n", - "Link to more information\n", - "\n", - "Collective action is often described in terms of the relationships, learning, principled processes, and community capacities it fosters. Despite this, human computation and collective action systems are often designed and evaluated with system outputs in mind: the quality of answers, the number of votes, the accuracy of content created. In this proposal, we review literature on the design values of “citizen-x” systems, put forward a series of models for describing the civic values in “citizen-x”, and classify systems by those models.\n", - "\n", - "---\n", - "title: \"Supporting Change from Outside Systems with Design and Data\"\n", - "collection: talks\n", - "type: \"Talk\"\n", - "permalink: /talks/2014-12-09-berkman-successor-systems\n", - "venue: \"Berkman Center for Internet and Society\"\n", - "date: 2014-12-09\n", - "location: \"Cambridge, MA\"\n", - "---\n", - "\n", - "---\n", - "title: \"Situated knowledges and successor systems: developing CSCW systems to enact ideological critiques\"\n", - "collection: talks\n", - "type: \"Workshop presentation\"\n", - "permalink: /talks/2015-03-15-cscw-feminism-workshop\n", - "venue: \"CSCW Workshop on Feminism and Feminist Approaches in Social Computing\"\n", - "date: 2015-03-15\n", - "location: \"Vancouver, BC\"\n", - "---\n", - "\n", - "---\n", - "title: \"Does Facebook Have Civil Servants? On Governmentality and Computational Social Science\"\n", - "collection: talks\n", - "type: \"Workshop presentation\"\n", - "permalink: /talks/2015-03-15-cscw-ethics-workshop\n", - "venue: \"CSCW Workshop on Ethics for Studying Sociotechnical Systems in a Big Data World\"\n", - "date: 2015-03-15\n", - "location: \"Vancouver, BC\"\n", - "---\n", - "\n", - "---\n", - "title: \"Trace Ethnography Workshop\"\n", - "collection: talks\n", - "type: \"Workshop presentation\"\n", - "permalink: /talks/2015-03-24-iconf-trace-ethno\n", - "venue: \"ISchools Conference\"\n", - "date: 2015-03-24\n", - "location: \"Newport Beach, CA\"\n", - "---\n", - "\n", - "Link to more information\n", - "\n", - "---\n", - "title: \"Moderating Online Conversation Spaces\"\n", - "collection: talks\n", - "type: \"Guest lecture\"\n", - "permalink: /talks/2015-04-07-203-moderating-online-spaces\n", - "venue: \"Social Aspects of Information Systems course\"\n", - "date: 2015-04-07\n", - "location: \"Berkeley, CA\"\n", - "excerpt: \"An overview of how various online platforms moderate content, discussing issues that link up to the theories discussed in the Social Aspects of Information Systems class.\"\n", - "---\n", - "\n", - "---\n", - "title: \"Peer Production and Wikipedia\"\n", - "collection: talks\n", - "type: \"Guest lecture\"\n", - "permalink: /talks/2015-04-09-203-wikipedia\n", - "venue: \"Social Aspects of Information Systems course\"\n", - "date: 2015-04-09\n", - "location: \"Berkeley, CA\"\n", - "excerpt: \"An overview of Wikipedia and other peer production platforms, discussing issues that link up to the theories discussed in the Social Aspects of Information Systems class.\"\n", - "---\n", - "\n", - "---\n", - "title: \"But it Wouldn’t Be an Encyclopedia; It Would Be a Wiki: Wikipedia and the Repurposing of WikiWikiWeb\"\n", - "collection: talks\n", - "type: \"Talk\"\n", - "permalink: /talks/2015-05-25-ica-wiki-history\n", - "venue: \"Annual Meeting of the International Communication Association (ICA)\"\n", - "date: 2015-05-25\n", - "location: \"San Juan, Puerto Rico\"\n", - "excerpt: \"In this talk, I examine the early history of “anyone can edit” wiki software -- originally developed in 1995, six years before Wikipedia’s origin -- focusing on the ways in which this technological infrastructure has been repurposed across communities, domains, and scales.\"\n", - "---\n", - "\n", - "In this talk, I examine the early history of “anyone can edit” wiki software -- originally developed in 1995, six years before Wikipedia’s origin -- focusing on the ways in which this technological infrastructure has been repurposed across communities, domains, and scales. While today, the idea of a wiki is associated with large-scale, massively-distributed encyclopedic knowledge production, this was not always the case. As I show, many of the assumptions and practices in pre-Wikipedia wiki communities contradicted the idea of a universal repository to document the sum total of human knowledge. In fact, the title of this presentation comes from a conversation between Wikipedia’s co-founder Jimmy Wales and Ward Cunningham, the creator of the first wiki, who advised Wales that the goals of creating a general-purpose encyclopedia and a wiki might be inherently contradictory. As Wales, Sanger, and other early Wikipedians used Cunningham’s wiki software to produce a collective encyclopedia, they found themselves constantly modifying the wiki platform, incorporating features and affordances that supported the kind of encyclopedic knowledge production they found themselves engaged in. Many of these novel features -- such as a persistent history of edits to articles, separate discussion pages for individual articles, and citations/references -- are now taken for granted aspects of what it means for a wiki to be a wiki. Yet at the time, their existence was far more controversial and precarious. Using archival and software studies methods, I illustrate several ways in which wiki software was adapted for the specific purposes and practices of Wikipedians, departing substantially from the pre-Wikipedia understandings of what wiki-based collaboration is and ought to be. Beyond Wikipedia, this case shows how technological infrastructures intersect with particular configurations of communities, epistemologies, and ideologies. Focusing on how one particular infrastructure was re-used and repurposed for a rather different set of values gives us a useful case for problematizing technologically determinist narratives around media technology and society.\n", - "\n", - "---\n", - "title: \"Bot-Based Collective Blocklists in Twitter: The Counterpublic Moderation of a Privately-Owned Networked Public Space\"\n", - "collection: talks\n", - "type: \"Talk\"\n", - "permalink: /talks/2015-10-23-aoir-blockbots\n", - "venue: \"Annual Meeting of the Association of Internet Researchers (AoIR)\"\n", - "date: 2015-10-23\n", - "location: \"Phoenix, AZ\"\n", - "excerpt: \"This presentation introduces bot-based collective blocklists (or blockbots) in Twitter, which have been created to help various groups better moderate their own experiences on the site.\"\n", - "---\n", - "\n", - "---\n", - "title: \"Crowdsourcing: Theoretical Considerations\"\n", - "collection: talks\n", - "type: \"Panelist\"\n", - "permalink: /talks/2015-11-06-crowdsourcing-academy\n", - "venue: \"Crowdsourcing and the Academy Symposium\"\n", - "date: 2015-11-06\n", - "location: \"Berkeley, CA\"\n", - "excerpt: \"A panel discussing how academics use crowdsourcing in research.\"\n", - "---\n", - "\n", - "Link to more information\n", - "\n", - "---\n", - "title: \"The Bot Multiple: Unpacking the Materialities of Automated Software Agents\"\n", - "collection: talks\n", - "type: \"Talk\"\n", - "permalink: /talks/2015-11-12-4s-bot-multiple\n", - "venue: \"Annual Meeting of the Society for the Social Study of Science (4S)\"\n", - "date: 2015-11-12\n", - "location: \"Denver, CO\"\n", - "excerpt: \"I examine the roles that automated software agents (or bots) play in the governance and moderation of Wikipedia, Twitter, and reddit – three online platforms that differently uphold a related set of commitments to ‘open’ and ‘public’ online participation.\"\n", - "---\n", - "\n", - "This paper examines the roles that automated software agents (or bots) play in the governance and moderation of Wikipedia, Twitter, and reddit – three online platforms that differently uphold a related set of commitments to ‘open’ and ‘public’ online participation. While bots are often discussed as malicious or fake agents (e.g. ‘socialbots’), the bots I discuss in these three platforms are more or less legitimate social actors, delegated substantial authority in autonomously enforcing norms and policies. These bots extend and modify the functionality of sites like Wikipedia, Twitter, and reddit, and are generally developed and deployed by volunteers on their own time – continuously operated on computers that are independent from the servers hosting the site. These governance bots involve alternative relations of power and code, requiring that we go beyond studying software code in order to unpack the sociomaterial configurations at work in such digitally-architected spaces. Instead of taking for granted the pre-existing stability of these sites as unified platforms, bots require that we examine the concrete, historically contingent material conditions under which this code is run. Reporting from a multi-sited ethnography of infrastructure, I demonstrate several ways in which bot development comes on the scene in relation to broader assemblages of server farms, platform code, federated databases, code repositories, issue trackers, application programming interfaces, terms of service, mailing lists, counterpublic groups, and a variety of other entities. I argue that bots give us a compelling set of cases for exploring the multiple materialities at work in highly-distributed online spaces.\n", - "\n", - "---\n", - "title: \"Why bots are my favorite contribution to Wikipedia\"\n", - "collection: talks\n", - "type: \"Talk\"\n", - "permalink: /talks/2016-01-16-wiki15-bots\n", - "venue: \"Wikipedia 15th Anniversary Birthday Bash\"\n", - "date: 2016-01-16\n", - "location: \"San Francisco, CA\"\n", - "excerpt: \"A short talk to open up an event celebrating the 15th anniversary of Wikipedia. The prompt we were given was "Why [x] is my favorite contribution to Wikipedia."\"\n", - "---\n", - "\n", - "A short talk to open up an event celebrating the 15th anniversary of Wikipedia. The prompt we were given was "Why [x] is my favorite contribution to Wikipedia." Here is a photo of me handing off the mic: <br><img src="http://funcrunch.org/blog/wp-content/uploads/2016/01/funcrunch-20160116-0757.jpg>\n", - "\n", - "---\n", - "title: \"Scraping Wikipedia Data\"\n", - "collection: talks\n", - "type: \"Talk\"\n", - "permalink: /talks/2016-02-17-thw-scraping-wikipedia\n", - "venue: \"The Hacker Within, BIDS\"\n", - "date: 2016-02-17\n", - "location: \"Berkeley, CA\"\n", - "excerpt: \"A tutorial (with Jupyter notebooks) about how to use APIs to query structured data from Wikipedia articles and the Wikidata project.\"\n", - "---\n", - "\n", - "Link to more information\n", - "\n", - "A tutorial (with Jupyter notebooks) about how to use APIs to query structured data from Wikipedia articles and the Wikidata project.\n", - "\n", - "---\n", - "title: \"“What the hack?” Hacking culture and discourse in data science pedagogy (with Brittany Fiore-Gartland)\"\n", - "collection: talks\n", - "type: \"Talk\"\n", - "permalink: /talks/2016-04-15-ttw-what-the-hack\n", - "venue: \"Theorizing the Web\"\n", - "date: 2016-04-15\n", - "location: \"Astoria, New York\"\n", - "---\n", - "\n", - "Link to more information\n", - "\n", - "---\n", - "title: \"Moderating harassment in Twitter with blockbots: a counterpublic and algorithmic strategy\"\n", - "collection: talks\n", - "type: \"Talk\"\n", - "permalink: /talks/2016-04-16-ttw-blockbots\n", - "venue: \"Theorizing the Web\"\n", - "date: 2016-04-16\n", - "location: \"Astoria, New York\"\n", - "---\n", - "\n", - "Link to more information\n", - "\n", - "---\n", - "title: \"Algorithms as agents of gatekeeping, governance, and articulation work in Wikipedia\"\n", - "collection: talks\n", - "type: \"Talk\"\n", - "permalink: /talks/2016-06-08-ica-algorithms-gatekeeping\n", - "venue: \"Algorithms, Automation, and Politics workshop\"\n", - "date: 2016-06-08\n", - "location: \"Fukuoka, Japan\"\n", - "excerpt: \"I discuss how algorithmic systems are deployed to enforce particular behavioral and epistemological standards in Wikipedia, which can become a site for collective sensemaking among veteran Wikipedians.\"\n", - "---\n", - "\n", - "This talk is based on a multi-year ethnographic study of algorithmic software agents in Wikipedia, where bots and automated tools have fundamentally transformed the nature of the notoriously decentralized, ‘anyone can edit’ encyclopedia project. I studied how the development and operation of automated software agents intersected with the development of organizational structures and epistemic norms. My ethnography of infrastructure (Star, 1999) involved participant-observation in various spaces of Wikipedia. I discuss how algorithmic systems are deployed to enforce particular behavioral and epistemological standards in Wikipedia, which can become a site for collective sensemaking among veteran Wikipedians.\n", - "\n", - "---\n", - "title: \"Successor Systems: Lessons for Big Data From Feminist Epistemology and Activism\"\n", - "collection: talks\n", - "type: \"Talk\"\n", - "permalink: /talks/2016-06-09-ica-successor-systems\n", - "venue: \"Big Data: Critiques and Alternatives workshop\"\n", - "date: 2016-06-09\n", - "location: \"Fukuoka, Japan\"\n", - "excerpt: \"I discuss four data-intensive activist projects as "successor systems," discussing the political and epistemological implications of using data to advance activist projects.\"\n", - "---\n", - "\n", - "The concept of successor systems extends Harding (1987) and Haraway’s (1988) call for feminist “successor sciences” -- ways of knowing that critically blend objectivity with situatedness -- to the field of “Big Data.” I argue that successor systems involve a different form of data-intensive knowledge production, in which counterpublic collectives (Fraser, 1990) reflectively deploy algorithmic routines to build “a better account of the world” (Haraway, 579). I discuss four data-intensive activist projects as successor systems, discussing political and epistemological implications of such tactics. These successor systems have much to teach scholars and practitioners of “Big Data,” giving concrete and theoretical alternatives to the more dominant practices in academia and industry.\n", - "\n", - "---\n", - "title: \"Drowning in Data: Industry and Academic Approaches to Mixed Methods in “Holistic” Big Data Studies\"\n", - "collection: talks\n", - "type: \"Panelist\"\n", - "permalink: /talks/2016-06-11-ica-drowning-in-data\n", - "venue: \"Annual Meeting of the International Communication Association (ICA)\"\n", - "date: 2016-06-11\n", - "location: \"Fukuoka, Japan\"\n", - "excerpt: \"This panel extends discusses the potentials and complications of mixed-methods research in big data studies, specifically in cases when population-level data is available.\"\n", - "---\n", - "\n", - "In the past five years, as “big data” research increasingly has been adopted and adapted in the social sciences, the question of multimodal analysis pays a larger role in approaches and perspectives of research methodology. The buzzword "big data" has provoked critiques by a number of social scientists (eg., boyd & Crawford 2011; Bruns & Burgess 2012; Burrell 2012; Baym 2013; Lazer, et al. 2014; Tufekci 2014) on the theories, methodologies, and analysis of large data sources, and yet a growing number of scholars are experimenting with new ways to think about applying traditional and established methods to a newer domain and scale of data. Past panels (e.g., ICA 2013’s “Downsizing Data: Analyzing Social Digital Traces” and ICA 2014’s “Data-Driven Data Research Using Data and Databases: A Practical Critique of Methods and Approaches in ‘Big Data’ Studies”) have examined the practice of large-scale data analysis in social media research. This panel extends those discussions to look at the complications of mixed-methods research in big data studies, specifically in cases when “holistic,” population-level data is available.\n", - "\n", - "---\n", - "title: \"Administrative Support Bots in Wikipedia: How Automation Can Transform the Affordances of Platforms and the Governance of Communities\"\n", - "collection: talks\n", - "type: \"Talk\"\n", - "permalink: /talks/2016-06-14-ica-communicating-with-machines\n", - "venue: \"Communicating with Machines workshop\"\n", - "date: 2016-06-14\n", - "location: \"Fukuoka, Japan\"\n", - "excerpt: \"I discuss cases from a multi-year ethnographic study of automated software agents in Wikipedia, where ‘bots’ have fundamentally transformed the nature of the ‘anyone can edit’ encyclopedia project.\"\n", - "---\n", - "\n", - "I discuss cases from a multi-year ethnographic study of automated software agents in Wikipedia, where ‘bots’ have fundamentally transformed the nature of the ‘anyone can edit’ encyclopedia project. Bots and bot developers have long been a core part of the Wikipedian community, and I studied how the development and operation of automated software agents intersected with the development of organizational and epistemic norms. My ethnographic project involved participant-observation in various spaces of Wikipedia: both routine editorial activity in Wikipedia (which is assisted through bots) and specific work in bot development, including proposing, developing, and operating a bot of my own. I also conducted extensive historical analysis of the history of Wikipedia, including case studies of bots throughout Wikipedia’s 15 year history. \n", - "\n", - "---\n", - "title: \"Governing Open Source Projects at Scale: Lessons from Wikipedia's Growing Pains\"\n", - "collection: talks\n", - "type: \"Talk\"\n", - "permalink: /talks/2016-07-16-scipy-governing-scale\n", - "venue: \"SciPy\"\n", - "date: 2016-07-16\n", - "location: \"Austin, Texas\"\n", - "excerpt: \"Many open source, volunteer-driven projects begin with a small, tight-knit group of collaborators, but then rapidly expand far faster than anyone expects or plans for. I discuss cases of governance growing pains in Wikipedia, which have many lessons for running open source software projects.\"\n", - "---\n", - "\n", - "Many open source, volunteer-driven projects begin with a small, tight-knit group of collaborators, but then rapidly expand far faster than anyone expects or plans for. I discuss cases of governance growing pains in Wikipedia, which have many lessons for running open source software projects.\n", - "\n", - "---\n", - "title: \"Community Sustainability in Wikipedia: A Review of Research and Initiatives\"\n", - "collection: talks\n", - "type: \"Talk\"\n", - "permalink: /talks/2016-08-13-pydata-community-sustainability\n", - "venue: \"PyData SF\"\n", - "date: 2016-08-13\n", - "location: \"San Francisco, CA\"\n", - "excerpt: \"Wikipedia relies on one of the world’s largest open collaboration communities. Since 2001, the community has grown substantially and faced many challenges. This presentation reviews research and initiatives around community sustainability in Wikipedia that are relevant for many open source projects, including issues of newcomer retention, governance, automated moderation, and marginalized groups.\"\n", - "---\n", - "\n", - "Wikipedia relies on one of the world’s largest open collaboration communities. Since 2001, the community has grown substantially and faced many challenges. This presentation reviews research and initiatives around community sustainability in Wikipedia that are relevant for many open source projects, including issues of newcomer retention, governance, automated moderation, and marginalized groups.\n", - "\n", - "---\n", - "title: \"“The Wisdom of Bots:” An ethnographic study of the delegation of governance work to information infrastructures in Wikipedia\"\n", - "collection: talks\n", - "type: \"Talk\"\n", - "permalink: /talks/2016-09-02-4s-wisdom-of-bots\n", - "venue: \"Annual Meeting of the Society for the Social Study of Science (4S)\"\n", - "date: 2016-09-02\n", - "location: \"Barcelona, Spain\"\n", - "excerpt: \"Wikipedians rely on software agents to govern the ‘anyone can edit’ encyclopedia project, in the absence of more formal and traditional organizational structures. Lessons from Wikipedia’s bots speak to debates about how algorithms are being delegated governance work in sites of cultural production.\"\n", - "---\n", - "\n", - "Link to more information\n", - "\n", - "I present findings from a multi-year ethnographic study of automated software agents in Wikipedia. "Bots" and bot developers are a core part of the volunteer community that curates one of the world's largest and most popular information resources. The Wikipedian community relies on hundreds of independently run bots to monitor and regulate almost all aspects of the site. Bots are delegated a wide variety of organizational and administrative work, including: patrolling for spam, 'vandalism', and 'edit wars'; standardizing grammar, layout, citations, and units; updating articles using public datasets; and identifying more complicated work and distributing those tasks to humans. In my infrastructural inversion (Bowker & Star 1999), I argue Wikipedia can only appear to be governed by an economistic "wisdom of crowds" if the work delegated to bots remains invisible. These bots have long been a core way in which Wikipedians govern the 'anyone can edit' project in the absence of more formal organizational structures. Wikipedians also work out fundamental disagreements about what the encyclopedia and the community ought to look like by, in part, debating about how bots ought to be delegated governance work. For example, one of the more consistently raised (and rejected) proposals on the English Wikipedia is a bot that would make all articles conform to a single national variety of English. Lessons from Wikipedia's bots speak to many debates about how algorithmic agents are being incorporated into sites of cultural production, drawing our focus to the governance work that is delegated to automated information infrastructures.\n", - "\n" - ] - } - ], + "outputs": [], "source": [ "loc_dict = {}\n", - "count = 0\n", - "base_url = \"staeiou.github.io\"\n", "\n", "for row, item in talks.iterrows():\n", - " md_filename = str(item.date) + \"-\" + item.slug + \".md\"\n", - " html_filename = str(item.date) + \"-\" + item.slug \n", - " year = item.date[:4]\n", - " count += 1\n", " \n", + " md_filename = str(item.date) + \"-\" + item.url_slug + \".md\"\n", + " html_filename = str(item.date) + \"-\" + item.url_slug \n", + " year = item.date[:4]\n", " \n", " md = \"---\\ntitle: \\\"\" + item.title + '\"\\n'\n", " md += \"collection: talks\" + \"\\n\"\n", - " if item.talk_type is not None:\n", - " md += 'type: \"' + item.talk_type + '\"\\n'\n", + " \n", + " if len(str(item.type)) > 3:\n", + " md += 'type: \"' + item.type + '\"\\n'\n", + " else:\n", + " md += 'type: \"Talk\"\\n'\n", " \n", " md += \"permalink: /talks/\" + html_filename + \"\\n\"\n", " \n", - " if item.venue is not None:\n", + " if len(str(item.venue)) > 3:\n", " md += 'venue: \"' + item.venue + '\"\\n'\n", " \n", - " if item.date is not None:\n", + " if len(str(item.location)) > 3:\n", " md += \"date: \" + str(item.date) + \"\\n\"\n", " \n", - " if item.geoloc is not None:\n", - " md += 'location: \"' + str(item.geoloc) + '\"\\n'\n", - " \n", - " \n", - " if len(str(item.summary))>10:\n", - " md += 'excerpt: \"'\n", - " md += html_escape(item.summary) + '\"\\n'\n", - " elif len(str(item.description))>10:\n", - " if len(str(item.description))>200:\n", - " md += 'excerpt: \"'\n", - " md += html_escape(item.description[:200])\n", - " md += '...\"\\n'\n", - " else:\n", - " md += 'excerpt: \"'\n", - " md += html_escape(item.description) + '\"\\n'\n", - " \n", + " if len(str(item.location)) > 3:\n", + " md += 'location: \"' + str(item.location) + '\"\\n'\n", + " \n", " md += \"---\\n\"\n", " \n", - " if isinstance(item.url, str):\n", - " md += \"\\nLink to more information\\n\" \n", " \n", - " if len(str(item.description))>10:\n", + " if len(str(item.talk_url)) > 3:\n", + " md += \"\\n[More information here](\" + item.talk_url + \")\\n\" \n", + " \n", + " \n", + " if len(str(item.description)) > 3:\n", " md += \"\\n\" + html_escape(item.description) + \"\\n\"\n", + " \n", + " \n", " md_filename = os.path.basename(md_filename)\n", - " print(md)\n", + " #print(md)\n", " \n", - " with open(\"_talks/\" + md_filename, 'w') as f:\n", + " with open(\"../_talks/\" + md_filename, 'w') as f:\n", " f.write(md)" ] }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "These files are in the talks directory, one directory below where we're working from." + ] + }, { "cell_type": "code", - "execution_count": 8, + "execution_count": 6, "metadata": { - "collapsed": false + "collapsed": false, + "deletable": true, + "editable": true }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ - "2008-03-01-tamu-communicative-ethnography.md\r\n", - "2008-07-19-wikimania-academics-wikipedia.md\r\n", - "2009-03-28-aaas-wikipedia-infrastructures.md\r\n", - "2009-04-25-mit6-media-use-wikipedia.md\r\n", - "2009-07-26-wikiconf-nyc-algorithmc-governance.md\r\n", - "2009-09-25-nyu-trace-ethnography.md\r\n", - "2009-10-27-wikisym-social-roles-bots.md\r\n", - "2009-10-28-4s-missing-wikipedians.md\r\n", - "2010-01-10-cpov-wisdom-of-bots.md\r\n", - "2010-02-25-cscw-banning-vandal.md\r\n", - "2010-03-26-cpov-bot-politics.md\r\n", - "2010-07-10-wikimania-academic-researchers.md\r\n", - "2011-01-03-hicss-trace-ethnography.md\r\n", - "2011-03-04-dml-bots-governance.md\r\n", - "2011-10-05-wikisym-article-deletion.md\r\n", - "2011-11-02-4s-internet-is-here.md\r\n", - "2011-11-03-4s-wikipedian-governance.md\r\n", - "2012-03-31-gcoe-wikipedia-notifications.md\r\n", - "2012-05-02-altchi-ipoxp.md\r\n", - "2012-05-07-chi-fail-whales.md\r\n", - "2012-06-05-icwsm-socialization-wikipedia.md\r\n", - "2012-10-12-infosocial-trace-literacy.md\r\n", - "2012-10-17-4s-time-to-degree.md\r\n", - "2012-10-29-wikisym-methods.md\r\n", - "2013-02-07-203-actor-network-theory.md\r\n", - "2013-02-23-cscw-edit-sessions.md\r\n", - "2013-02-26-cscw-community-impact.md\r\n", - "2013-03-01-ttw-values-where.md\r\n", - "2013-08-03-wikisym-levee-breaks-bots.md\r\n", - "2013-10-09-ica-hadoop-grounded-theory.md\r\n", - "2013-10-23-aoir-design-by-bot.md\r\n", - "2013-11-25-bkk-data.md\r\n", - "2014-04-04-robots-new-media.md\r\n", - "2014-04-10-hofi-governing-commons.md\r\n", - "2014-04-25-ttw-successor-systems.md\r\n", - "2014-05-16-algolife-successor-systems.md\r\n", - "2014-05-23-ica-data-driven-data.md\r\n", - "2014-05-24-ica-big-data-bullshit.md\r\n", - "2014-08-23-4s-successor-systems.md\r\n", - "2014-10-21-aoir-successor-systems.md\r\n", - "2014-11-02-hcomp-values-in-crowdsourcing.md\r\n", - "2014-12-09-berkman-successor-systems.md\r\n", - "2015-03-15-cscw-ethics-workshop.md\r\n", - "2015-03-15-cscw-feminism-workshop.md\r\n", - "2015-03-24-iconf-trace-ethno.md\r\n", - "2015-04-07-203-moderating-online-spaces.md\r\n", - "2015-04-09-203-wikipedia.md\r\n", - "2015-05-25-ica-wiki-history.md\r\n", - "2015-10-23-aoir-blockbots.md\r\n", - "2015-11-06-crowdsourcing-academy.md\r\n", - "2015-11-12-4s-bot-multiple.md\r\n", - "2016-01-16-wiki15-bots.md\r\n", - "2016-02-17-thw-scraping-wikipedia.md\r\n", - "2016-04-15-ttw-what-the-hack.md\r\n", - "2016-04-16-ttw-blockbots.md\r\n", - "2016-06-08-ica-algorithms-gatekeeping.md\r\n", - "2016-06-09-ica-successor-systems.md\r\n", - "2016-06-11-ica-drowning-in-data.md\r\n", - "2016-06-14-ica-communicating-with-machines.md\r\n", - "2016-07-16-scipy-governing-scale.md\r\n", - "2016-08-13-pydata-community-sustainability.md\r\n", - "2016-09-02-4s-wisdom-of-bots.md\r\n" + "2012-03-01-talk-1.md\t 2014-02-01-talk-2.md\r\n", + "2013-03-01-tutorial-1.md 2014-03-01-talk-3.md\r\n" ] } ], "source": [ - "!ls _talks" + "!ls ../_talks" ] }, { "cell_type": "code", "execution_count": 7, "metadata": { - "collapsed": false + "collapsed": false, + "deletable": true, + "editable": true }, "outputs": [ { @@ -2375,29 +336,24 @@ "output_type": "stream", "text": [ "---\r\n", - "title: \"A Communicative Ethnography of Argumentative Strategies in a Wikipedian Content Dispute\"\r\n", + "title: \"Tutorial 1 on Relevant Topic in Your Field\"\r\n", "collection: talks\r\n", - "type: \"Talk\"\r\n", - "permalink: /talks/2008-03-01-tamu-communicative-ethnography\r\n", - "venue: \"Exploring New Media Worlds\"\r\n", - "date: 2008-03-01\r\n", - "location: \"College Station, TX\"\r\n", - "---\r\n" + "type: \"Tutorial\"\r\n", + "permalink: /talks/2013-03-01-tutorial-1\r\n", + "venue: \"UC-Berkeley Institute for Testing Science\"\r\n", + "date: 2013-03-01\r\n", + "location: \"Berkeley CA, USA\"\r\n", + "---\r\n", + "\r\n", + "[More information here](http://exampleurl.com)\r\n", + "\r\n", + "This is a description of your tutorial, note the different field in type. This is a markdown files that can be all markdown-ified like any other post. Yay markdown!\r\n" ] } ], "source": [ - "!head _talks/2008-03-01-tamu-communicative-ethnography.md" + "!cat ../_talks/2013-03-01-tutorial-1.md" ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": { - "collapsed": true - }, - "outputs": [], - "source": [] } ], "metadata": { @@ -2416,7 +372,7 @@ "name": "python", "nbconvert_exporter": "python", "pygments_lexer": "ipython3", - "version": "3.4.2" + "version": "3.6.1" } }, "nbformat": 4, diff --git a/markdown_generator/talks.py b/markdown_generator/talks.py new file mode 100644 index 0000000000000..818fd28dae125 --- /dev/null +++ b/markdown_generator/talks.py @@ -0,0 +1,111 @@ + +# coding: utf-8 + +# # Talks markdown generator for academicpages +# +# Takes a TSV of talks with metadata and converts them for use with [academicpages.github.io](academicpages.github.io). This is an interactive Jupyter notebook ([see more info here](http://jupyter-notebook-beginner-guide.readthedocs.io/en/latest/what_is_jupyter.html)). The core python code is also in `talks.py`. Run either from the `markdown_generator` folder after replacing `talks.tsv` with one containing your data. +# +# TODO: Make this work with BibTex and other databases, rather than Stuart's non-standard TSV format and citation style. + +# In[1]: + +import pandas as pd +import os + + +# ## Data format +# +# The TSV needs to have the following columns: title, type, url_slug, venue, date, location, talk_url, description, with a header at the top. Many of these fields can be blank, but the columns must be in the TSV. +# +# - Fields that cannot be blank: `title`, `url_slug`, `date`. All else can be blank. `type` defaults to "Talk" +# - `date` must be formatted as YYYY-MM-DD. +# - `url_slug` will be the descriptive part of the .md file and the permalink URL for the page about the paper. +# - The .md file will be `YYYY-MM-DD-[url_slug].md` and the permalink will be `https://[yourdomain]/talks/YYYY-MM-DD-[url_slug]` +# - The combination of `url_slug` and `date` must be unique, as it will be the basis for your filenames +# + + +# ## Import TSV +# +# Pandas makes this easy with the read_csv function. We are using a TSV, so we specify the separator as a tab, or `\t`. +# +# I found it important to put this data in a tab-separated values format, because there are a lot of commas in this kind of data and comma-separated values can get messed up. However, you can modify the import statement, as pandas also has read_excel(), read_json(), and others. + +# In[3]: + +talks = pd.read_csv("talks.tsv", sep="\t", header=0) +talks + + +# ## Escape special characters +# +# YAML is very picky about how it takes a valid string, so we are replacing single and double quotes (and ampersands) with their HTML encoded equivilents. This makes them look not so readable in raw format, but they are parsed and rendered nicely. + +# In[4]: + +html_escape_table = { + "&": "&", + '"': """, + "'": "'" + } + +def html_escape(text): + if type(text) is str: + return "".join(html_escape_table.get(c,c) for c in text) + else: + return "False" + + +# ## Creating the markdown files +# +# This is where the heavy lifting is done. This loops through all the rows in the TSV dataframe, then starts to concatentate a big string (```md```) that contains the markdown for each type. It does the YAML metadata first, then does the description for the individual page. + +# In[5]: + +loc_dict = {} + +for row, item in talks.iterrows(): + + md_filename = str(item.date) + "-" + item.url_slug + ".md" + html_filename = str(item.date) + "-" + item.url_slug + year = item.date[:4] + + md = "---\ntitle: \"" + item.title + '"\n' + md += "collection: talks" + "\n" + + if len(str(item.type)) > 3: + md += 'type: "' + item.type + '"\n' + else: + md += 'type: "Talk"\n' + + md += "permalink: /talks/" + html_filename + "\n" + + if len(str(item.venue)) > 3: + md += 'venue: "' + item.venue + '"\n' + + if len(str(item.location)) > 3: + md += "date: " + str(item.date) + "\n" + + if len(str(item.location)) > 3: + md += 'location: "' + str(item.location) + '"\n' + + md += "---\n" + + + if len(str(item.talk_url)) > 3: + md += "\n[More information here](" + item.talk_url + ")\n" + + + if len(str(item.description)) > 3: + md += "\n" + html_escape(item.description) + "\n" + + + md_filename = os.path.basename(md_filename) + #print(md) + + with open("../_talks/" + md_filename, 'w') as f: + f.write(md) + + +# These files are in the talks directory, one directory below where we're working from. + diff --git a/markdown_generator/talks.tsv b/markdown_generator/talks.tsv index 048fef051cbda..4e4b0d22ded66 100644 --- a/markdown_generator/talks.tsv +++ b/markdown_generator/talks.tsv @@ -1,63 +1,5 @@ -date talk_type title venue institution geoloc url slug summary description -2008-03-01 Talk A Communicative Ethnography of Argumentative Strategies in a Wikipedian Content Dispute Exploring New Media Worlds Texas A&M University College Station, TX tamu-communicative-ethnography -2008-07-19 Talk Conceptions and Misconceptions Academics Hold About Wikipedia Annual Wikimedia Conference (Wikimania) Wikimedia Foundation Alexandria, Egypt wikimania-academics-wikipedia -2009-03-28 Talk Working With/in Wikipedia: Infrastructures of Knowing and Knowledge Production Annual Conference on Science and Technology in Society AAAS Wasthington, Dc aaas-wikipedia-infrastructures -2009-04-25 Talk Evolving Governance and Media Use in Wikipedia: A Historical Account Media in Transition 6 MIT Cambridge, MA mit6-media-use-wikipedia -2009-07-26 Talk Algorithmic Governance: The Social Roles of Bots and Assisted Editing Tools First Annual Wikiconference NYC Wikimedia Foundation New York, NY wikiconf-nyc-algorithmc-governance -2009-09-25 Talk Trace Ethnography: An ANT Method for the Study of Sociotechnical Networks the Second Annual Media Sociology Forum New York University New York, NY nyu-trace-ethnography -2009-10-27 Conference proceedings talk The Social Roles of Bots and Assisted Editing Tools International Symposium on Wikis and Open Collaboration ACM Orlando, Florida http://www.stuartgeiger.com/papers/geiger-wikisym-bots.pdf wikisym-social-roles-bots A short paper showing the recent explosive growth of automated editors (or bots) in Wikipedia, which have taken on many new tasks in administrative spaces. A short paper showing the recent explosive growth of automated editors (or bots) in Wikipedia, which have taken on many new tasks in administrative spaces. -2009-10-28 Talk Where Are the Missing Wikipedians? The Sociology of a Bot Annual Meeting of the Society for the Social Study of Science (4S) Arlington, Virginia 4s-missing-wikipedians -2010-01-10 Talk The Wisdom of Bots: A Critique of ‘Self-Organization’ in Wikipedia Critical Point of View: Wikipedia and the Politics of Open Knowledge Centre for Internet and Society Bangalore, India cpov-wisdom-of-bots -2010-02-25 Conference proceedings talk The Work of Sustaining Order in Wikipedia: The Banning of a Vandal Conference on Computer Supported Cooperative Work ACM Savannah, Georgia http://www.stuartgeiger.com/papers/cscw-sustaining-order-wikipedia.pdf cscw-banning-vandal This paper traces out a heterogeneous network of humans and non-humans involved in the identification and banning of a single vandal in Wikipedia. This paper traces out a heterogeneous network of humans and non-humans involved in the identification and banning of a single vandal in Wikipedia. -2010-03-26 Talk Bot Politics: How is Automation Changing the Wikipedian Society? Critical Point of View II Critical Point of View: Wikipedia and the Politics of Open Knowledge Institute for Network Cultures Amsterdam, the Netherlands cpov-bot-politics -2010-07-10 Panelist Academic Researchers in Wikimedia Communities: Ethics, Methods, and Policies Wikimania 2010 Wikimedia Foundation Gdansk, Poland https://wikimania2010.wikimedia.org/wiki/Submissions/Academic_Researchers_in_Wikimedia_Communities:_Ethics,_Methods,_and_Policies wikimania-academic-researchers A panel intended to foster a dialog between academic researchers who study Wikimedia projects and the Wikimedia community. In the past few years, there has been an explosion of academic researchers who are turning their attention towards Wikimedia projects. Hundreds of scholars representing virtually every discipline in the humanities, the social sciences, and the computational sciences have entered various Wikimedia communities to answer a broad range of questions using an equally diverse set of methods. As both researchers and editors have learned, actively studying wiki communities raises a number of problems and concerns, from the practical and logistical to the ethical and legal. Despite years of informal dialog and negotiation between individual researchers and editors, multiple WikiProjects dedicated to wiki research, and an ongoing attempt to craft a formal research policy, many issues are still unresolved.\n\nThe purpose of this panel is to foster a dialog between wiki researchers and the Wikimedia community on many of these issues, which include: Are talk pages, listservs, and IRC channels public spaces, and should researchers have to identify themselves when they enter? What kinds of privacy issues exist with reporting an editor’s actions in an academic article? Under what conditions should social scientists be allowed to send unsolicited requests to random samples of users, a process which has been sometimes been considered spam and resulted in the banning of researchers’ accounts? Are controversial 'breaching experiments' (such as vandalism response tests) of value, and is there a way they can be performed in a satisfactory manner? Should there be a group or committee that would review and approve certain kinds of academic research activities, similar to the Bot Approval Group? These are some of the many questions that will be asked, and audience members will be encouraged to participate in this discussion. -2011-01-03 Conference proceedings talk Trace Ethnography: Following Coordination through Documentary Practices Hawaii International Conference on System Sciences IEEE Lihue, Hawaii http://www.stuartgeiger.com/trace-ethnography-hicss-geiger-ribes.pdf hicss-trace-ethnography We detail the methodology of ‘trace ethnography’, which combines the richness of participant-observation with the wealth of data in logs so as to reconstruct patterns and practices of users in distributed sociotechnical systems We detail the methodology of ‘trace ethnography’, which combines the richness of participant-observation with the wealth of data in logs so as to reconstruct patterns and practices of users in distributed sociotechnical systems -2011-03-04 Talk Machine-Generated Content: Bots and the Governance of Wikipedia Digital Media and Learning (DML) Long Beach, CA dml-bots-governance -2011-10-05 Conference proceedings talk Participation in Wikipedia’s Article Deletion Processes (with Heather Ford) International Symposium on Wikis and Open Collaboration ACM Mountain View, CA http://www.stuartgeiger.com/papers/article-deletion-wikisym-geiger-ford.pdf wikisym-article-deletion This paper investigates Wikipedia's article deletion processes, finding that it is heavily populated by specialists. We find that Wikipedia’s deletion process is heavily frequented by a relatively small number of longstanding users. The vast majority of such deleted articles are not spam, vandalism, or “patent nonsense,” but rather articles which could be considered encyclopedic, but do not fit the project‟s standards. -2011-11-02 Talk ’The Internet is Here’: The Virtuality of ‘On-line Communities in Physical Spaces Annual Meeting of the Society for the Social Study of Science (4S) Cleveland, OH 4s-internet-is-here -2011-11-03 Talk User-Generated Platforms in Wikipedian Governance Annual Meeting of the Society for the Social Study of Science (4S) Cleveland, OH 4s-wikipedian-governance -2012-03-31 Talk Improving Wikipedia’s Notifications to Rejected Contributors GCOE International Symposium on Informatics Education Kyoto University Kyoto, Japan gcoe-wikipedia-notifications -2012-05-02 Conference proceedings talk Black-boxing the user: internet protocol over xylophone players (IPoXP) Conference on Human Factors in Computing (CHI) ACM Austin, Texas http://stuartgeiger.com/ipoxp.pdf altchi-ipoxp We introduce IP over Xylophone Players (IPoXP), a novel Internet protocol between two computers using xylophone-based Arduino interfaces We introduce IP over Xylophone Players (IPoXP), a novel Internet protocol between two computers using xylophone-based Arduino interfaces. In our implementation, human operators are situated within the lowest layer of the network, transmitting data between computers by striking designated keys. We discuss how IPoXP inverts the traditional mode of human-computer interaction, with a computer using the human as an interface to communicate with another computer. -2012-05-07 Panelist Hunting for Fail Whales: Lessons from Deviance and Failure in Social Computing Conference on Human Factors in Computing (CHI) ACM Austin, Texas chi-fail-whales -2012-06-05 Conference proceedings talk Defense Mechanism or Socialization Tactic? Improving Wikipedia’s Notifications to Rejected Contributors International Conference on Weblogs and Social Media (ICWSM) AAAI Dublin, Ireland http://stuartgeiger.com/defense-mechanism-icwsm.pdf icwsm-socialization-wikipedia A descriptive study of Wikipedia's highly-automated socialization processes and an A/B test to improve templated messages to newcomers. In this paper, we first illustrate and describe the various defense mechanisms at work in Wikipedia, which we hypothesize are inhibiting newcomer retention. Next, we present results from an experiment aimed at increasing both the quantity and quality of editors by altering various elements of these defense mechanisms, specifically pre-scripted warnings and notifications that are sent to new editors upon reverting or rejecting contributions. Using regression models of new user activity, we show which tactics work best for different populations of users based on their motivations when joining Wikipedia. -2012-10-12 Talk Trace literacy: a framework for holistically conceptualizing newcomer socialization in socio-technical systems Infosocial Northwestern University Evanston, IL infosocial-trace-literacy -2012-10-17 Talk Time to Degree: Examining the Experiences of Graduate Students in the Long-Term Ecological Research Network Annual Meeting of the Society for the Social Study of Science (4S) Copenhagen, Denmark 4s-time-to-degree -2012-10-29 Panelist What Aren’t We Measuring? Methods for Quantifying Wiki-Work. International Symposium on Wikis and Open Collaboration (WikiSym 2012) ACM Linz, Austria wikisym-methods -2013-02-07 Guest lecture Actor-Network Theory Social Aspects of Information Systems course UC-Berkeley School of Information Berkeley, CA 203-actor-network-theory An introduction to Actor Network Theory for students in the Masters of Information Management and Systems (MIMS) course -2013-02-23 Conference proceedings talk Using Edit Sessions to Measure Participation in Wikipedia (with Aaron Halfaker) Conference on Computer Supported Cooperative Work ACM San Antonio, TX http://www.stuartgeiger.com/cscw-sessions.pdf cscw-edit-sessions This paper establishes a quantitative metric for measuring editor activity through temporal edit sessions. Many quantitative, log-based studies of participation and contribution in CSCW and CMC systems measure the activity of users in terms of output, based on metrics like posts to forums, edits to Wikipedia articles, or commits to code repositories. In this paper, we estimate the amount of time users have spent contributing. Through an analysis of Wikipedia log data, we identify a pattern of punctuated bursts in editors’ activity that we refer to as edit sessions. Based on these edit sessions, we build a metric that approximates the labor hours of editors in the encyclopedia. Using this metric, we first compare labor-based analyses with output-based analyses, finding that the activity of many editors can appear quite differently based on the kind of metric used. -2013-02-26 Panelist Community, Impact, and Credit: Where Do I Submit My Papers? ACM Conference on Computer-Supported Cooperative Work (CSCW) San Antonio, TX cscw-community-impact -2013-03-01 Talk Values Where? Interrogating Client-Side Scripting as a Design Process Theorizing the Web New York, NY ttw-values-where -2013-08-03 Conference proceedings talk When the Levee Breaks: Without Bots, What Happens to Wikipedia’s Quality Control Processes? (with Aaron Halfaker) International Symposium on Wikis and Open Collaboration (WikiSym 2012) ACM Hong Kong http://stuartgeiger.com/wikisym13-cluebot.pdf wikisym-levee-breaks-bots This paper examines what happened when one of Wikipedia's counter-vandalism bots unexpectedly went offline. In the first half of 2011, ClueBot NG – one of the most prolific counter-vandalism bots in the English-language Wikipedia – went down for four distinct periods, each period of downtime lasting from days to weeks. In this paper, we use these periods of breakdown as naturalistic experiments to study Wikipedia’s heterogeneous quality control network. Our analysis showed that the overall time-to-revert edits was almost doubled when this software agent was down. Yet while a significantly fewer proportion of edits made during the bot’s downtime were reverted, we found that those edits were later eventually reverted. This suggests that other agents in Wikipedia took over this quality control work, but performed it at a far slower rate. -2013-10-09 Talk Hadoop as Grounded Theory: Is an STS Approach to Big Data Possible? the 2013 Annual Meeting of the Society for the Social Study of Science 4S Annual Meeting of the Society for the Social Study of Science (4S) San Diego, CA ica-hadoop-grounded-theory -2013-10-23 Talk Design by Bot: Power and Resistance in the Development of Automated Software Agents Annual Meeting of the Association of Internet Researchers (AoIR) Denver, CO aoir-design-by-bot -2013-11-25 Talk Size Matters: How Big Data Changes Everything Bangkok Scientifique Bangkok, Thailand http://www.meetup.com/bkksci/events/140639312/ bkk-data A talk introducing various concepts around large-scale data analysis to a general audience, including spam detection and governmental survellance. In the wake of all the rigmarole going on with the Snowden leaks and details of the NSA intelligence gathering apparatus, it's suddenly very clear to the average person just how much data is out there, and how difficult it must be to recognize, organize and filter it for a usable purpose. Even if we try to minimize our digital footprint, each of us nonetheless generates an incredible amount of data that represents US in the digital realm. To talk us through what systems are used to parse such vast quantities of data into a usable format, we are happy to welcome Stuart Geiger to this month's BkkSci. Stuart's current research is on the intersection of data science and artificial intelligence (AI) that is often branded as "Big Data." These systems collect massive, diverse, and complex data sets, and then use this data to teach computers how to identify patterns and make decisions. Stuart will talk about his work both in building these automated agents to support the production of knowledge, and in studying how these systems are changing how scientists, governments, businesses, and ordinary people like you and me come know the world. -2014-04-04 Panelist Robotic Ethics and Opportunities Robots and New Media Berkeley Center for New Media Berkeley, CA http://robotsandnewmedia.com/ robots-new-media A panel discussing the ethical and political issues that are raised with autonomous robots and software bots. -2014-04-10 Guest lecture Governing the Commons History of Information UC-Berkeley School of Information Berkeley, CA hofi-governing-commons A lecture on the history of Wikipedia, in the broader context of the history of reference works. -2014-04-25 Talk Successor Systems: Enacting Ideological Critique Through the Development of Software Theorizing the Web Brooklyn, New York http://theorizingtheweb.tumblr.com/2014/program ttw-successor-systems -2014-05-16 Talk Successor Systems: The Role of Reflexive Algorithms in Enacting Ideological Critique The Contours of Algorithmic Life Mellon Research Initiative in Digital Cultures Davis, CA algolife-successor-systems -2014-05-23 Panelist Data­-Driven Data Research Using Data and Databases: A Practical Critique of Methods and Approaches in “Big Data” Studies Annual Meeting of the International Communication Association (ICA) Seattle, WA ica-data-driven-data This panel focuses on the challenges faced by researchers conducting mixed-method research into online platforms, particularly where large amounts of data are widely available. In the past two years, the buzzword "big data" has provoked critiques by a number of social scientists (eg., boyd & Crawford 2011; Bruns & Burgess 2012; Burrell 2012; Baym 2013) on the theories, methodologies, and analysis of large data sources. This panel follows up from last year’s ICA 2013 panel, “Downsizing Data: Analyzing Social Digital Traces,” and draws from the experiential grounded approach of Hargittai’s (2009) “Research Confidential” to bring to light practical critiques of the application of digital data research methods in the study of social media platforms. Namely, the panelists will explore how social scientists can shift away from the rhetoric surrounding “big data” and robustly analyze the use of large-scale, computationally-driven, mixed-methods approaches in digital data research. Again, this panel will not discredit large-scale data approaches; instead, we aim to provide context to researchers who wish to employ them in combination with established methods in the field. The panel brings together five scholars to speak about their successes and failures working on projects that employ large-scale digital data methods and tools, regardless of the size of the data, in addition to their iterative approaches dealing with the practicalities of data collection, sampling, theory, analysis, and especially results. Notably, these projects are not purely quantitative, analytical studies employing large datasets: all participants use largescale data and computational approaches within the context of empirical mixed-methods or even (traditionally) qualitative, interpretive studies. The panelists will also discuss the critical approaches to “big data” that inhabit each project. These projects are all exemplars of an emerging mode of scholarship, and collectively they aim to generate a productive and concrete discussion about methodology and epistemology. After a framed introduction by the moderator, participants will spend 10 minutes each to speak in detail about the methodologies of their projects, after which the latter half of the panel will open to discussion with the audience. This panel also will be paired with a Blue Sky Workshop, provocatively entitled “‘Big Data is Bullshit’: Scoping the Next 5 Years of Digital Data Research.” We aim to use the panel as an expert-driven, experiential methods presentation as well as a launchpad for topics and debates that can be further explored in the workshop session (which will occur at some point following the panel). -2014-05-24 Panelist Big Data is Bullshit': Scoping the Next 5 Years of Digital Data Research Annual Meeting of the International Communication Association (ICA) Seattle, WA ica-big-data-bullshit -2014-08-23 Talk Successor Systems: The Role of Reflexive Algorithms in Enacting Ideological Critique Annual Meeting of the Society for the Social Study of Science (4S) Buenos Aires, Argentina 4s-successor-systems -2014-10-21 Talk Successor Systems: The Role of Reflexive Algorithms in Enacting Ideological Critique Annual Meeting of the Association of Internet Researchers (AoIR) Daegu, South Korea aoir-successor-systems -2014-11-02 Talk Defining, Designing, and Evaluating Civic Values in Human Computation and Collective Action Systems (with Nathan Matias) Human Computation Conference (HCOMP), Citizen-X Workshop Pittsburgh, PA http://stuartgeiger.com/defining-civic-values-hcomp-matias-geiger.pdf hcomp-values-in-crowdsourcing We review various crowdsourcing and collective action systems, identifying particular sets of civic values and assumptions. Collective action is often described in terms of the relationships, learning, principled processes, and community capacities it fosters. Despite this, human computation and collective action systems are often designed and evaluated with system outputs in mind: the quality of answers, the number of votes, the accuracy of content created. In this proposal, we review literature on the design values of “citizen-x” systems, put forward a series of models for describing the civic values in “citizen-x”, and classify systems by those models. -2014-12-09 Talk Supporting Change from Outside Systems with Design and Data Berkman Center for Internet and Society Cambridge, MA berkman-successor-systems -2015-03-15 Workshop presentation Situated knowledges and successor systems: developing CSCW systems to enact ideological critiques CSCW Workshop on Feminism and Feminist Approaches in Social Computing ACM Vancouver, BC cscw-feminism-workshop -2015-03-15 Workshop presentation Does Facebook Have Civil Servants? On Governmentality and Computational Social Science CSCW Workshop on Ethics for Studying Sociotechnical Systems in a Big Data World Vancouver, BC cscw-ethics-workshop -2015-03-24 Workshop presentation Trace Ethnography Workshop ISchools Conference Newport Beach, CA http://trace-ethnography.github.io iconf-trace-ethno -2015-04-07 Guest lecture Moderating Online Conversation Spaces Social Aspects of Information Systems course UC-Berkeley School of Information Berkeley, CA 203-moderating-online-spaces An overview of how various online platforms moderate content, discussing issues that link up to the theories discussed in the Social Aspects of Information Systems class. -2015-04-09 Guest lecture Peer Production and Wikipedia Social Aspects of Information Systems course UC-Berkeley School of Information Berkeley, CA 203-wikipedia An overview of Wikipedia and other peer production platforms, discussing issues that link up to the theories discussed in the Social Aspects of Information Systems class. -2015-05-25 Talk But it Wouldn’t Be an Encyclopedia; It Would Be a Wiki: Wikipedia and the Repurposing of WikiWikiWeb Annual Meeting of the International Communication Association (ICA) San Juan, Puerto Rico ica-wiki-history In this talk, I examine the early history of “anyone can edit” wiki software -- originally developed in 1995, six years before Wikipedia’s origin -- focusing on the ways in which this technological infrastructure has been repurposed across communities, domains, and scales. In this talk, I examine the early history of “anyone can edit” wiki software -- originally developed in 1995, six years before Wikipedia’s origin -- focusing on the ways in which this technological infrastructure has been repurposed across communities, domains, and scales. While today, the idea of a wiki is associated with large-scale, massively-distributed encyclopedic knowledge production, this was not always the case. As I show, many of the assumptions and practices in pre-Wikipedia wiki communities contradicted the idea of a universal repository to document the sum total of human knowledge. In fact, the title of this presentation comes from a conversation between Wikipedia’s co-founder Jimmy Wales and Ward Cunningham, the creator of the first wiki, who advised Wales that the goals of creating a general-purpose encyclopedia and a wiki might be inherently contradictory. As Wales, Sanger, and other early Wikipedians used Cunningham’s wiki software to produce a collective encyclopedia, they found themselves constantly modifying the wiki platform, incorporating features and affordances that supported the kind of encyclopedic knowledge production they found themselves engaged in. Many of these novel features -- such as a persistent history of edits to articles, separate discussion pages for individual articles, and citations/references -- are now taken for granted aspects of what it means for a wiki to be a wiki. Yet at the time, their existence was far more controversial and precarious. Using archival and software studies methods, I illustrate several ways in which wiki software was adapted for the specific purposes and practices of Wikipedians, departing substantially from the pre-Wikipedia understandings of what wiki-based collaboration is and ought to be. Beyond Wikipedia, this case shows how technological infrastructures intersect with particular configurations of communities, epistemologies, and ideologies. Focusing on how one particular infrastructure was re-used and repurposed for a rather different set of values gives us a useful case for problematizing technologically determinist narratives around media technology and society. -2015-10-23 Talk Bot-Based Collective Blocklists in Twitter: The Counterpublic Moderation of a Privately-Owned Networked Public Space Annual Meeting of the Association of Internet Researchers (AoIR) Phoenix, AZ aoir-blockbots This presentation introduces bot-based collective blocklists (or blockbots) in Twitter, which have been created to help various groups better moderate their own experiences on the site. -2015-11-06 Panelist Crowdsourcing: Theoretical Considerations Crowdsourcing and the Academy Symposium UC-Berkeley Berkeley, CA http://hssa.berkeley.edu/crowdsourcing-symposium crowdsourcing-academy A panel discussing how academics use crowdsourcing in research. -2015-11-12 Talk The Bot Multiple: Unpacking the Materialities of Automated Software Agents Annual Meeting of the Society for the Social Study of Science (4S) Denver, CO 4s-bot-multiple I examine the roles that automated software agents (or bots) play in the governance and moderation of Wikipedia, Twitter, and reddit – three online platforms that differently uphold a related set of commitments to ‘open’ and ‘public’ online participation. This paper examines the roles that automated software agents (or bots) play in the governance and moderation of Wikipedia, Twitter, and reddit – three online platforms that differently uphold a related set of commitments to ‘open’ and ‘public’ online participation. While bots are often discussed as malicious or fake agents (e.g. ‘socialbots’), the bots I discuss in these three platforms are more or less legitimate social actors, delegated substantial authority in autonomously enforcing norms and policies. These bots extend and modify the functionality of sites like Wikipedia, Twitter, and reddit, and are generally developed and deployed by volunteers on their own time – continuously operated on computers that are independent from the servers hosting the site. These governance bots involve alternative relations of power and code, requiring that we go beyond studying software code in order to unpack the sociomaterial configurations at work in such digitally-architected spaces. Instead of taking for granted the pre-existing stability of these sites as unified platforms, bots require that we examine the concrete, historically contingent material conditions under which this code is run. Reporting from a multi-sited ethnography of infrastructure, I demonstrate several ways in which bot development comes on the scene in relation to broader assemblages of server farms, platform code, federated databases, code repositories, issue trackers, application programming interfaces, terms of service, mailing lists, counterpublic groups, and a variety of other entities. I argue that bots give us a compelling set of cases for exploring the multiple materialities at work in highly-distributed online spaces. -2016-01-16 Talk Why bots are my favorite contribution to Wikipedia Wikipedia 15th Anniversary Birthday Bash Wikimedia Foundation San Francisco, CA wiki15-bots A short talk to open up an event celebrating the 15th anniversary of Wikipedia. The prompt we were given was "Why [x] is my favorite contribution to Wikipedia." A short talk to open up an event celebrating the 15th anniversary of Wikipedia. The prompt we were given was "Why [x] is my favorite contribution to Wikipedia." Here is a photo of me handing off the mic: