Analysis of authors and stories on fanfiction.net
Install and run IPython notebook
Start with analysis.ipynb
Author's are discovered by a depth first search starting from a seed author, in my case Less Wrong
A scraper to collect the data can be found here (warning Clojure
wrangling.ipynb
"wrangles" scraped authors into DataFrames
fanfiction authors
key | description | type |
---|---|---|
id | fanfiction author id | string |
fanfiction stories
key | description | type |
---|---|---|
id | fanfiction story id | string |
author | fanfiction author id | string |
categories | story categories | [string] |
chapters | number of chapters | int |
completed | story completed? | bool |
favourites | times favourited | int |
follows | times followed | int |
genres | story genres | [string] |
language | language written in | string |
rating | age rating | string |
submitted | when submitted | datetime |
title | story title | string |
updated | when last updated | datetime |
words | number of words | int |
mapping of author id to favourite author ids
key | description | type |
---|---|---|
id | fanfiction author id | string |
favourite_author | fanfiction author id | string |
mapping of author id to favourite story ids
key | description | type |
---|---|---|
id | fanfiction author id | string |
favourite_story | fanfiction story id | string |
indicator matrix mapping for story id to genres
key | description | type |
---|---|---|
id | fanfiction story id | string |
genre 1 | story genre | 0 or 1 |
genre 2 | story genre | 0 or 1 |
... | ||
genre n | story genre | 0 or 1 |
indicator matrix mapping for story id to categories
key | description | type |
---|---|---|
id | fanfiction story id | string |
category 1 | story category | 0 or 1 |
category 2 | story category | 0 or 1 |
... | ||
category n | story category | 0 or 1 |