Scripts to do some Quora analytics
This repo is under active development. It is far from complete. I recommend not using it unless you know what you're doing. I have directly copied the python crawler script from Brian Bi's Quora Back Up.
Just install all dependencies using : pip install -r requirements.txt
. You all need to keep the chromedriver or phantomjs driver in your PATH for the scripts to run. Download Chrome Driver here
crawler.py
script should be executed first to fetch and save all your quora answers. Then each script can be executed independently. Use -v
(or --verbose
) option on these script to see messages and progress. All these scripts may take a few minutes each depending on number of your answers.
A GUI using Tkinter is under active development.
The script named qindex.py
computes the Q-Index of a user from his / her downloaded answers. Let's define a user's q-index, inspired by the h-index, as the highest number q for which it is true that the user has q answers with at least q upvotes. See Q-Index Topic on Quora for more details.
The script named view_upvote.py
computes multiple metrics related to views and upvotes of all your answers. The major metrics are :
- Total view across all answers
- Total upvotes across all answers
- Average views per answer
- Average upvotes per answer
- Average of (Views / Upvotes) per answers
- Ratio of total views across all answers and total upvotes across all answers
n
most viewed answersn
most upvoted answersn
answers with highest (UpVotes / Views) ratio
This script computes statistics related to word count of your answers. The major metrics are :
- Total words across all answers
- Total characters across all answers
- Average word count for each answer
- Average character count for each answer
- Top n longest answers in character count
This script essentially searches for answers and profiles on quora. It parses the most viewed answers (in last 30 days) on profiles and then adds those answers to database. Next it parses those answers in database and also searches for good profiles in those answers. The script save a copy of all the web pages that it parses. Note -
- The script suffers from rate limitations enforced by Quora
- Options to limit the number of requests has been provided
- Uses Sqlite Database to store statistics
- Uses a list of Top Writers 2016 as seed data
This script will explore the extremes of data collected. This metric will be updated as per demand. The important metrics calculated are :
- Answers with >10K views but no upvotes