Skip to content

sanchittechnogeek/quora-analytics

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

31 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

quora-analytics

Scripts to do some Quora analytics

This repo is under active development. It is far from complete. I recommend not using it unless you know what you're doing. I have directly copied the python crawler script from Brian Bi's Quora Back Up.

Installation

Just install all dependencies using : pip install -r requirements.txt. You all need to keep the chromedriver or phantomjs driver in your PATH for the scripts to run. Download Chrome Driver here

Quora answer backup and personal statistics

crawler.py script should be executed first to fetch and save all your quora answers. Then each script can be executed independently. Use -v (or --verbose) option on these script to see messages and progress. All these scripts may take a few minutes each depending on number of your answers.

A GUI using Tkinter is under active development.

Q-Index

The script named qindex.py computes the Q-Index of a user from his / her downloaded answers. Let's define a user's q-index, inspired by the h-index, as the highest number q for which it is true that the user has q answers with at least q upvotes. See Q-Index Topic on Quora for more details.

View - UpVotes Statistics

The script named view_upvote.py computes multiple metrics related to views and upvotes of all your answers. The major metrics are :

  • Total view across all answers
  • Total upvotes across all answers
  • Average views per answer
  • Average upvotes per answer
  • Average of (Views / Upvotes) per answers
  • Ratio of total views across all answers and total upvotes across all answers
  • n most viewed answers
  • n most upvoted answers
  • n answers with highest (UpVotes / Views) ratio

Word Count

This script computes statistics related to word count of your answers. The major metrics are :

  • Total words across all answers
  • Total characters across all answers
  • Average word count for each answer
  • Average character count for each answer
  • Top n longest answers in character count

General Quora Bot

Find Answers

This script essentially searches for answers and profiles on quora. It parses the most viewed answers (in last 30 days) on profiles and then adds those answers to database. Next it parses those answers in database and also searches for good profiles in those answers. The script save a copy of all the web pages that it parses. Note -

  • The script suffers from rate limitations enforced by Quora
  • Options to limit the number of requests has been provided
  • Uses Sqlite Database to store statistics
  • Uses a list of Top Writers 2016 as seed data

Most on Quora

This script will explore the extremes of data collected. This metric will be updated as per demand. The important metrics calculated are :

  • Answers with >10K views but no upvotes

About

Scripts to do some Quora analytics and Crawl through Quora

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%