⬆️ Hacker News ranked by Comment/Score ratio
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
.travis
results New Data via Travis build: 802 Jan 19, 2019
.eslintrc FEAT: implement reverse feature, add eslint Dec 17, 2017
.gitignore FEAT: implement reverse feature, add eslint Dec 17, 2017
.travis.yml MISC: alternative format 2 Dec 16, 2017
LICENSE Create LICENSE Apr 21, 2017
README.md Implement date-picker, styling fixes Dec 10, 2017
app.js FEAT: implement reverse feature, add eslint Dec 17, 2017
hn-ratio.py MISC: increase score threshold Dec 17, 2017
index.html FEAT: implement reverse feature, add eslint Dec 17, 2017
package-lock.json FEAT: implement reverse feature, add eslint Dec 17, 2017
package.json FEAT: implement reverse feature, add eslint Dec 17, 2017
rename.sh FEAT: add basic html and d3, cleanup filenames Dec 6, 2017
requirements.txt update dependency Apr 21, 2017
run-hn-ratio-script.sh New results, add script Dec 10, 2017
styles.css

README.md

HN Ratio

Hacker News top 50 stories ranked by Comment/Score ratio.

Why

Being a long-time Hacker News reader, I realized that most interesting HN submissions for me are of two categories:

  • stories that attract a lot of attention and comments, signaling active and extended debates
  • stories that attract a lot of attention but few comments, signaling very insightful post that requires no further comments

Hence, I used the Comment/Score ratio to sort the top HN submissions and locate these stories at the top and the bottom of the list.

Result

Web App

Raw results by date

Note that new "promoted" stories with low votes and HN-backed job listings are filtered.

Why not use HN API in real time?

  • It takes some time to get all the data from Hacker News
  • HN API does not have archive for top stories in past, whereas I want an archive so that I can find what I missed

Currently I have a cron job set up on my VPS to get the data daily, and I need to manually commit the data into the repo once in a while. I can probably set up a CI to do this when I have more time.

How to run the script

$ pip install -r requirements.txt

Then

$ python hn-ratio.py

Result for the day will be generated in both json and plain text format (with basic visualization) in the results directory.