GitHub - santiagobasulto/hn-summary

I want to break my habit of browsing HN all the time, but I still don't want to miss important news or events. So I built this simple static site with summaries and aggregations of the most popular Stories, Ask HN, Show HN and other categorized posts (News, Dev Blogs, Scientific Publications, etc.).

Where does the data come from?

I built a quick wrapper around Algolia's API and I use it to dump a huge CSV of all the HN posts every week. The source dataset is hosted on Kaggle.
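The wrapper itself isn't shown here, so the following is only a rough sketch of what a weekly dump could look like. It assumes the public HN Search endpoint `search_by_date` and its `tags`, `numericFilters`, `hitsPerPage` and `page` parameters; the function names and the chosen CSV columns are hypothetical, not the ones used in this repo.

```python
import csv
import io
from urllib.parse import urlencode

# Public HN Search (Algolia) endpoint that returns items sorted by date.
API_URL = "https://hn.algolia.com/api/v1/search_by_date"

# Hypothetical column selection; the real dataset may use different columns.
FIELDS = ["objectID", "title", "url", "author", "points", "num_comments", "created_at"]


def build_query_url(start_ts, end_ts, page=0, hits_per_page=100):
    """Build the URL for one page of stories created in [start_ts, end_ts)."""
    params = {
        "tags": "story",
        "numericFilters": f"created_at_i>={start_ts},created_at_i<{end_ts}",
        "hitsPerPage": hits_per_page,
        "page": page,
    }
    return f"{API_URL}?{urlencode(params)}"


def hits_to_csv(hits, out):
    """Write the 'hits' list of an API response to a file-like object as CSV."""
    writer = csv.DictWriter(out, fieldnames=FIELDS)
    writer.writeheader()
    for hit in hits:
        # Keep only the selected fields; missing ones become empty cells.
        writer.writerow({field: hit.get(field, "") for field in FIELDS})
```

Fetching each URL (for example with `urllib.request.urlopen`), decoding the JSON and passing its `hits` list to `hits_to_csv` would complete the loop; paging and rate limiting are left out of this sketch.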

Custom groups of posts

I've grouped posts from specific domains in a very arbitrary way. You can check (and contribute to) the groups in domain_groups.json. Open an issue if you think a domain should be added or moved.
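To give an idea of how a domain-to-group lookup might work, here is a minimal sketch. The layout of domain_groups.json isn't described above, so the structure below (group name mapped to a list of domains) is an assumption, and the sample domains and helper names are made up for illustration.

```python
import json
from urllib.parse import urlparse

# Assumed layout: {"group name": ["domain", ...]}. The real file may differ.
SAMPLE_GROUPS_JSON = """
{
  "News": ["nytimes.com", "bbc.com"],
  "Dev Blogs": ["blog.example.com"]
}
"""


def build_domain_index(groups):
    """Invert {group: [domains]} into {domain: group} for fast lookups."""
    return {
        domain.lower(): group
        for group, domains in groups.items()
        for domain in domains
    }


def extract_domain(url):
    """Return the lowercased host of a URL, without port or leading 'www.'."""
    host = urlparse(url).netloc.lower().split(":")[0]
    return host[4:] if host.startswith("www.") else host


def group_for_url(url, index, default="Other"):
    """Look up the group for a post URL, falling back to a default group."""
    return index.get(extract_domain(url), default)


index = build_domain_index(json.loads(SAMPLE_GROUPS_JSON))
```

With this index, `group_for_url("https://www.nytimes.com/2020/x", index)` resolves to the "News" group, and any domain that isn't listed falls into "Other".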

Building and dev

I haven't done much to document this properly. I'm using Jupyter Notebooks for the build process (something I'll probably need to change in order to automate it weekly). The only requirements are pandas, jinja2 and the data source mentioned above.
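Since the build isn't documented yet, here is a rough idea of the pandas + jinja2 flow, not the actual notebook code. The column names (`title`, `url`, `points`), the inline template (a stand-in for template.html) and the function name are all assumptions.

```python
import pandas as pd
from jinja2 import Template

# Inline stand-in for template.html; the real template is more elaborate.
PAGE_TEMPLATE = Template(
    "<ul>"
    "{% for p in posts %}"
    '<li><a href="{{ p.url }}">{{ p.title }}</a> ({{ p.points }} points)</li>'
    "{% endfor %}"
    "</ul>"
)


def render_top_posts(posts, n=10):
    """Render the n highest-scoring posts of a DataFrame as an HTML list."""
    top = posts.sort_values("points", ascending=False).head(n)
    # Convert rows to plain dicts so the template can access fields by name.
    return PAGE_TEMPLATE.render(posts=top.to_dict("records"))


# Tiny in-memory sample; the real build would read the CSV dump instead,
# e.g. with pd.read_csv on the downloaded dataset.
sample = pd.DataFrame(
    {
        "title": ["Second", "Top story", "Third"],
        "url": ["https://b.example", "https://a.example", "https://c.example"],
        "points": [200, 300, 100],
    }
)
html = render_top_posts(sample, n=2)
```

Writing `html` to a file under docs would then produce the static page; running the same steps from a plain script instead of a notebook is one way the weekly automation could be approached.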

Folders and files (19 commits)

docs
.gitignore
README.md
Static Site Generator.ipynb
domain_groups.json
template.html
utils.py
