This repository has been archived by the owner on Feb 13, 2018. It is now read-only.


alphagov/govuk-search-concepts-experiments


GOV.UK Search Concepts Experiments

Experiments to extract taxonomy concepts from historical search queries performed on GOV.UK.

Requirements

  • Python 3
  • Google Analytics search report CSV
  • Inventory CSV

Setup

$ git clone git@github.com:alphagov/govuk-search-concepts-experiments.git
$ cd govuk-search-concepts-experiments
$ python3 -m venv .venv
$ source .venv/bin/activate
$ pip install -r requirements.txt

Usage

Extract search terms applicable to the inventory:

$ ./extract_inventory_searches /path/to/inventory.csv /path/to/search-data.csv | tee /dev/tty > terms.csv
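The README does not describe the CSV layouts or the matching rule, so the following is only a minimal sketch of one plausible reading of this step: keep the search terms whose destination page appears in the inventory. The function name `extract_inventory_terms` and the column names `path`, `term`, and `page` are hypothetical and are not taken from the repository's scripts.

```python
import csv
import io


def extract_inventory_terms(inventory_file, search_file):
    """Return search terms whose destination page is listed in the inventory.

    Assumption: the inventory CSV has a "path" column and the search report
    has "term" and "page" columns. These names are hypothetical.
    """
    # Collect every inventory path into a set for fast membership checks.
    inventory_paths = {row["path"] for row in csv.DictReader(inventory_file)}

    terms = []
    for row in csv.DictReader(search_file):
        if row["page"] in inventory_paths:
            # Normalise case and surrounding whitespace before keeping the term.
            terms.append(row["term"].strip().lower())
    return terms


# Tiny in-memory stand-ins for the two CSV files (illustrative data only).
inventory = io.StringIO("path\n/vat-rates\n/income-tax\n")
searches = io.StringIO(
    "term,page\n"
    "VAT rate,/vat-rates\n"
    "passport renewal,/renew-passport\n"
    "income tax bands,/income-tax\n"
)

result = extract_inventory_terms(inventory, searches)
print(result)
```

Here only the two searches that led to pages in the inventory are kept; the passport search is dropped because its page is not listed.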

Cluster search terms based on term frequency–inverse document frequency:

$ ./cluster_search_terms 100 terms.csv | tee /dev/tty > clusters.txt
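The general idea of TF-IDF clustering can be sketched with the standard library alone. This is not the code in `cluster_search_terms`, whose implementation the README does not show; it assumes each search term is treated as a small document, that the first argument (`100` above) is the number of clusters, and it uses a simple k-means variant based on cosine similarity with a naive "first k terms" initialisation. All function names are hypothetical.

```python
import math
from collections import Counter


def tfidf_vectors(terms):
    """Turn each search term into a sparse, unit-length TF-IDF vector (a dict)."""
    docs = [Counter(term.lower().split()) for term in terms]
    n = len(docs)
    # Document frequency: in how many terms does each word appear?
    df = Counter(word for doc in docs for word in doc)
    vectors = []
    for doc in docs:
        vec = {w: tf * (math.log(n / df[w]) + 1.0) for w, tf in doc.items()}
        norm = math.sqrt(sum(x * x for x in vec.values())) or 1.0
        vectors.append({w: x / norm for w, x in vec.items()})
    return vectors


def dot(a, b):
    """Dot product of two sparse vectors; equals cosine similarity for unit vectors."""
    return sum(x * b.get(w, 0.0) for w, x in a.items())


def cluster(vectors, k, iterations=10):
    """Assign each vector to one of k clusters by cosine similarity."""
    # Naive, deterministic initialisation: the first k vectors are the centroids.
    centroids = [dict(v) for v in vectors[:k]]
    labels = [0] * len(vectors)
    for _ in range(iterations):
        # Assignment step: pick the most similar centroid for each vector.
        labels = [max(range(k), key=lambda c: dot(v, centroids[c])) for v in vectors]
        # Update step: each centroid becomes the normalised sum of its members.
        for c in range(k):
            total = Counter()
            for v, label in zip(vectors, labels):
                if label == c:
                    for w, x in v.items():
                        total[w] += x
            if total:
                norm = math.sqrt(sum(x * x for x in total.values())) or 1.0
                centroids[c] = {w: x / norm for w, x in total.items()}
    return labels


terms = ["vat rates", "passport renewal", "vat return", "renew passport"]
labels = cluster(tfidf_vectors(terms), k=2)
for term, label in zip(terms, labels):
    print(label, term)
```

With this toy input the two VAT queries share one cluster and the two passport queries share the other. A real run would need a better initialisation and far more data than four terms.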
