@ukgovdatascience

UK Government Data Science

Data science code from the UK Government

Pinned repositories

  1. data_scientist_career_path

    Draft Data Scientist career path by the Government Data Science Partnership

    40 11

  2. govstyle

    Theme for use with ggplot2 for creating government style visualisations

    R 35 6

  3. rap_companion

    A technical communication document intended to give assistance to people developing a Reproducible Analytical Pipeline.

    R 16 1

  • An R package to help analyse organisational data. Inspired by analysis of the the annual UK Civil Service People Survey which looks at Civil Servants' attitudes to and experience of working in government departments.

    R 2 MIT 3 issues need help Updated Aug 20, 2018
  • Ops and deployment resources for MOJ Analytics platform

    Python 2 MIT Updated Aug 7, 2018
  • Controls the deployment of user Data Science Sandboxes

    Python 1 MIT Updated May 23, 2018
  • A technical communication document intended to give assistance to people developing a Reproducible Analytical Pipeline.

    R 16 1 GPL-3.0 Updated Apr 13, 2018
  • Infrastructure as a code for Data Science processing machine

    HCL 5 1 MIT Updated Feb 27, 2018
  • Python Updated Nov 30, 2017
  • Python 4 Updated Nov 10, 2017
  • Docker container for running lda tagger experiments

    HTML 1 Updated Oct 31, 2017
  • Python 1 Updated Oct 19, 2017
  • HTML 1 MIT Updated Oct 17, 2017
  • files to clean LDA_output, for hierarchy development and for preparing data for model evaluation

    Python Updated Sep 26, 2017
  • R Updated Sep 19, 2017
  • Updated Sep 19, 2017
  • boilerplate for ds project startup

    R 1 3 Updated Sep 7, 2017
  • Stripped down version of the govuk-lda-tagger repository, for use in a docker container

    Python 2 MIT Updated Aug 29, 2017
  • Terraform script to run govuk-lda-tagger Docker image on ECS

    HCL 1 MIT Updated Aug 24, 2017
  • Technical documentation for the data science sandbox: https://github.com/ukgovdatascience/data-science-sandbox-infrastucture

    CSS Updated Aug 22, 2017
  • ⚠️ Prototype R package for the creation of the DCMS Sectors Economic Estimates statistical release

    R 5 10 Updated Aug 16, 2017
  • Demonstration package for easy cross-tabulation of categorical data.

    R 1 Updated Aug 16, 2017
  • An experiment of using the LDA machine learning algorithm to generate topics from documents and tag them with those topics

    Jupyter Notebook 1 4 MIT Updated Aug 10, 2017
  • Software Testing for Data Scientists

    Python Updated Aug 3, 2017
  • Draft Data Scientist career path by the Government Data Science Partnership

    40 11 Updated Aug 2, 2017
  • Repository for storing past accelerator projects

    Jupyter Notebook Updated Jul 25, 2017
  • Theme for use with ggplot2 for creating government style visualisations

    R 35 6 Updated Jul 18, 2017
  • Jupyter Notebook 1 9 Updated Jul 7, 2017
  • ⚠️ Templates of tools to help prevent committing sensitive data to github

    Shell 10 2 Updated Jun 21, 2017
  • 1 Updated Jun 15, 2017
  • Code that creates the GDS LA + TSO/EHO linked register

    R Updated May 7, 2017
  • A script that gets data from the Twitter real-time API, passes it to a message-queue (e.g. RabbitMQ) and stores tweets into MongoDB

    Python 7 Updated Apr 20, 2017
  • ⚠️ Prototype of a Reproducible Analytical Pipeline (RAP) using bookdown

    CSS 585 CC0-1.0 Updated Jan 4, 2017