💭
Thinking about RAP - https://nhsdigital.github.io/rap-community-of-practice/
SamHollings/README.md

Sam Hollings' GitHub

LinkedIn | NHSD RAP CoP Website

Principal Data Scientist and RAP Advocate

I'm an experienced data scientist, having worked in both academia (Physics, Astronomy, Galaxy Modelling, Material Science, Neutron Diffraction, Cell Biology, Genomics) and in the NHS (Secondary Care, Primary Care, Deaths, Mental Health, COVID), using a variety of data science (mostly machine learning and deep learning) techniques.

Currently, I'm mainly focussed on leading the NHSE RAP squad, which is working to upskill the other parts of the analytical community, making resources, training others, and raising awareness of how concepts from DevOps can improve the lives of analysts. Learn more on our website.

I've also recently developed an interest in LLMs, and produced a few tutorials to help people get started:

  • RAG (Retrieval Augmented Generation): an LLM which looks things up in a database before responding - a cheap and easy way to make it seem like an LLM has local knowledge
  • RAG with sources: shows you how to get the LLM to give sources for its claims, and more generally how to have more control over the prompts used in the pipeline.
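
As a rough illustration of the idea behind both tutorials (this is not the code from the tutorials themselves), the sketch below looks up the most relevant passages in a tiny in-memory "database" and builds a prompt that asks the model to cite its sources. The documents, the function names and the word-count similarity are all assumptions made for this example; a real pipeline would more likely use embeddings and a vector store, and would send the prompt on to an LLM.

```python
import math
import re
from collections import Counter

# Hypothetical mini "database": source name -> text (invented for this sketch).
DOCUMENTS = {
    "rap_intro.md": "RAP stands for reproducible analytical pipelines and borrows ideas from DevOps.",
    "rag_notes.md": "Retrieval augmented generation looks up documents before the model responds.",
    "tools.md": "Common analysis tools include pandas, pyspark and sklearn.",
}


def tokenize(text):
    """Lower-case the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine_similarity(a, b):
    """Cosine similarity between two word-count vectors (Counters)."""
    dot = sum(a[t] * b[t] for t in set(a) & set(b))
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(question, documents, top_k=2):
    """Return up to top_k (source, text) pairs that share words with the question."""
    query = Counter(tokenize(question))
    scored = [
        (cosine_similarity(query, Counter(tokenize(text))), name, text)
        for name, text in documents.items()
    ]
    scored.sort(key=lambda item: item[0], reverse=True)
    return [(name, text) for score, name, text in scored[:top_k] if score > 0]


def build_prompt(question, passages):
    """Put the retrieved passages, labelled by source, in front of the question."""
    context = "\n".join(f"[{name}] {text}" for name, text in passages)
    return (
        "Answer using only the context below and cite the source names "
        "in square brackets.\n\n"
        f"Context:\n{context}\n\nQuestion: {question}\nAnswer:"
    )


question = "What does RAP stand for?"
passages = retrieve(question, DOCUMENTS)
prompt = build_prompt(question, passages)
print(prompt)  # this string is what would be sent to the LLM
```

Labelling each passage with its source name in the prompt is what lets the model point back to where a claim came from.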

The tools I'm most interested in developing for:

  • python
    • pyspark
    • pandas
    • sklearn
    • statsmodels
    • TensorFlow / Keras
    • PyTorch
  • ANSI SQL (specifically as used with Spark)

The repositories I make fall into a few categories:

  • making data easier to use and access
  • data science code (such as starter code, or explorations of techniques)
  • documentation of existing health data

Pinned

  1. NHSDigital/data-analytics-services Public

    This repo collects the open-source work of the Analytics Service within NHS Digital Data Services

    37 2

  2. NHSDigital/rap-community-of-practice Public

    The RAP community of practice includes all analysts and data scientists who are interested in adopting the working practices included in reproducible analytical pipelines (RAP) at NHS Digital.

    HTML 81 14

  3. llm_tutorial Public

    Learn how to get started with LLMs using Langchain and Claude

    Jupyter Notebook 12 3

  4. output_checker Public

    Output Checker project

    Python 5